[CHORE] Migrate PRojection GHA/Workflow here for reusability by lowlydba · Pull Request #20 · OvertureMaps/workflows

John McCall (lowlydba) · 2026-04-14T19:51:58Z

The original location of this AI review PR Action was in a private repo, but that inherently hampers our ability to use it in other private repositories.

This will move it to the centralized place for shared workflows/actions.

overture-projection · 2026-04-14T19:52:30Z

Overture PRojection Review

This PR migrates the Overture PRojection GitHub Action/workflow into this repository for centralized, reusable access across Overture projects. It includes documentation, core scripts, and unit tests for utility functions.

✅ Checks Passed

- Clear, thorough documentation for setup and configuration in .github/actions/overture-projection/README.md.
- Added unit tests for utility functions in github.js and markdown.js, covering edge cases and expected behaviors.

🚩 Flags

Files not reviewed due to diff size: The following 17 files could not be reviewed because the PR is too large. Please break this PR into smaller, more focused pull requests to enable effective review:
- .github/actions/overture-projection/scripts/lib/context.js
- .github/actions/overture-projection/scripts/lib/diff.js
- .github/actions/overture-projection/scripts/lib/github.js
- .github/actions/overture-projection/scripts/lib/markdown.js
- .github/actions/overture-projection/scripts/lib/models.js
- .github/actions/overture-projection/scripts/lib/prompt.js
- .github/actions/overture-projection/scripts/lib/skills.js
- .github/actions/overture-projection/action.yml
- .github/actions/overture-projection/scripts/fetch-context.js
- .github/actions/overture-projection/scripts/fetch-diff.js
- .github/actions/overture-projection/scripts/lib/__tests__/context.test.js
- .github/actions/overture-projection/scripts/lib/__tests__/defaults.test.js
- .github/actions/overture-projection/scripts/lib/__tests__/diff.test.js
- .github/actions/overture-projection/scripts/lib/__tests__/models.test.js
- .github/actions/overture-projection/scripts/lib/__tests__/prompt.test.js
- .github/actions/overture-projection/scripts/lib/__tests__/skills.test.js
- .github/actions/overture-projection/scripts/lib/defaults.js
Linked issue missing: There is no linked issue. Per Overture process, all work should have an associated GitHub issue for visibility and tracking.

❓ Open Questions

None based on the reviewed files.

Next steps:

- Please split this PR into smaller units (e.g., core action logic, tests, documentation) so each can be reviewed thoroughly.
- Open and link a tracking issue for this migration, per org guidelines.

No bugs, security issues, or logic errors were found in the reviewed files. Full review of the action's implementation and additional tests will be possible once the PR is split and resubmitted.

Copilot

Pull request overview

Migrates the “Overture PRojection” AI PR review workflow/action into this centralized repo so it can be reused across other repositories, including private repos, with a composite action + reusable workflow entrypoint.

Changes:

- Added a reusable workflow (workflow_call) and local pull_request trigger to run PRojection from this repo.
- Added the composite action implementation (scripts + libs) for skill loading, diff fetching, skill selection, context fetching, and review posting.
- Added Node node:test unit tests covering the helper libraries (diff budgeting, prompt building, models abstraction, etc.).

Reviewed changes

Copilot reviewed 24 out of 24 changed files in this pull request and generated 14 comments.

| File | Description |
| --- | --- |
| `.github/workflows/overture-projection.yml` | New reusable + PR-triggered workflow wiring PRojection from this repo. |
| `.github/actions/overture-projection/action.yml` | Composite action definition orchestrating all PRojection steps. |
| `.github/actions/overture-projection/README.md` | Usage/docs for the composite action and its inputs. |
| `.github/actions/overture-projection/scripts/load-skills.js` | Loads skills from checked-out `omf-devex` skills directory. |
| `.github/actions/overture-projection/scripts/fetch-diff.js` | Fetches PR metadata/diffs and applies ignore + fetch ceiling budgeting. |
| `.github/actions/overture-projection/scripts/select-skills.js` | Calls a fast model to select which optional skills apply. |
| `.github/actions/overture-projection/scripts/fetch-context.js` | Fetches and truncates context files for selected skills only. |
| `.github/actions/overture-projection/scripts/post-review.js` | Builds prompts, dynamically budgets diffs, calls model, posts/upserts comment. |
| `.github/actions/overture-projection/scripts/lib/skills.js` | Frontmatter parsing + `pr-reviewer` surface filtering utilities. |
| `.github/actions/overture-projection/scripts/lib/prompt.js` | Pure prompt assembly utilities (system/user/preamble) + helpers. |
| `.github/actions/overture-projection/scripts/lib/models.js` | Provider abstraction for GitHub Models + Anthropic APIs. |
| `.github/actions/overture-projection/scripts/lib/markdown.js` | Markdown compression used for skills/context files. |
| `.github/actions/overture-projection/scripts/lib/github.js` | Repo/PR-number resolution helpers for cross-repo runs. |
| `.github/actions/overture-projection/scripts/lib/diff.js` | Ignore-pattern compilation + diff/context budgeting utilities. |
| `.github/actions/overture-projection/scripts/lib/defaults.js` | Centralized provider defaults for models/token budgets. |
| `.github/actions/overture-projection/scripts/lib/context.js` | Context ref parsing + per-file truncation/budgeting utilities. |
| `.github/actions/overture-projection/scripts/lib/__tests__/skills.test.js` | Unit tests for frontmatter parsing + skill filtering. |
| `.github/actions/overture-projection/scripts/lib/__tests__/prompt.test.js` | Unit tests for system/user prompt construction + flags. |
| `.github/actions/overture-projection/scripts/lib/__tests__/models.test.js` | Unit tests for provider normalization + rate limit logging. |
| `.github/actions/overture-projection/scripts/lib/__tests__/markdown.test.js` | Unit tests for markdown compression behavior. |
| `.github/actions/overture-projection/scripts/lib/__tests__/github.test.js` | Unit tests for repo/PR-number resolution. |
| `.github/actions/overture-projection/scripts/lib/__tests__/diff.test.js` | Unit tests for ignore patterns + budgeting behavior. |
| `.github/actions/overture-projection/scripts/lib/__tests__/defaults.test.js` | Unit tests for default values and provider defaults lookup. |
| `.github/actions/overture-projection/scripts/lib/__tests__/context.test.js` | Unit tests for context ref parsing + truncation + grouping. |

Copilot · 2026-04-14T19:57:23Z

+ *   COMMENT_MODE    — 'update' | 'new' (default 'new')
+ *   COMMENT_TAG     — HTML marker for update-mode comment identification
+ *   DRY_RUN         — 'true' to skip posting and print instead
+ *   PR_NUMBER       — PR number override (falls back to event payload)
+ *   REPOSITORY      — target repo in owner/repo format (falls back to context)
+ *   RUNNER_TEMP     — standard Actions temp dir for inter-step artefacts


The env-var docs state COMMENT_MODE defaults to new, and the implementation also falls back to 'new', but the composite action input comment-mode defaults to update. Please align the documented/implemented default here with the action default (or make the divergence explicit).

Suggested change

- *   COMMENT_MODE    — 'update' | 'new' (default 'new')
- *   COMMENT_TAG     — HTML marker for update-mode comment identification
- *   DRY_RUN         — 'true' to skip posting and print instead
- *   PR_NUMBER       — PR number override (falls back to event payload)
- *   REPOSITORY      — target repo in owner/repo format (falls back to context)
- *   RUNNER_TEMP     — standard Actions temp dir for inter-step artefacts
+ *   COMMENT_MODE    — 'update' | 'new' (default 'update')
+ *   COMMENT_TAG     — HTML marker for update-mode comment identification
+ *   DRY_RUN         — 'true' to skip posting and print instead
+ *   PR_NUMBER       — PR number override (falls back to event payload)
+ *   REPOSITORY      — target repo in owner/repo format (falls back to context)
+ *   RUNNER_TEMP     — standard Actions temp dir for inter-step artefacts
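
One way to keep the script fallback and the action input default from drifting apart is to route both through a single constant. The sketch below is illustrative only: `DEFAULT_COMMENT_MODE` and `resolveCommentMode` are hypothetical names, and `'update'` is chosen because the comment above reports it as the composite action's default.

```javascript
// Single source of truth for the fallback. 'update' mirrors the action input
// default reported in the review comment; the constant name is hypothetical.
const DEFAULT_COMMENT_MODE = 'update';
const COMMENT_MODES = new Set(['update', 'new']);

// Hypothetical helper: normalise COMMENT_MODE and fall back when the value
// is unset, blank, or not one of the two documented modes.
function resolveCommentMode(envValue) {
  const mode = String(envValue ?? '').trim().toLowerCase();
  return COMMENT_MODES.has(mode) ? mode : DEFAULT_COMMENT_MODE;
}

console.log(resolveCommentMode(process.env.COMMENT_MODE));
```

With a helper like this, the JSDoc header only needs to cite the constant, so the documented default cannot disagree with the implemented one.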

Copilot · 2026-04-14T19:57:23Z

+  // ── Post review ──────────────────────────────────────────────────────────────
+
+  const { owner, repo } = resolveRepo(process.env.REPOSITORY, context.repo);
+  const prNumber        = resolvePrNumber(context.payload.pull_request?.number, process.env.PR_NUMBER);


resolvePrNumber(...) can return null; in that case the subsequent GitHub API calls will be made with issue_number/pull_number: null and fail with a less clear error. Add an explicit guard (setFailed + return) when prNumber is null, similar to fetch-diff.js.

Suggested change

-  const prNumber        = resolvePrNumber(context.payload.pull_request?.number, process.env.PR_NUMBER);
+  const prNumber        = resolvePrNumber(context.payload.pull_request?.number, process.env.PR_NUMBER);
+  if (prNumber === null) {
+    core.setFailed('Could not resolve PR number from context or PR_NUMBER');
+    return;
+  }
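
The guard can be exercised in isolation with a small sketch. `resolvePrNumberSketch` is a hypothetical stand-in for the real `resolvePrNumber` (its implementation is not shown in this thread); it assumes the `PR_NUMBER` override wins over the event payload, as the env-var docs describe, and that `null` signals failure. `fail` stands in for `core.setFailed` so the snippet runs without `@actions/core`.

```javascript
// Hypothetical stand-in for resolvePrNumber; the real helper in lib/github.js
// may differ. Assumption: PR_NUMBER overrides the payload, null means unresolved.
function resolvePrNumberSketch(payloadNumber, envValue) {
  for (const candidate of [envValue, payloadNumber]) {
    const n = Number(String(candidate ?? '').trim());
    if (Number.isInteger(n) && n > 0) return n;
  }
  return null;
}

// Fail fast with a clear message instead of sending pull_number: null to the API.
function requirePrNumber(payloadNumber, envValue, fail) {
  const prNumber = resolvePrNumberSketch(payloadNumber, envValue);
  if (prNumber === null) {
    fail('Could not resolve PR number from context or PR_NUMBER');
  }
  return prNumber;
}

requirePrNumber(undefined, '', (message) => console.error(message));
```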

Copilot · 2026-04-14T19:57:23Z

+ *
+ * Env vars consumed:
+ *   MAX_FILES       — max number of changed files to include (default 20)
+  *   MAX_DIFF_CHARS  — fetch ceiling: max chars of patch content to pull from GitHub API (default 100000)


JSDoc formatting: the MAX_DIFF_CHARS env-var line has an extra leading space before the `*` (two spaces instead of one), which misaligns it with the rest of the comment block. Fixing it keeps the header readable and consistent.

Suggested change

-  *   MAX_DIFF_CHARS  — fetch ceiling: max chars of patch content to pull from GitHub API (default 100000)
+ *   MAX_DIFF_CHARS  — fetch ceiling: max chars of patch content to pull from GitHub API (default 100000)

Copilot · 2026-04-14T19:57:24Z

+    # -- Step 0a: Mask secrets so they never appear in logs --------------------
+    - name: Mask secrets
+      shell: bash
+      run: |
+        if [ -n "${{ inputs['app-private-key'] }}" ]; then
+          echo "::add-mask::${{ inputs['app-private-key'] }}"
+        fi
+        if [ -n "${{ inputs['anthropic-api-key'] }}" ]; then
+          echo "::add-mask::${{ inputs['anthropic-api-key'] }}"
+        fi
+        if [ -n "${{ inputs['github-token'] }}" ]; then
+          echo "::add-mask::${{ inputs['github-token'] }}"
+        fi


::add-mask:: is being called once with the full app-private-key, which is typically a multi-line PEM. GitHub masking is line-oriented, so this may not reliably mask the key if any line is logged independently. Consider splitting on newlines and calling ::add-mask:: for each line (and/or masking a base64-encoded form) to ensure robust redaction.
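
A minimal sketch of the per-line approach, written as a Node helper rather than bash. The helper name `maskCommandsFor` and the decision to skip blank lines are assumptions made for illustration; the key shown is a placeholder, not real PEM content.

```javascript
// Build one ::add-mask:: workflow command per non-empty line of a secret,
// so each line of a multi-line value is redacted on its own.
// Hypothetical helper; skipping blank lines is an assumption of this sketch.
function maskCommandsFor(secret) {
  if (!secret) return [];
  return secret
    .split(/\r?\n/)
    .filter((line) => line.trim() !== '')
    .map((line) => `::add-mask::${line}`);
}

// Placeholder value standing in for a multi-line private key.
const fakePem = '-----BEGIN FAKE KEY-----\nabc123\ndef456\n-----END FAKE KEY-----\n';
for (const command of maskCommandsFor(fakePem)) {
  console.log(command);
}
```

Inside a JavaScript step, calling `core.setSecret(line)` from `@actions/core` for each line registers the same masks without printing the commands by hand.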

Copilot · 2026-04-14T19:57:25Z

+  it('sends Bearer auth to GitHub Models', async () => {
+    let calledHeaders;
+    globalThis.fetch = async (_url, opts) => { calledHeaders = JSON.parse(opts.body); return fakeFetchResp(200, GH_SUCCESS); };
+    // headers are in opts, body has model — check Authorization separately
+    globalThis.fetch = async (_url, opts) => {
+      calledHeaders = opts.headers;
+      return fakeFetchResp(200, GH_SUCCESS);
+    };


This test assigns globalThis.fetch twice; the first assignment (parsing opts.body) is immediately overwritten and never asserted. Removing the dead assignment will make the intent clearer and avoid confusion when maintaining the test.
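
A single recording stub could cover both the headers and the body, which removes the need for two assignments. The sketch below is an assumption about how such a stub might look: `makeRecordingFetch`, the URL, and the model name are hypothetical, and the direct `globalThis.fetch(...)` call stands in for the code under test.

```javascript
// Build a fetch stand-in that records every call it receives.
// Hypothetical helper; one stub captures URL, headers and parsed body together.
function makeRecordingFetch(response) {
  const calls = [];
  const fetchStub = async (url, opts = {}) => {
    calls.push({
      url,
      headers: opts.headers ?? {},
      body: opts.body ? JSON.parse(opts.body) : null,
    });
    return response;
  };
  return { fetchStub, calls };
}

// Swap the stub in, invoke it the way code under test might, then restore.
const realFetch = globalThis.fetch;
const recorder = makeRecordingFetch({ ok: true, status: 200 });
globalThis.fetch = recorder.fetchStub;
try {
  globalThis.fetch('https://example.invalid/chat', { // hypothetical endpoint
    headers: { Authorization: 'Bearer test-token' },
    body: JSON.stringify({ model: 'example-model' }),
  });
} finally {
  globalThis.fetch = realFetch; // always restore the original
}

console.log(recorder.calls[0].headers.Authorization);
```

Restoring the original in `finally` keeps one test's stub from leaking into the next.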

Copilot · 2026-04-14T19:57:26Z

+          always-skills: "pr-review"
+          repository: ${{ inputs.repository || github.event.repository.full_name }}
+          pr-number: ${{ inputs.pr_number || github.event.pull_request.number }}
+          devex-ref: ${{ inputs.devex_ref || 'main' }}
+          model: ${{ inputs.model }}
+          selection-model: ${{ inputs.selection_model }}
+          model-provider: ${{ inputs.model_provider || 'github-models' }}
+          max-input-tokens: ${{ inputs.max_input_tokens }}
+          max-output-tokens: ${{ inputs.max_output_tokens }}
+          dry-run: ${{ inputs.dry_run == true || inputs.dry_run == 'true' || env.ACT == 'true' }}


In pull_request-triggered runs, the inputs context is not available, but this job uses ${{ inputs.* }} in with:. That can cause the workflow to error at evaluation time. Consider deriving repository/pr-number from github.* only for pull_request events (or split into a pull_request wrapper workflow that calls a workflow_call-only reusable workflow).

Copilot · 2026-04-14T19:57:26Z

+# Overture PRojection — AI PR review workflow.
+#
+# Triggers automatically on pull_request events, or manually via workflow_dispatch
+# to review any open PR by number.
+#
+# ── Local testing with act ────────────────────────────────────────────────────
+#
+# Use workflow_dispatch so --input flags are honoured. Do NOT pass -e with a
+# pull_request event payload — act ignores --input when the event file contains
+# a different event type. All PR targeting is done via --input pr_number and
+# --input repository instead.
+#
+# dry-run is set automatically (ACT=true env var is set by act).
+#
+#   GitHub Models — PowerShell:
+#     act workflow_dispatch `
+#       -s GITHUB_TOKEN=$(gh auth token) `
+#       -P ubuntu-slim=catthehacker/ubuntu:act-latest `
+#       --input pr_number=<N> `
+#       --input repository=OvertureMaps/<repo>
+#
+#   GitHub Models — bash/zsh:
+#     act workflow_dispatch \
+#       -s GITHUB_TOKEN=$(gh auth token) \
+#       -P ubuntu-slim=catthehacker/ubuntu:act-latest \
+#       --input pr_number=<N> \
+#       --input repository=OvertureMaps/<repo>
+#
+#   Anthropic — PowerShell:
+#     act workflow_dispatch `
+#       -s GITHUB_TOKEN=$(gh auth token) `
+#       -s ANTHROPIC_API_KEY=sk-ant-... `
+#       -P ubuntu-slim=catthehacker/ubuntu:act-latest `
+#       --input pr_number=<N> `
+#       --input repository=OvertureMaps/<repo> `
+#       --input model_provider=anthropic
+#       # model, max_input_tokens, max_output_tokens, selection_model all default
+#       # automatically for anthropic (claude-opus-4-6, 190000, 4096, claude-haiku-4-6)
+#
+#   Anthropic — bash/zsh:
+#     act workflow_dispatch \
+#       -s GITHUB_TOKEN=$(gh auth token) \
+#       -s ANTHROPIC_API_KEY=sk-ant-... \
+#       -P ubuntu-slim=catthehacker/ubuntu:act-latest \
+#       --input pr_number=<N> \
+#       --input repository=OvertureMaps/<repo> \
+#       --input model_provider=anthropic
+#       # model, max_input_tokens, max_output_tokens, selection_model all default
+#       # automatically for anthropic (claude-opus-4-6, 190000, 4096, claude-haiku-4-6)
+#
+# ── Manual trigger (no act) ───────────────────────────────────────────────────
+#
+#   gh workflow run overture-projection.yml \
+#     -f pr_number=<N> \
+#     -f dry_run=true
+#
+# ── Required permissions ──────────────────────────────────────────────────────
+#
+# Token scopes: repo (read), pull-requests (write), issues (read), models (read)
+# Org secret OVERTURE_PROJECTION_APP_PEM used for cross-repo context file reads.
+#


The header documentation block is duplicated (the entire “Local testing with act / Required permissions” section appears twice). This makes future edits error-prone; please delete the repeated block so there’s a single source of truth.

Suggested change

# Overture PRojection — AI PR review workflow.
#
# Triggers automatically on pull_request events, or manually via workflow_dispatch
# to review any open PR by number.
#
# ── Local testing with act ────────────────────────────────────────────────────
#
# Use workflow_dispatch so --input flags are honoured. Do NOT pass -e with a
# pull_request event payload — act ignores --input when the event file contains
# a different event type. All PR targeting is done via --input pr_number and
# --input repository instead.
#
# dry-run is set automatically (ACT=true env var is set by act).
#
#   GitHub Models — PowerShell:
#     act workflow_dispatch `
#       -s GITHUB_TOKEN=$(gh auth token) `
#       -P ubuntu-slim=catthehacker/ubuntu:act-latest `
#       --input pr_number=<N> `
#       --input repository=OvertureMaps/<repo>
#
#   GitHub Models — bash/zsh:
#     act workflow_dispatch \
#       -s GITHUB_TOKEN=$(gh auth token) \
#       -P ubuntu-slim=catthehacker/ubuntu:act-latest \
#       --input pr_number=<N> \
#       --input repository=OvertureMaps/<repo>
#
#   Anthropic — PowerShell:
#     act workflow_dispatch `
#       -s GITHUB_TOKEN=$(gh auth token) `
#       -s ANTHROPIC_API_KEY=sk-ant-... `
#       -P ubuntu-slim=catthehacker/ubuntu:act-latest `
#       --input pr_number=<N> `
#       --input repository=OvertureMaps/<repo> `
#       --input model_provider=anthropic
#       # model, max_input_tokens, max_output_tokens, selection_model all default
#       # automatically for anthropic (claude-opus-4-6, 190000, 4096, claude-haiku-4-6)
#
#   Anthropic — bash/zsh:
#     act workflow_dispatch \
#       -s GITHUB_TOKEN=$(gh auth token) \
#       -s ANTHROPIC_API_KEY=sk-ant-... \
#       -P ubuntu-slim=catthehacker/ubuntu:act-latest \
#       --input pr_number=<N> \
#       --input repository=OvertureMaps/<repo> \
#       --input model_provider=anthropic
#       # model, max_input_tokens, max_output_tokens, selection_model all default
#       # automatically for anthropic (claude-opus-4-6, 190000, 4096, claude-haiku-4-6)
#
# ── Manual trigger (no act) ───────────────────────────────────────────────────
#
#   gh workflow run overture-projection.yml \
#     -f pr_number=<N> \
#     -f dry_run=true
#
# ── Required permissions ──────────────────────────────────────────────────────
#
# Token scopes: repo (read), pull-requests (write), issues (read), models (read)
# Org secret OVERTURE_PROJECTION_APP_PEM used for cross-repo context file reads.
#

Copilot · 2026-04-14T19:57:26Z

+1. **Load skills** — sparse-checkouts `omf-devex/skills/`, parses frontmatter, filters to `pr-reviewer` surface. Raw content is stored; nothing is fetched yet.
+2. **Fetch PR diff** — title, body, branch refs, closing issues (GraphQL), and changed file patches up to `max-diff-chars`.
+3. **Select skills** — a fast/cheap model reads skill descriptions and changed file paths, picks which optional skills apply, and logs its reasoning. `always-skills` bypass this step entirely.
+4. **Fetch context files** — only for selected skills; fetched in parallel via the GitHub App token, compressed, and capped at 5,000 chars each.


Step 4 says context files are “capped at 5,000 chars each”, but fetch-context.js computes a dynamic per-file cap (10% of max-input-tokens) and only applies a hard cap when max-context-file-chars is set. Please update the README to reflect the actual behavior (dynamic limit + optional override).

Suggested change

-4. **Fetch context files** — only for selected skills; fetched in parallel via the GitHub App token, compressed, and capped at 5,000 chars each.
+4. **Fetch context files** — only for selected skills; fetched in parallel via the GitHub App token, compressed, and capped per file using a dynamic limit (10% of `max-input-tokens`) unless `max-context-file-chars` is set to override it with a hard cap.
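
The dynamic limit described above can be sketched as follows. This is not the code from `fetch-context.js`: `perFileCharCap` and `truncateToCap` are hypothetical names, and `CHARS_PER_TOKEN = 4` is an assumed conversion factor, since the thread does not say how tokens are translated into characters.

```javascript
// Assumed tokens-to-characters conversion; the real script may use another factor.
const CHARS_PER_TOKEN = 4;

// Hypothetical helper: per-file character cap for context files.
function perFileCharCap(maxInputTokens, maxContextFileChars) {
  // An explicit max-context-file-chars value acts as a hard cap.
  if (Number.isInteger(maxContextFileChars) && maxContextFileChars > 0) {
    return maxContextFileChars;
  }
  // Otherwise each file gets 10% of the input budget (integer arithmetic).
  return Math.floor((maxInputTokens * CHARS_PER_TOKEN) / 10);
}

// Cut a context file down to the computed cap.
function truncateToCap(text, cap) {
  return text.length <= cap ? text : text.slice(0, cap);
}

console.log(perFileCharCap(190000, undefined));
console.log(perFileCharCap(190000, 5000));
```

Under these assumptions a 190,000-token budget yields a far larger per-file allowance than the fixed 5,000 characters the README currently states, which is why the wording matters.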

Copilot · 2026-04-14T19:57:26Z

+  if (!fs.existsSync(skillsDir)) {
+    core.warning(`⚠️ Skills directory not found: ${skillsDir} — no skills will be loaded.`);


fs.existsSync(skillsDir) will throw if SKILLS_DIR is unset/empty (it expects a string/URL/Buffer). Add a guard for a missing SKILLS_DIR (and treat it the same as “directory not found”) so the step fails gracefully with a clear message.

Suggested change

-  if (!fs.existsSync(skillsDir)) {
-    core.warning(`⚠️ Skills directory not found: ${skillsDir} — no skills will be loaded.`);
+  if (!skillsDir || !skillsDir.trim() || !fs.existsSync(skillsDir)) {
+    const skillsDirMessage = skillsDir && skillsDir.trim() ? skillsDir : '(SKILLS_DIR not set)';
+    core.warning(`⚠️ Skills directory not found: ${skillsDirMessage} — no skills will be loaded.`);
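
The same check could be factored into a small helper so that the unset, blank, and missing-on-disk cases share one code path. `resolveSkillsDir` is a hypothetical name used only for this sketch.

```javascript
const fs = require('node:fs');

// Hypothetical helper: return a usable directory path, or null when the value
// is unset, blank, or does not exist on disk.
function resolveSkillsDir(rawValue) {
  const dir = typeof rawValue === 'string' ? rawValue.trim() : '';
  if (dir === '') return null;
  return fs.existsSync(dir) ? dir : null;
}

const skillsDir = resolveSkillsDir(process.env.SKILLS_DIR);
if (skillsDir === null) {
  // In the action this would be core.warning(...); console.warn keeps the sketch standalone.
  console.warn('Skills directory not found or SKILLS_DIR not set; no skills will be loaded.');
}
```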