Mitigate safe-output PR review rate-limit failures by Copilot · Pull Request #39797 · github/gh-aw

Copilot · 2026-06-17T13:55:19Z

Safe-output review finalization was failing with 403 API rate limit exceeded for installation after buffering review comments. The hot path was spending quota on pre/post review metadata fetches around pulls.createReview, so a brief quota stall could drop the entire review.

Review finalization
- make PR review state capture best-effort for transient/rate-limit failures instead of fatal
- preserve review submission when transient pulls.get / pulls.listReviews failures occur under rate pressure
- treat non-transient metadata capture failures as non-retryable errors
- only attach before_state / after_state metadata when it is actually available
Rate-limit recovery
- wrap pulls.createReview in bounded retry logic for rate-limit responses
- keep retries intentionally small for this path (one retry) with short delay and small jitter
- honor GitHub-provided wait signals through the existing retry machinery
Quota reduction in degraded mode
- defer before-state capture until after early exits (missing SHA, staged mode, empty review guard paths)
- fetch review state in bail-fast sequence (pulls.get first, then pulls.listReviews) to avoid spending an extra call when PR fetch fails
- skip the follow-up after-state fetch once the flow has already degraded due to metadata capture failure
- avoid spending another API call on observability when quota is already constrained
Tests
- add retry-exhaustion coverage for createReview rate-limit failures
- add own-PR fallback coverage where COMMENT retry path is itself rate-limited and retried

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot

Pull request overview

This PR improves reliability of safe-output PR review submission under GitHub App installation token rate limiting by making execution-state metadata capture best-effort and adding bounded retry behavior around pulls.createReview, while avoiding additional metadata calls once the flow has degraded.

Changes:

Add best-effort “before/after” review state capture that logs warnings and proceeds without metadata when rate-limited or otherwise failing.
Wrap pulls.createReview in bounded retry logic (single retry, capped delay) and reuse it across fallback review submission paths.
Add targeted tests for degraded-mode state capture and createReview rate-limit retry behavior.

Show a summary per file

File	Description
actions/setup/js/pr_review_buffer.cjs	Makes review state capture non-fatal, skips after-state fetch in degraded mode, and introduces `createReview` retry wrapper.
actions/setup/js/pr_review_buffer.test.cjs	Adds tests covering rate-limited before-state capture and one-retry behavior for `createReview`.

Copilot's findings

Tip

Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Files reviewed: 2/2 changed files
Comments generated: 3

    const { repo, repoParts, pullRequestNumber, pullRequest } = reviewContext;
-    const beforeState = await fetchPullRequestReviewState(github, repoParts, pullRequestNumber);
+    const beforeState = await fetchReviewStateBestEffort(repoParts, pullRequestNumber, "before");



+const REVIEW_RATE_LIMIT_RETRY_CONFIG = {
+  ...RATE_LIMIT_RETRY_CONFIG,
+  maxRetries: 1,
+  maxDelayMs: 60000,
+};


+  async function fetchReviewStateBestEffort(repoParts, pullRequestNumber, phase) {
+    try {
+      return await fetchPullRequestReviewState(github, repoParts, pullRequestNumber);
+    } catch (error) {
+      core.warning(`Failed to capture ${phase} PR review state for #${pullRequestNumber}: ${getErrorMessage(error)}. Continuing without execution-state metadata.`);


github-actions · 2026-06-17T14:19:36Z

✅ Design Decision Gate 🏗️ completed the design decision gate check.

No ADR enforcement needed: PR #39797 does not have the 'implementation' label (has_implementation_label=false) and has 0 new lines of code in business logic directories (well below the 100-line threshold).

github-actions · 2026-06-17T14:19:50Z

✅ PR Code Quality Reviewer completed the code quality review.

github-actions · 2026-06-17T14:19:56Z

✅ Test Quality Sentinel completed test quality analysis.

github-actions · 2026-06-17T14:19:57Z

🧠 Matt Pocock Skills Reviewer has completed the skills-based review. ✅

github-actions

Skills-Based Review 🧠

Applied /diagnose and /tdd — good targeted fix with solid primary tests; a few coverage gaps and one config-readability note worth addressing.

📋 Key Themes & Highlights

Key Themes

Test coverage gaps: retry/degraded-mode tests exist only for the main submission path; the own-PR fallback, body-only fallback, and exhausted-retry paths are untested with the new behaviour
Config transparency: REVIEW_RATE_LIMIT_RETRY_CONFIG silently inherits initialDelayMs and jitterMs from the base config; the actual first-retry sleep (~30 s without Retry-After) is not visible at the definition site

Positive Highlights

✅ Root cause correctly addressed — both the metadata-capture path and the createReview call are now rate-limit resilient
✅ fetchAfterStateIfAvailable short-circuiting after-state when before-state failed is an elegant quota-conserving design
✅ REVIEW_RATE_LIMIT_RETRY_CONFIG.maxRetries: 1 is the right call — no runaway waits that eat into the workflow timeout window
✅ attachExecutionState's falsy-guard on beforeState/afterState means null flows through cleanly; the test assertion toBeUndefined() is correct
✅ Both new tests use realistic error shapes (403 + rate-limit headers) matching what the GitHub API actually returns

🧠 Reviewed using Matt Pocock's skills by Matt Pocock Skills Reviewer

github-actions · 2026-06-17T14:32:01Z

+  ...RATE_LIMIT_RETRY_CONFIG,
+  maxRetries: 1,
+  maxDelayMs: 60000,
+};


[/diagnose] REVIEW_RATE_LIMIT_RETRY_CONFIG only overrides maxRetries and maxDelayMs — the inherited initialDelayMs: 15000 and jitterMs: 5000 are invisible here, but they determine the actual retry sleep duration.

💡 What this means in practice

Without a retry-after response header, the single retry will sleep:

min(initialDelayMs × backoffMultiplier, maxDelayMs) + jitter = min(15 000 × 2, 60 000) + rand(0, 4 999) = 30 000–34 999 ms (~30–35 s)

With retry-after: 1 (as in the test) it drops to 1 000–5 999 ms — but a live installation-token exhaustion often does not include that header.

This is almost certainly intentional (a 30 s single-stall budget seems fine), but nothing in the block makes that visible. Consider adding comments or explicit overrides:

const REVIEW_RATE_LIMIT_RETRY_CONFIG = { ...RATE_LIMIT_RETRY_CONFIG, maxRetries: 1, maxDelayMs: 60_000, // inherits initialDelayMs:15000 → first backoff ~30 s when no Retry-After header // inherits jitterMs:5000 → up to 5 s spread };

github-actions · 2026-06-17T14:32:02Z

+      } finally {
+        setTimeoutSpy.mockRestore();
+      }
+    });


[/tdd] The new retry test covers only the success-after-one-retry happy path. There is no test for the exhausted-retry case (initial call + retry both fail with rate-limit errors), which leaves the caller's error-handling path unverified.

💡 Why this matters

When maxRetries: 1 is exhausted, withRetry rethrows an enhanced error with extra metadata (retryable: true, suggestion: ...). The outer catch (error) block in submitReview calls getErrorMessage(error) and returns { success: false, error: ... }. That path looks correct, but a regression test would lock it in.

Suggested addition (sibling to this it block):

it("should return success:false when createReview rate-limit retry is exhausted", async () => { buffer.setReviewMetadata("Looks good", "COMMENT"); buffer.setReviewContext({ repo: "owner/repo", repoParts: { owner: "owner", repo: "repo" }, pullRequestNumber: 42, pullRequest: { head: { sha: "abc123" } } }); const rateLimitError = new Error("API rate limit exceeded for installation"); rateLimitError.response = { status: 403, headers: { "x-ratelimit-remaining": "0", "retry-after": "1" } }; // Both attempts fail mockGithub.rest.pulls.createReview.mockRejectedValue(rateLimitError); const result = await buffer.submitReview(); expect(result.success).toBe(false); expect(result.error).toBeDefined(); });

github-actions · 2026-06-17T14:32:02Z

-          const { data: review } = await github.rest.pulls.createReview(requestParams);
+          const { data: review } = await createReviewWithRetry(requestParams);
          await maybeSupersedeOlderReviews(review.id);
+          const afterState = await fetchAfterStateIfAvailable();


[/tdd] The own-PR COMMENT fallback (here) and the body-only fallback (line ~626) both call createReviewWithRetry and fetchAfterStateIfAvailable, but neither path has a rate-limit or degraded-state test.

💡 Why this is worth covering

The main submission path now has two regression tests. The fallback paths share the same helpers so they're unlikely to break independently — but they're only reached via specific error conditions, so a rate-limit during one of them would produce an interleaved error-handling flow that hasn't been exercised.

At minimum, one test confirming that a rate-limit on createReview inside the own-PR fallback still returns { success: true } (after retry) or { success: false } (after exhaustion) would round out the story.

github-actions · 2026-06-17T14:32:52Z

🧪 Test Quality Sentinel Report

✅ Test Quality Score: 100/100 — Excellent

Analyzed 2 new JavaScript test(s) in actions/setup/js/pr_review_buffer.test.cjs: 2 design tests, 0 implementation tests, 0 guideline violations.

📊 Metrics & Test Classification (2 tests analyzed)

Metric	Value
New/modified tests analyzed	2
✅ Design tests (behavioral contracts)	2 (100%)
⚠️ Implementation tests (low value)	0 (0%)
Tests with error/edge cases	2 (100%)
Duplicate test clusters	0
Test inflation detected	No (77 test lines / 47 production lines = 1.64x)
🚨 Coding-guideline violations	0

Test	File	Classification	Issues Detected
`should continue when before-state capture is rate-limited`	`actions/setup/js/pr_review_buffer.test.cjs:258`	✅ Design	—
`should retry createReview once on rate-limit errors`	`actions/setup/js/pr_review_buffer.test.cjs:299`	✅ Design	—

Go: 0 (*_test.go); JavaScript: 2 (*.test.cjs). No other languages detected.

Verdict

✅ Check passed. 0% implementation tests (threshold: 30%). Both new tests verify behavioral contracts — graceful degradation when execution-state capture is rate-limited, and correct retry-with-bounded-delay policy for createReview. Mock targets are all external I/O (github.rest.pulls.*, global.setTimeout) with legitimate justification.

🧪 Test quality analysis by Test Quality Sentinel · ◷

github-actions

✅ Test Quality Sentinel: 100/100. Test quality is excellent — 0% of new tests are implementation tests (threshold: 30%). Both tests cover behavioral contracts (rate-limit graceful degradation and retry-with-bounded-delay policy) with full error-path coverage.

github-actions

Request changes — four medium-severity issues collectively work against the quota-reduction goal this PR sets out to achieve.

💡 Blocking concerns

1. `fetchReviewStateBestEffort` catches all exceptions (new — inline comment, line 111)

The catch block is written as a blanket exception handler. Auth failures, wrong-repo targeting, 404s, and code bugs are silently swallowed and logged as warnings. Only rate-limit / transient failures should be suppressed; everything else must propagate so real breakage is visible.

2. `fetchPullRequestReviewState` fires both API requests concurrently under rate pressure (line 112)

When already rate-limited, the helper still launches both pulls.get and pulls.listReviews in parallel, burning quota on the second call even though the first will fail. A sequential / bail-fast pattern would match the stated quota-reduction goal.

3. `REVIEW_RATE_LIMIT_RETRY_CONFIG` inherits `initialDelayMs: 15000` / `jitterMs: 5000` (line 44)

With one retry and no Retry-After header, backoff produces a ~30–35 s sleep. Jitter is also added after maxDelayMs is applied, so the actual sleep can exceed maxDelayMs by up to ~5 s. If the intent is a short bounded wait for quota stalls, both values should be overridden explicitly.

4. `beforeState` captured before early-exit guards (line 276)

fetchReviewStateBestEffort is called before the staged-mode, empty-review, and missing-SHA guards. Two API calls are spent even when the review will be skipped entirely — the opposite of the PR's quota-reduction intent.

🔎 Code quality review by PR Code Quality Reviewer

github-actions · 2026-06-17T14:33:51Z

+  async function fetchReviewStateBestEffort(repoParts, pullRequestNumber, phase) {
+    try {
+      return await fetchPullRequestReviewState(github, repoParts, pullRequestNumber);
+    } catch (error) {


Catch-all hides real failures: this catches every exception — auth errors, wrong-repo targeting, permission regressions, unexpected API contract changes — and silently treats them all as null state. The PR intent is specifically quota-stall resilience, but the implementation is much wider.

💡 Suggested fix — narrow to rate-limit / transient errors only

} catch (error) { // Only suppress transient / rate-limit failures; let real errors propagate. if (!isTransientError(error)) throw error; core.warning(`Failed to capture ${phase} PR review state for #${pullRequestNumber}: ${getErrorMessage(error)}. Continuing without execution-state metadata.`); return null; }

Import isTransientError from error_recovery.cjs (already handles rate-limit patterns). A broken fetchPullRequestReviewState call signature, wrong credentials, or a 404 should surface as a real error — not disappear into a warning log line.

Done

pelikhan · 2026-06-17T14:47:51Z

@copilot run pr-finisher skill

github-actions · 2026-06-17T14:52:23Z

@copilot review all comments and summarize the remaining blockers.

Generated by 👨‍🍳 PR Sous Chef · ◷

Copilot · 2026-06-17T14:59:51Z

@copilot run pr-finisher skill

Executed the pr-finisher pass on this branch.

Local validation completed successfully (make fmt, make lint, make test-unit, make test, make recompile).
Reviewed PR status/threads and current CI snapshot.
No additional code changes were required in this pass.

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2026-06-17T15:17:09Z

@copilot run pr-finisher skill

Completed. I ran a full pr-finisher pass and pushed the fixes in commit 1f1bedb (plus preceding related commits in this run), including local validation and final review/security validation.

Copilot AI and others added 4 commits June 17, 2026 13:45

Add PR review rate-limit mitigation

c17afe5

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Mitigate PR review rate limits

6417a55

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Refine PR review retry coverage

db4a431

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Document PR review retry behavior

ba3d085

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI assigned Copilot and pelikhan Jun 17, 2026

Copilot AI changed the title ~~Mitigate safe-output PR review rate limits~~ Mitigate safe-output PR review rate-limit failures Jun 17, 2026

Copilot created this pull request from a session on behalf of pelikhan June 17, 2026 13:55 View session

Copilot finished work on behalf of pelikhan June 17, 2026 13:55

Copilot AI requested a review from pelikhan June 17, 2026 13:55

pelikhan marked this pull request as ready for review June 17, 2026 13:57

Copilot AI review requested due to automatic review settings June 17, 2026 13:57

Copilot started reviewing on behalf of pelikhan June 17, 2026 13:57 View session

Copilot AI reviewed Jun 17, 2026

View reviewed changes

github-actions Bot mentioned this pull request Jun 17, 2026

[Contribution Check Report] Contribution Check — 2026-06-17 #39716

Closed

github-actions Bot mentioned this pull request Jun 17, 2026

[aw] No-Op Runs #38951

Closed

github-actions Bot reviewed Jun 17, 2026

View reviewed changes

github-actions Bot approved these changes Jun 17, 2026

View reviewed changes

github-actions Bot previously requested changes Jun 17, 2026

View reviewed changes

Copilot started work on behalf of pelikhan June 17, 2026 14:48 View session

Copilot finished work on behalf of pelikhan June 17, 2026 15:00

Copilot AI and others added 6 commits June 17, 2026 15:03

Tighten PR review retry and metadata capture flow

85e4fa9

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Polish retry jitter and metadata fetch notes

8f00db2

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Clarify non-transient metadata capture failures

2cc4d6b

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Document retry tuning and non-recoverable errors

9d67dd1

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Tighten non-recoverable metadata error text

7bcd50e

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Refine retry test bound and non-transient wording

1f1bedb

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot finished work on behalf of pelikhan June 17, 2026 15:19

pelikhan merged commit 6cc236a into main Jun 17, 2026
21 checks passed

pelikhan deleted the copilot/investigate-403-rate-limit branch June 17, 2026 15:25

Conversation

Copilot AI commented Jun 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Copilot's findings

Uh oh!

github-actions Bot commented Jun 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jun 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Skills-Based Review 🧠

Key Themes

Positive Highlights

Uh oh!

github-actions Bot Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jun 17, 2026

🧪 Test Quality Sentinel Report

Verdict

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

1. fetchReviewStateBestEffort catches all exceptions (new — inline comment, line 111)

2. fetchPullRequestReviewState fires both API requests concurrently under rate pressure (line 112)

3. REVIEW_RATE_LIMIT_RETRY_CONFIG inherits initialDelayMs: 15000 / jitterMs: 5000 (line 44)

4. beforeState captured before early-exit guards (line 276)

Uh oh!

github-actions Bot Jun 17, 2026

Choose a reason for hiding this comment

Uh oh!

pelikhan commented Jun 17, 2026

Uh oh!

github-actions Bot commented Jun 17, 2026

Uh oh!

Copilot AI commented Jun 17, 2026

Uh oh!

Copilot AI commented Jun 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Copilot AI commented Jun 17, 2026 •

edited

Loading

github-actions Bot commented Jun 17, 2026 •

edited

Loading

github-actions Bot commented Jun 17, 2026 •

edited

Loading

github-actions Bot commented Jun 17, 2026 •

edited

Loading

github-actions Bot commented Jun 17, 2026 •

edited

Loading

1. `fetchReviewStateBestEffort` catches all exceptions (new — inline comment, line 111)

2. `fetchPullRequestReviewState` fires both API requests concurrently under rate pressure (line 112)

3. `REVIEW_RATE_LIMIT_RETRY_CONFIG` inherits `initialDelayMs: 15000` / `jitterMs: 5000` (line 44)

4. `beforeState` captured before early-exit guards (line 276)