Change waitForProcessing to use exponential backoff by robertbrignull · Pull Request #3937 · github/codeql-action

robertbrignull · 2026-05-28T09:54:36Z

The goal is to change waitForProcessing to backoff a bit more aggressively and reduce the maximum number of requests that it makes. During periods where analysis processing is being delayed, we're seeing a large number of extra GET requests that I believe are coming from here. See linked internal issue for more info.

In this PR I'm suggesting changing it to exponential backoff that'll make a maximum of 5 requests, instead of the current behaviour which can make up to 24 requests. I also add a wait before the first poll because processing will always take at least a couple of seconds so it's unlikely that an immediate poll will get anything. Let me know if you think this is reasonable. We can also tweak values to keep the maximum timeout to 2 minutes if that would be better.

Risk assessment

For internal use only. Please select the risk level of this change:

Low risk: Hopefully can be safely tested before merge (either by tests or on a test repository)

Which use cases does this change impact?

Workflow types:

Advanced setup - Impacts users who have custom CodeQL workflows.
Managed - Impacts users with dynamic workflows (Default Setup, Code Quality, ...).

Products:

Code Scanning - The changes impact analyses when analysis-kinds: code-scanning.
Code Quality - The changes impact analyses when analysis-kinds: code-quality.
Other first-party - The changes impact other first-party analyses.
Third-party analyses - The changes affect the upload-sarif action.

Environments:

Dotcom - Impacts CodeQL workflows on github.com and/or GitHub Enterprise Cloud with Data Residency.
GHES - Impacts CodeQL workflows on GitHub Enterprise Server.

How did/will you validate this change?

Test repository - This change will be tested on a test repository before merging.
Unit tests - I am depending on unit test coverage (i.e. tests in .test.ts files).
End-to-end tests - I am depending on PR checks (i.e. tests in pr-checks).

If something goes wrong after this change is released, what are the mitigation and rollback strategies?

Rollback - Change can only be disabled by rolling back the release or releasing a new version with a fix.

How will you know if something goes wrong after this change is released?

Telemetry - I rely on existing telemetry or have made changes to the telemetry.
- Dashboards - I will watch relevant dashboards for issues after the release. Consider whether this requires this change to be released at a particular time rather than as part of a regular release.
- Alerts - New or existing monitors will trip if something goes wrong with this change.

Are there any special considerations for merging or releasing this change?

No special considerations - This change can be merged at any time.

Merge / deployment checklist

Confirm this change is backwards compatible with existing workflows.
Consider adding a changelog entry for this change.
Confirm the readme and docs have been updated if necessary.

Copilot

Pull request overview

This PR changes SARIF processing polling to use exponential backoff, reducing request volume during delayed analysis processing.

Changes:

Replaces fixed 5-second polling with exponential backoff.
Adds an initial delay before the first processing-status request.
Limits processing-status polling to 5 intended attempts.

Show a summary per file

File	Description
`src/upload-lib.ts`	Updates `waitForProcessing` polling constants and loop behavior to use exponential backoff.

Copilot's findings

Files reviewed: 1/1 changed files
Comments generated: 2

robertbrignull · 2026-05-28T10:15:32Z

+      statusCheckBackoff *= STATUS_CHECK_BACKOFF_MULTIPLIER;
+      await util.delay(statusCheckBackoff, { allowProcessExit: false });


I don't think this comment is quite right about the worst case calculations, but it's still a good idea and it would be cleaner to put the timeout check at the end of the loop, instead of doing an extra loop iteration just to break out early.

+    for (
+      let statusCheckingCount = 0;
+      statusCheckingCount <= STATUS_CHECK_MAX_TRIES; // Aborts on the last loop iteration
+      statusCheckingCount++
+    ) {
+      if (statusCheckingCount === STATUS_CHECK_MAX_TRIES) {


mbg

Thanks for preparing this! The change makes to me as-is and there's nothing blocking here.

We may want to review how well chosen the backoff pattern is, but that's something we can adjust later if needed.

Just one comment about the (potential) delay this imposes on the unit tests.

Also agree with Copilot that it would be nice to have a test that checks that this works as expected.

mbg · 2026-05-28T13:32:42Z

+    let statusCheckBackoff = STATUS_CHECK_INITIAL_BACKOFF_MILLISECONDS;
+    await util.delay(statusCheckBackoff, { allowProcessExit: false });


Minor: I think this also adds a STATUS_CHECK_INITIAL_BACKOFF_MILLISECONDS long delay to tests where we can get a "response" from a stubbed client.request faster. If that is the case, we could either make this initial wait conditional on e.g. the NODE_ENV (like in #3930) or stub util.delay in relevant tests to not wait.

Good point. I think skipping this initial wait during tests makes sense.

Looking more closely, the code from #3930 has now changed and there aren't any other references to NODE_ENV is the codebase. I've changed this PR to use isInTestMode(). Do you think that's the right way to do things?

robertbrignull · 2026-06-01T11:03:12Z

We may want to review how well chosen the backoff pattern is, but that's something we can adjust later if needed.

I don't have any good data on what a good backoff pattern would be. This was just the idea that came to me first. But yes, it's easy to adjust it later on.

henrymercer

Echoing Michael's review, this looks good. One comment: I think it would be better to use NODE_ENV as you originally implemented vs isInTestMode — the latter is mainly intended for disabling uploading SARIFs to code scanning, and I don't think it's true when we run the unit tests locally or in CI. We can add NODE_ENV to queries/default-setup-environment-variables.ql to suppress the warning, sorry about the noise from that check.

robertbrignull · 2026-06-04T09:25:29Z

I think it would be better to use NODE_ENV as you originally implemented vs isInTestMode — the latter is mainly intended for disabling uploading SARIFs to code scanning, and I don't think it's true when we run the unit tests locally or in CI.

Thanks for explaining. I'll just remove my last commit.

I looked earlier for the test mode env var and saw lots of cases of CODEQL_ACTION_TEST_MODE: true in workflow files, but you're right it not in the pr-checks workflow file and wouldn't be set locally either.

robertbrignull · 2026-06-04T10:06:33Z

I also tested this with an advanced setup workflow in a test repo and everything worked as expected.

robertbrignull requested a review from a team as a code owner May 28, 2026 09:54

Copilot AI review requested due to automatic review settings May 28, 2026 09:54

Copilot started reviewing on behalf of robertbrignull May 28, 2026 09:54 View session

github-actions Bot added the size/XS Should be very easy to review label May 28, 2026

Copilot AI reviewed May 28, 2026

View reviewed changes

github-actions Bot added size/S Should be easy to review and removed size/XS Should be very easy to review labels May 28, 2026

Change waitForProcessing to use exponential backoff

dfc1411

robertbrignull force-pushed the robertbrignull/waitForProcessing_backoff branch from e930918 to dfc1411 Compare May 28, 2026 10:15

mbg previously approved these changes May 28, 2026

View reviewed changes

Only do initial wait when not running tests

d40e417

robertbrignull dismissed mbg’s stale review via d40e417 June 1, 2026 15:43

github-advanced-security AI found potential problems Jun 1, 2026

View reviewed changes

Comment thread src/upload-lib.ts Dismissed

henrymercer reviewed Jun 2, 2026

View reviewed changes

robertbrignull force-pushed the robertbrignull/waitForProcessing_backoff branch from b37e12b to d40e417 Compare June 4, 2026 09:24

Merge branch 'main' into robertbrignull/waitForProcessing_backoff

6047ac7

henrymercer approved these changes Jun 4, 2026

View reviewed changes

robertbrignull added this pull request to the merge queue Jun 4, 2026

Merged via the queue into main with commit cb1a588 Jun 4, 2026
232 of 244 checks passed

robertbrignull deleted the robertbrignull/waitForProcessing_backoff branch June 4, 2026 10:19

This was referenced Jun 4, 2026

Merge main into releases/v4 #3949

Merged

Merge releases/v4 into releases/v3 #3952

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Change waitForProcessing to use exponential backoff#3937

Change waitForProcessing to use exponential backoff#3937
robertbrignull merged 3 commits into
mainfrom
robertbrignull/waitForProcessing_backoff

robertbrignull commented May 28, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

robertbrignull May 28, 2026 •

edited

Loading

Uh oh!

mbg left a comment

Uh oh!

mbg May 28, 2026

Uh oh!

robertbrignull Jun 1, 2026

Uh oh!

robertbrignull Jun 2, 2026

Uh oh!

robertbrignull commented Jun 1, 2026

Uh oh!

Uh oh!

henrymercer left a comment •

edited

Loading

Uh oh!

robertbrignull commented Jun 4, 2026

Uh oh!

robertbrignull commented Jun 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

		statusCheckBackoff *= STATUS_CHECK_BACKOFF_MULTIPLIER;
		await util.delay(statusCheckBackoff, { allowProcessExit: false });

		let statusCheckBackoff = STATUS_CHECK_INITIAL_BACKOFF_MILLISECONDS;
		await util.delay(statusCheckBackoff, { allowProcessExit: false });

Uh oh!

Conversation

robertbrignull commented May 28, 2026

Risk assessment

Which use cases does this change impact?

How did/will you validate this change?

If something goes wrong after this change is released, what are the mitigation and rollback strategies?

How will you know if something goes wrong after this change is released?

Are there any special considerations for merging or releasing this change?

Merge / deployment checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Copilot's findings

Uh oh!

robertbrignull May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mbg left a comment

Choose a reason for hiding this comment

Uh oh!

mbg May 28, 2026

Choose a reason for hiding this comment

Uh oh!

robertbrignull Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

robertbrignull Jun 2, 2026

Choose a reason for hiding this comment

Uh oh!

robertbrignull commented Jun 1, 2026

Uh oh!

Uh oh!

henrymercer left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

robertbrignull commented Jun 4, 2026

Uh oh!

robertbrignull commented Jun 4, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

robertbrignull May 28, 2026 •

edited

Loading

henrymercer left a comment •

edited

Loading