Skip expensive debug_lines computation in AOT autograd cache by frgossen · Pull Request #179733 · pytorch/pytorch

frgossen · 2026-04-08T18:17:10Z

Stack from ghstack (oldest at bottom):

FxGraphCachePickler.debug_lines re-hashes every attribute of the cache
details object individually. This runs unconditionally even when debug
logging is disabled.

Gate the computation behind log.isEnabledFor(logging.DEBUG) so the
cost is only paid when someone is actively debugging cache key
differences.

On a vLLM Meta-Llama-3-70B-Instruct TP=4 benchmark, this reduces
cold compile time from 30.50 ± 0.50 s to 29.40 ± 0.90 s (1.04x)
and cache lookup time from 11.35 ± 0.45 ms to 6.25 ± 0.30 ms
(1.82x).

Authored with Claude.

cc @oulgen @jamesjwu @aorenste @anijain2305 @laithsakka @penguinwu @masnesral @coconutruben @aditvenk

FxGraphCachePickler.debug_lines re-hashes every attribute of the cache details object individually. This runs unconditionally even when debug logging is disabled. Gate the computation behind log.isEnabledFor(logging.DEBUG) so the cost is only paid when someone is actively debugging cache key differences. On a vLLM Meta-Llama-3-70B-Instruct TP=4 benchmark, this reduces cold compile time from 30.50 ± 0.50 s to 29.40 ± 0.90 s (1.04x) and cache lookup time from 11.35 ± 0.45 ms to 6.25 ± 0.30 ms (1.82x). Authored with Claude. [ghstack-poisoned]

pytorch-bot · 2026-04-08T18:17:15Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/179733

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Rolling out OSDC (ARC) runners on pull workflow for PyTorch trunk commits

✅ No Failures

As of commit d1011f0 with merge base acdb423 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

pytorch-bot · 2026-04-08T18:17:18Z

This PR needs a `release notes:` label

If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

pytorchmergebot · 2026-04-10T20:06:03Z

Starting merge as part of PR stack under #179910

frgossen · 2026-04-10T21:14:43Z

@pytorchbot merge

pytorchmergebot · 2026-04-10T21:17:00Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

_get_dict is called from save_config_portable on every AOT autograd cache key computation. 1. It called copy.deepcopy on every config value, but the vast majority are immutable types (bool, int, str, None) that don't need copying. Now only list/set/dict values are deep-copied. 2. It went through __getattr__ for every value, which includes deprecation warning checks, alias resolution, and other overhead. Now reads values directly from config entries. On a vLLM Meta-Llama-3-70B-Instruct TP=4 benchmark, this reduces cold compile time from 29.40 ± 0.90 s to 28.50 ± 0.40 s (1.03x) and cache lookup time from 6.25 ± 0.30 ms to 5.20 ± 0.45 ms (1.20x). Authored with Claude. Pull Request resolved: #179734 Approved by: https://github.com/aorenste ghstack dependencies: #179733

frgossen requested review from aorenste and bdhirsh as code owners April 8, 2026 18:17

pytorch-bot Bot added the ciflow/inductor label Apr 8, 2026

frgossen mentioned this pull request Apr 8, 2026

Speed up ConfigModule._get_dict by avoiding unnecessary work #179734

Closed

frgossen marked this pull request as draft April 8, 2026 18:36

frgossen added module: compile-time Compilation mechanism or time spent in (re)compilation, tracing, startup release notes: aot autograd release notes category topic: performance topic category labels Apr 8, 2026

frgossen requested a review from zou3519 April 8, 2026 19:16

frgossen marked this pull request as ready for review April 8, 2026 20:17

frgossen removed the request for review from bdhirsh April 8, 2026 20:17

frgossen mentioned this pull request Apr 10, 2026

Add donate_graph_module option to standalone_compile #179910

Closed

aorenste approved these changes Apr 10, 2026

View reviewed changes

pytorch-bot Bot added the ciflow/trunk Trigger trunk jobs on your pull request label Apr 10, 2026

pytorchmergebot added the merging label Apr 10, 2026

pytorchmergebot added the Merged label Apr 10, 2026

pytorchmergebot closed this in aeb31cc Apr 10, 2026

pytorchmergebot removed the merging label Apr 10, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Skip expensive debug_lines computation in AOT autograd cache#179733

Skip expensive debug_lines computation in AOT autograd cache#179733
frgossen wants to merge 1 commit intogh/frgossen/15/basefrom
gh/frgossen/15/head

frgossen commented Apr 8, 2026 •

edited

Loading

Uh oh!

pytorch-bot Bot commented Apr 8, 2026 •

edited

Loading

Uh oh!

pytorch-bot Bot commented Apr 8, 2026

Uh oh!

pytorchmergebot commented Apr 10, 2026

Uh oh!

frgossen commented Apr 10, 2026

Uh oh!

pytorchmergebot commented Apr 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

frgossen commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/179733

❗ 1 Active SEVs

✅ No Failures

Uh oh!

pytorch-bot Bot commented Apr 8, 2026

This PR needs a release notes: label

Uh oh!

pytorchmergebot commented Apr 10, 2026

Uh oh!

frgossen commented Apr 10, 2026

Uh oh!

pytorchmergebot commented Apr 10, 2026

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

frgossen commented Apr 8, 2026 •

edited

Loading

pytorch-bot Bot commented Apr 8, 2026 •

edited

Loading

This PR needs a `release notes:` label