Speed up ConfigModule._get_dict by avoiding unnecessary work by frgossen · Pull Request #179734 · pytorch/pytorch

frgossen · 2026-04-08T18:17:15Z

Stack from ghstack (oldest at bottom):

_get_dict is called from save_config_portable on every AOT autograd
cache key computation.

1. It called copy.deepcopy on every config value, but the vast
   majority are immutable types (bool, int, str, None) that don't
   need copying. Now only list/set/dict values are deep-copied.
2. It went through __getattr__ for every value, which includes
   deprecation warning checks, alias resolution, and other overhead.
   Now it reads values directly from config entries.
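The two changes described above can be sketched as follows. This is a minimal illustration, not the code in torch/utils/_config_module.py: the `Entry` class, its `value` field, and the `snapshot` function are hypothetical stand-ins, and the real config entries carry more state than shown here.

```python
import copy
from dataclasses import dataclass
from typing import Any


@dataclass
class Entry:
    """Hypothetical stand-in for a config entry; holds only the current value."""
    value: Any


def snapshot(entries: dict[str, Entry]) -> dict[str, Any]:
    """Build a plain dict of config values, copying only mutable containers."""
    result: dict[str, Any] = {}
    for name, entry in entries.items():
        # Read the stored value directly rather than via attribute lookup
        # on the module, so no per-access hooks run.
        value = entry.value
        # Immutable values (bool, int, str, None) can be shared safely;
        # only list/set/dict values are deep-copied.
        if isinstance(value, (list, set, dict)):
            value = copy.deepcopy(value)
        result[name] = value
    return result


entries = {
    "debug": Entry(False),
    "max_autotune_choices": Entry([1, 2, [3]]),  # hypothetical option names
}
snap = snapshot(entries)
```

Mutating `snap["max_autotune_choices"]` after the call leaves the original entry untouched, which is the property the deep copy is there to preserve.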

On a vLLM Meta-Llama-3-70B-Instruct TP=4 benchmark, this reduces
cold compile time from 29.40 ± 0.90 s to 28.50 ± 0.40 s (1.03x)
and cache lookup time from 6.25 ± 0.30 ms to 5.20 ± 0.45 ms
(1.20x).
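For reference, the quoted speedup factors are the ratios of the mean timings before and after the change. The short check below recomputes them from the numbers above; the `speedup` helper is illustrative only, and the ± spreads are not propagated.

```python
def speedup(before: float, after: float) -> float:
    """Ratio of the mean time before the change to the mean time after."""
    return before / after


cold_compile = speedup(29.40, 28.50)  # seconds
cache_lookup = speedup(6.25, 5.20)    # milliseconds

print(f"cold compile: {cold_compile:.2f}x, cache lookup: {cache_lookup:.2f}x")
# prints "cold compile: 1.03x, cache lookup: 1.20x"
```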

Authored with Claude.

cc @oulgen @jamesjwu @aorenste @anijain2305 @laithsakka @penguinwu @masnesral @coconutruben @aditvenk

pytorch-bot · 2026-04-08T18:17:21Z

This PR needs a `release notes:` label

If your changes are user facing and intended to be a part of release notes, please use a label starting with release notes:.

If not, please add the topic: not user facing label.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "topic: not user facing"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

ghstack-source-id: 936470c
Pull Request resolved: #179734

pytorch-bot · 2026-04-08T18:17:24Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/179734

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Rolling out OSDC (ARC) runners on pull workflow for PyTorch trunk commits

✅ No Failures

As of commit 751e8e4 with merge base acdb423:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

pytorchmergebot · 2026-04-10T20:06:03Z

Starting merge as part of PR stack under #179910

pytorchmergebot · 2026-04-13T14:44:09Z

Starting merge as part of PR stack under #179910

frgossen · 2026-04-13T15:36:43Z

@pytorchbot merge

pytorchmergebot · 2026-04-13T15:38:58Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

frgossen mentioned this pull request Apr 8, 2026

Skip expensive debug_lines computation in AOT autograd cache #179733

Closed

frgossen marked this pull request as draft April 8, 2026 18:36

frgossen added the topic: not user facing, module: compile-time, and topic: performance labels and removed the topic: not user facing label Apr 8, 2026

frgossen added the topic: not user facing and release notes: aot autograd labels and removed the topic: not user facing label Apr 8, 2026

frgossen requested review from aorenste and zou3519 and removed request for zou3519 April 8, 2026 19:10

frgossen marked this pull request as ready for review April 8, 2026 19:14

frgossen mentioned this pull request Apr 10, 2026

Add donate_graph_module option to standalone_compile #179910

Closed

aorenste approved these changes Apr 10, 2026

Comment thread torch/utils/_config_module.py

frgossen added the ciflow/trunk label Apr 13, 2026

pytorchmergebot added the merging label Apr 13, 2026

pytorchmergebot closed this in 0e7f09c Apr 13, 2026

pytorchmergebot added the Merged label Apr 13, 2026

pytorchmergebot removed the merging label Apr 13, 2026

frgossen wants to merge 1 commit into gh/frgossen/16/base from gh/frgossen/16/head
