[3.15] gh-149584: Fix excessive overhead in the Tachyon profiler regarding the cache behavior (GH-149649) by miss-islington · Pull Request #150152 · python/cpython

miss-islington · 2026-05-20T11:32:17Z

Use exact remote reads for interpreter state, thread state, and
interpreter frame structs instead of pulling full remote pages into the
profiler page cache. This matches the core change from
#149585.

The profiler clears the page cache between samples, so live entries are
always packed at the front. Track the live count and only clear/search
that prefix instead of scanning all 1024 slots on the hot path.

Use the frame cache to predict the next thread state and top frame
address, then batch interpreter/thread/frame reads with process_vm_readv
when profiling a Linux target. Reuse prefetched frame buffers in the
frame walker when the prediction is valid.

Cache the last FrameInfo tuple per code object/instruction offset, reuse
cached thread id objects, and append cached parent frames directly on
full frame-cache hits. This cuts Python allocation churn in the
steady-state profiler path.
(cherry picked from commit 661df25)

Co-authored-by: Pablo Galindo Salgado Pablogsal@gmail.com

Issue: _remote_debugging: reading whole pages over and over #149584

…ding the cache behavior (pythonGH-149649) Use exact remote reads for interpreter state, thread state, and interpreter frame structs instead of pulling full remote pages into the profiler page cache. This matches the core change from python#149585. The profiler clears the page cache between samples, so live entries are always packed at the front. Track the live count and only clear/search that prefix instead of scanning all 1024 slots on the hot path. Use the frame cache to predict the next thread state and top frame address, then batch interpreter/thread/frame reads with process_vm_readv when profiling a Linux target. Reuse prefetched frame buffers in the frame walker when the prediction is valid. Cache the last FrameInfo tuple per code object/instruction offset, reuse cached thread id objects, and append cached parent frames directly on full frame-cache hits. This cuts Python allocation churn in the steady-state profiler path. (cherry picked from commit 661df25) Co-authored-by: Pablo Galindo Salgado <Pablogsal@gmail.com>

miss-islington requested a review from pablogsal as a code owner May 20, 2026 11:32

This was referenced May 20, 2026

_remote_debugging: reading whole pages over and over #149584

Open

gh-149584: Fix excessive overhead in the Tachyon profiler regarding the cache behavior #149649

Merged

bedevere-app Bot added the awaiting review label May 20, 2026

pablogsal approved these changes May 20, 2026

View reviewed changes

bedevere-app Bot removed the awaiting review label May 20, 2026

pablogsal enabled auto-merge (squash) May 20, 2026 11:33

bedevere-app Bot added the awaiting merge label May 20, 2026

pablogsal merged commit 034c536 into python:3.15 May 20, 2026
60 checks passed

bedevere-app Bot removed the awaiting merge label May 20, 2026

miss-islington deleted the backport-661df25-3.15 branch May 20, 2026 11:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[3.15] gh-149584: Fix excessive overhead in the Tachyon profiler regarding the cache behavior (GH-149649)#150152

[3.15] gh-149584: Fix excessive overhead in the Tachyon profiler regarding the cache behavior (GH-149649)#150152
pablogsal merged 1 commit into
python:3.15from
miss-islington:backport-661df25-3.15

miss-islington commented May 20, 2026 •

edited by bedevere-app Bot

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

miss-islington commented May 20, 2026 • edited by bedevere-app Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

miss-islington commented May 20, 2026 •

edited by bedevere-app Bot

Loading