
feat(messages): implement prompt caching metrics tracking#5469

Merged
cdoern merged 2 commits into llamastack:main from cdoern:feat/messages-cache-metrics
Apr 14, 2026

Conversation


@cdoern cdoern commented Apr 7, 2026

Summary

Implements prompt caching metrics tracking for the Anthropic Messages API by mapping OpenAI's cache metrics to Anthropic's cache fields.

Changes

  • Maps usage.prompt_tokens_details.cached_tokens to cache_read_input_tokens in non-streaming responses
  • Maps cache metrics in streaming responses via the final MessageDeltaEvent
  • cache_creation_input_tokens remains None (OpenAI does not provide this metric)

Implementation

  • Updated _openai_to_anthropic() to extract and map cache metrics
  • Updated _stream_openai_to_anthropic() to track and emit cache metrics
  • Added defensive checks for prompt_tokens_details existence

Testing

Added 2 unit tests:

  • test_cache_metrics_mapping - verifies cache metrics are properly mapped when present
  • test_cache_metrics_missing - verifies graceful handling when cache metrics are absent

All 19 unit tests pass.
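The two cases could be sketched in pytest style as follows. `extract_cached_tokens` here is a stand-in for the converter's extraction logic; the actual tests exercise the real `_openai_to_anthropic()` converter with full response objects.

```python
from types import SimpleNamespace

def extract_cached_tokens(usage):
    # Stand-in for the converter's defensive cache-metric extraction
    details = getattr(usage, "prompt_tokens_details", None)
    return getattr(details, "cached_tokens", None) if details is not None else None

def test_cache_metrics_mapping():
    # Cache metrics present: value is carried through
    usage = SimpleNamespace(prompt_tokens_details=SimpleNamespace(cached_tokens=42))
    assert extract_cached_tokens(usage) == 42

def test_cache_metrics_missing():
    # Cache metrics absent: handled gracefully, no AttributeError
    usage = SimpleNamespace(prompt_tokens_details=None)
    assert extract_cached_tokens(usage) is None
```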

Test Plan

uv run pytest tests/unit/providers/inline/messages/ -v

🤖 Generated with Claude Code

Add support for mapping OpenAI's cache metrics to Anthropic's cache fields.
Maps usage.prompt_tokens_details.cached_tokens to cache_read_input_tokens
in both non-streaming and streaming responses. cache_creation_input_tokens
remains None as OpenAI does not provide this metric.

Includes unit tests for both scenarios (cache metrics present and missing).

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Signed-off-by: Charlie Doern <cdoern@redhat.com>
@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Meta Open Source bot. label Apr 7, 2026
@cdoern cdoern marked this pull request as ready for review April 7, 2026 19:16
@cdoern cdoern enabled auto-merge April 14, 2026 19:54
@cdoern cdoern added this pull request to the merge queue Apr 14, 2026
Merged via the queue into llamastack:main with commit bc61421 Apr 14, 2026
64 checks passed
@cdoern cdoern deleted the feat/messages-cache-metrics branch April 14, 2026 20:12
