@mfenderov
Contributor

Prompt caching is enabled automatically for models that support it (support is detected via models.dev), which reduces latency and cost. System prompts, tool definitions, and recent messages are cached with a 5-minute TTL.

To disable:

provider_opts:
  disable_prompt_caching: true
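
For context, here is a minimal sketch of where the option might sit in an agent file. The surrounding layout (a `models` section with `provider`, `model`, and `provider_opts` keys) is an assumption about the configuration shape, not something this PR shows. The model name `claude_bedrock`, the `amazon-bedrock` provider value, and the model ID placeholder are hypothetical.

```yaml
# Sketch only: every key except provider_opts.disable_prompt_caching
# is an assumption and does not come from this PR.
models:
  claude_bedrock:                      # hypothetical model name
    provider: amazon-bedrock           # assumed provider identifier
    model: "YOUR_BEDROCK_MODEL_ID"     # placeholder; substitute a real model ID
    provider_opts:
      disable_prompt_caching: true     # opt out of automatic prompt caching
```

Omitting the option keeps caching enabled for supported models, as described above.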

P.S. Benchmarked with examples/pr-reviewer-bedrock.yaml: 92% cache reads vs. 8% cache writes.

Assisted-By: cagent

@mfenderov requested a review from a team as a code owner on January 13, 2026, 11:48