fix(event_loop): ensure tool_use content blocks are valid after max_tokens to prevent unrecoverable state #607

dbschmigelski · 2025-08-04T22:31:58Z

Description

Overview

This PR introduces a new event handling mechanism for managing event loop failures, specifically focusing on the MaxTokensReachedException case.

When an LLM is generating a tool_use content block but hits max_tokens. It leaves the agent in an unrecoverable state. To address this we made a change to raise an exception in #576, This prevented the problematic message from being added to the messages array.

This required implementors to re-implement the same filtering logic. In this PR we apply the filtration by default. This will prevent issues like the one called out in #541 by default. A follow up PR will allow this filtration/transformation to be more configurable.

After this PR, if a MaxTokensReachedException is hit. An agent can be restarted. But, it still terminates. This is because the stopReason is still max_tokens.

Some things to note about ordering

The cleaning is done AFTER the AfterModelInvocationEvent is triggered
The cleaning event is done BEFORE the message is appended to the messages array
The cleaning event is done BEFORE the message is logged in the OTEL tracer
MaxTokensReachedException is thrown LAST, terminating the event_loop

Related Issues

#541
#576
#561 - until this is completed, we will need to overwrite ALL tool_use content blocks. After it is we can overwrite only known broken tool uses.

Documentation PR

Follow up after approval

Type of Change

Bug fix

Testing

How have you tested the change? Verify that the changes do not break functionality or introduce warnings in consuming repositories: agents-docs, agents-tools, agents-cli

I ran hatch run prepare

Checklist

I have read the CONTRIBUTING document
I have added any necessary tests that prove my fix is effective or my feature works
I have updated the documentation accordingly
I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
My changes generate no new warnings
Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

…ns stop reason

Co-authored-by: Nick Clegg <[email protected]>

dbschmigelski · 2025-08-06T13:51:11Z

This PR moved to a new approach.

Rather than using Hooks we leverage the ConversationManager. There is existing art for these exceptional EventLoop cases in the ContextWindowOverflowException handling. This avoids bloat in the Agent constructor. It also now automatically applies the handling as it is applied in the default ConversationManager, SlidingWindowConversationManager.

More importantly however, Strands takes the position that hooks should be reserved for augmenting or manipulating the Agent, at the discretion of the implementor. It should not be used for cases like this where there is a known issue that the SDK handles itself. Internally, hooks may be leveraged. But the way they are applied for cases like these should not be by requiring the user to pass A HookProvider in the Agent constructor.(Agent(hooks=[PatchHook()])

Missing from this PR is telemetry. #617 has been created to equip the builtin conversation managers.

src/strands/agent/conversation_manager/conversation_manager.py

src/strands/agent/conversation_manager/token_limit_recovery.py

src/strands/agent/agent.py

src/strands/agent/conversation_manager/null_conversation_manager.py

src/strands/agent/conversation_manager/recover_tool_use_on_max_tokens_reached.py

tests_integ/test_max_tokens_reached.py

src/strands/event_loop/_recover_message_on_max_tokens_reached.py

dbschmigelski and others added 12 commits July 30, 2025 16:31

fix(event_loop): raise dedicated exception when encountering max toke…

c5e4e51

…ns stop reason

fix: update integ tests

6703819

fix: rename exception message, add to exception, move earlier in cycle

c94b74e

Update tests_integ/test_max_tokens_reached.py

36dd0f9

Co-authored-by: Nick Clegg <[email protected]>

Update tests_integ/test_max_tokens_reached.py

e04c73d

Co-authored-by: Nick Clegg <[email protected]>

linting

cca2f86

Merge branch 'strands-agents:main' into fix-max-tokens

f647baa

Merge branch 'strands-agents:main' into fix-max-tokens

78c5a91

Merge branch 'strands-agents:main' into fix-max-tokens

a208496

feat: add builtin hook provider to address max tokens reached truncation

2e2d4df

tests: modify integ test to inspect message history

447d147

fix: fix linting errors

564895d

dbschmigelski temporarily deployed to auto-approve August 4, 2025 22:32 — with GitHub Actions Inactive

fix: linting

2f118fb

dbschmigelski temporarily deployed to auto-approve August 4, 2025 22:39 — with GitHub Actions Inactive

refactor: switch from hook approach to conversation manager

e5fc51a

dbschmigelski temporarily deployed to auto-approve August 5, 2025 22:30 — with GitHub Actions Inactive

linting

5906fc2

dbschmigelski temporarily deployed to auto-approve August 5, 2025 22:33 — with GitHub Actions Inactive

fix: test contained incorrect assertions

87445a3

dbschmigelski temporarily deployed to auto-approve August 6, 2025 13:46 — with GitHub Actions Inactive

zastrowm requested changes Aug 6, 2025

View reviewed changes

zastrowm reviewed Aug 6, 2025

View reviewed changes

src/strands/agent/conversation_manager/token_limit_recovery.py Outdated Show resolved Hide resolved

fix: add event emission

924fea9

dbschmigelski temporarily deployed to auto-approve August 6, 2025 14:27 — with GitHub Actions Inactive

dbschmigelski requested a review from zastrowm August 6, 2025 14:50

zastrowm requested changes Aug 6, 2025

View reviewed changes

feat: move to async

104f6b4

dbschmigelski had a problem deploying to auto-approve August 6, 2025 18:04 — with GitHub Actions Failure

dbschmigelski had a problem deploying to auto-approve August 6, 2025 18:16 — with GitHub Actions Failure

dbschmigelski changed the title ~~feat(hooks): add builtin hook provider to address max tokens reached truncation~~ feat(hooks): add handle_token_limit_reached to ConversationManager to handle MaxTokensReachedException by default Aug 6, 2025

feat: add max tokens reached test

1da9ba7

dbschmigelski had a problem deploying to auto-approve August 6, 2025 18:33 — with GitHub Actions Failure

linting

623f3c7

dbschmigelski temporarily deployed to auto-approve August 6, 2025 18:35 — with GitHub Actions Inactive

feat: add max tokens reached test

66c4c07

dbschmigelski had a problem deploying to auto-approve August 6, 2025 18:38 — with GitHub Actions Failure

dbschmigelski temporarily deployed to auto-approve August 6, 2025 18:44 — with GitHub Actions Inactive

zastrowm reviewed Aug 6, 2025

View reviewed changes

tests_integ/test_max_tokens_reached.py Outdated Show resolved Hide resolved

zastrowm previously approved these changes Aug 6, 2025

View reviewed changes

dbschmigelski requested a review from zastrowm August 6, 2025 20:18

feat: switch to a default behavior to recover from max tokens reached

4b5c5a7

dbschmigelski dismissed zastrowm’s stale review via 4b5c5a7 August 7, 2025 20:50

dbschmigelski temporarily deployed to auto-approve August 7, 2025 20:50 — with GitHub Actions Inactive

dbschmigelski changed the title ~~feat(hooks): add handle_token_limit_reached to ConversationManager to handle MaxTokensReachedException by default~~ fix(event_loop): ensure tool_use content blocks are valid after max_tokens to prevent unrecoverable state Aug 7, 2025

zastrowm reviewed Aug 8, 2025

View reviewed changes

src/strands/event_loop/_recover_message_on_max_tokens_reached.py Outdated Show resolved Hide resolved

fix: all tool uses now must be replaced

83ad822

dbschmigelski temporarily deployed to auto-approve August 8, 2025 14:00 — with GitHub Actions Inactive

fix: boolean

faa4618

dbschmigelski temporarily deployed to auto-approve August 8, 2025 14:03 — with GitHub Actions Inactive

dbschmigelski mentioned this pull request Aug 8, 2025

[FEATURE] Allow ToolUse as latest message #561

Open

dbschmigelski requested a review from zastrowm August 8, 2025 14:07

zastrowm previously approved these changes Aug 8, 2025

View reviewed changes

src/strands/event_loop/_recover_message_on_max_tokens_reached.py Outdated Show resolved Hide resolved

src/strands/event_loop/_recover_message_on_max_tokens_reached.py Show resolved Hide resolved

remove todo

fa8195f

dbschmigelski dismissed zastrowm’s stale review via fa8195f August 8, 2025 14:36

dbschmigelski temporarily deployed to auto-approve August 8, 2025 14:36 — with GitHub Actions Inactive

zastrowm approved these changes Aug 8, 2025

View reviewed changes

dbschmigelski merged commit 29b2127 into strands-agents:main Aug 8, 2025
12 checks passed

dbschmigelski mentioned this pull request Aug 8, 2025

[FEATURE]: Make MaxTokens handling Configurable and add additional Exceptions on terminal events #637

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(event_loop): ensure tool_use content blocks are valid after max_tokens to prevent unrecoverable state #607

fix(event_loop): ensure tool_use content blocks are valid after max_tokens to prevent unrecoverable state #607

Uh oh!

dbschmigelski commented Aug 4, 2025 •

edited by zastrowm

Loading

Uh oh!

dbschmigelski commented Aug 6, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fix(event_loop): ensure tool_use content blocks are valid after max_tokens to prevent unrecoverable state #607

fix(event_loop): ensure tool_use content blocks are valid after max_tokens to prevent unrecoverable state #607

Uh oh!

Conversation

dbschmigelski commented Aug 4, 2025 • edited by zastrowm Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Overview

Related Issues

Documentation PR

Type of Change

Testing

Checklist

Uh oh!

dbschmigelski commented Aug 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dbschmigelski commented Aug 4, 2025 •

edited by zastrowm

Loading

dbschmigelski commented Aug 6, 2025 •

edited

Loading