Skip to content

fix(gemini): cap maxOutputTokens for gemini-3-flash to prevent 400 error#2119

Open
Haixing-Hu wants to merge 1 commit intolbjlaq:mainfrom
Haixing-Hu:fix/gemini-3-flash-maxoutput
Open

fix(gemini): cap maxOutputTokens for gemini-3-flash to prevent 400 error#2119
Haixing-Hu wants to merge 1 commit intolbjlaq:mainfrom
Haixing-Hu:fix/gemini-3-flash-maxoutput

Conversation

@Haixing-Hu
Copy link

gemini-cli sends maxOutputTokens=131072 but gemini-3-flash only supports up to 65536, causing 400 INVALID_ARGUMENT from v1internal API.

Add model-specific maxOutputTokens cap in wrap_request:

  • gemini-3-flash: 65536
  • gemini-2.5-flash: 65536
  • others: 131072 (default)

gemini-cli sends maxOutputTokens=131072 but gemini-3-flash only supports
up to 65536, causing 400 INVALID_ARGUMENT from v1internal API.

Add model-specific maxOutputTokens cap in wrap_request:
- gemini-3-flash: 65536
- gemini-2.5-flash: 65536
- others: 131072 (default)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant