Fix Qwen3 thinking mode (#224) by DenisovAV · Pull Request #225 · DenisovAV/flutter_gemma

DenisovAV · 2026-04-18T10:16:22Z

Summary

Qwen3 generates <think> blocks by default — now stripped when isThinking: false
Separate Qwen filter (starts insideThinking=false, detects opening <think> tag) — safe for Qwen2.5 which doesn't generate thinking
Always apply thinking filter for Qwen/DeepSeek/Gemma4, discard ThinkingResponse when user didn't request it

Tested on 6 models × 2 modes = 12 tests passed (Qwen3, Qwen2.5, Gemma4 E2B, Gemma3 1B, Gemma3n E2B, FunctionGemma 270M)

Closes #224

- Add separate Qwen thinking filter (insideThinking=false, detects <think> opening tag) - Always apply thinking filter for models that may generate thinking (Qwen, DeepSeek, Gemma 4) - Strip ThinkingResponse when isThinking=false - Add Qwen3 thinking support to README and model config - Bump version to 0.13.5

- Rewrite DeepSeek and Qwen stream filters with buffer pattern (like Gemma 4) - Handle partial <think>/</think> tags split across token boundaries - Add unit tests for partial tag split cases (DeepSeek + Qwen) - Add Qwen2.5 passthrough test (no thinking tags)

DenisovAV added 2 commits April 18, 2026 12:14

DenisovAV merged commit 20c909c into main Apr 18, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Qwen3 thinking mode (#224)#225

Fix Qwen3 thinking mode (#224)#225
DenisovAV merged 2 commits intomainfrom
fix/issue-224-qwen3-thinking

DenisovAV commented Apr 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

DenisovAV commented Apr 18, 2026

Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant