Dev by im4codes · Pull Request #10 · im4codes/imcodes

im4codes · 2026-04-21T08:39:35Z

No description provided.

…of being overridden by them Observed failure: user set global rule "Always commit and push if asked!" in the supervision defaults. A session hit idle with uncommitted work; user asked "提交了么?" (did you commit?), agent answered "还没提交" (no). Supervisor returned `complete` — rule was never enforced. Root cause — two heuristics in the decision-prompt rule list were structurally able to defeat any user rule: 1. "A factual answer to a user question ... is typically complete for that turn; the user asked a question, the agent answered it. Do not treat state reports as proposed work." 2. "A user-set supervision rule phrased conditionally ('if asked', 'when X') is conditional. Check whether the condition actually fires in the current turn before using it to justify continue." The arbiter LLM took "Always commit and push if asked!" at heuristic #2's narrowest reading ("the user didn't literally command 'commit it' this turn → condition didn't fire") and combined it with heuristic #1 to justify `complete` on the Q-and-A turn. Result: the user's enforce-this rule was silently downgraded to "advice the arbiter may ignore". Fix — reorder and rewrite: - New top-of-list clause: "USER-SET SUPERVISION RULES ARE AUTHORITATIVE." This is the first decision rule the arbiter reads. It says the user- rules block overrides the generic heuristics below it, gives concrete worked examples for: * commit/push rules (matches the current failure mode verbatim) * blanket wording ("always", "每次", "必须", "绝不") → unconditional * conditional wording ("if asked", "when X", "如果", "当") → interpret GENEROUSLY in the user's favor: the topic appearing in the conversation IS the condition firing. - Heuristic #1 ("factual Q&A → complete") now explicitly reads "typically complete for that turn IF no user-set rule applies" — so it still covers ordinary questions but stops poaching turns that a user rule governs. - Heuristic #2 (the conditional-rule escape hatch) is removed; its responsibility is folded into the authoritative clause, which now owns all conditional-rule handling from the user-rules-always-win side. - Repair prompt mirrors the same clause so JSON-invalid fallbacks can't drop back into the old behavior. All 71 existing supervision prompt / config / broker tests stay green. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

IM.codes and others added 7 commits April 21, 2026 14:29

fix codex websearch labels and default pgvector image

ba98d12

Merge remote-tracking branch 'origin/master' into dev

44c0185

fix: collapse duplicate auto supervision notes

0e2c623

fix: upload oversized pasted chat text as attachments

959cd5f

fix: lower pasted text attachment threshold

f1c84f2

fix: dedupe supervision auto notes

0a752ed

im4codes merged commit 482b504 into master Apr 21, 2026
32 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dev#10

Dev#10
im4codes merged 7 commits intomasterfrom
dev

im4codes commented Apr 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

im4codes commented Apr 21, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant