SDD has no effort-level dimension: subagents run at session effort regardless of task complexity

<html>
<body>
<html><head></head><body><ul>
<li>[x] I searched existing issues and this has not been proposed before</li>
</ul>
<p><em>(Closest prior art, checked open and closed: #1631 covers the model half of this — workers inherit the parent <strong>model</strong> — and the #1744/#1717/#1716 cost-tiering stack fixes which <strong>model</strong> each role runs on, with zero mentions of effort. #1120 is review cadence. This proposal is the reasoning-effort dimension of the same cost problem; if maintainers consider it subsumed by the #1744 direction, happy to close.)</em></p>
<h2>What problem does this solve?</h2>
<p>Running an SDD-patterned pipeline on Claude Code, I noticed every subagent — implementer, spec reviewer, quality reviewer — runs at the session's effort configuration, because effort can't be set at dispatch: Claude Code's Task tool can override <code>model</code> per-invocation but not <code>effort</code> — the only per-subagent effort control is custom agent-definition frontmatter — and SDD dispatches via prompt templates on general-purpose agents.</p>
<p>The misallocation goes both ways. <code>writing-plans</code> produces 2–5 minute tasks with complete code in the plan; a high-effort session spends high-effort reasoning transcribing them (the effort-shaped twin of #1744's measured "most capable model for transcription-grade tasks" failure, which its model fix doesn't touch). Meanwhile a default-effort session runs the final whole-branch review — the dispatch #1744 deliberately pins to the most capable <em>model</em> — at default effort.</p>
<p>This is not Claude Code-specific; the harnesses just sit at different points:</p>
<ul>
<li><strong>Claude Code</strong>: <code>effort</code> (low/medium/high/xhigh/max; levels vary by model — Sonnet 4.6 has no xhigh, and unsupported levels fall back to the nearest supported level below). It is frontmatter-only, so per-role control requires pre-defined agent files. We measured the delivery path at the API request level (logging proxy, CC 2.1.176): subagent calls carry <code>output_config.effort</code> from the agent's frontmatter, overriding the session level (isolated: an <code>effort: low</code> agent under an effort-high session produced <code>"effort":"low"</code> on the wire), while the same calls are sent with <code>thinking: {"type": "disabled"}</code> — so prompt-level thinking cues are inert for subagents, and Haiku (which has no effort parameter) is left with no per-role reasoning control at all (see anthropics/claude-code#14321).</li>
<li><strong>Codex</strong>: <code>model_reasoning_effort</code> (minimal/low/medium/high/xhigh) is a <strong>first-class per-custom-agent key</strong> in agent TOML files, inheriting unset keys from the parent session. The knob SDD would need already exists natively — it's simply unused.</li>
<li><strong>Gemini CLI</strong>: Gemini 3's <code>thinking_level</code> (low/high/dynamic) exists at the API level, but the harness exposes reasoning depth at session/model-selection level; no per-role mechanism I could find.</li>
</ul>
<p>SDD already encodes "capability should match the task" for models (Model Selection guidance, now the #1744 tiering). Effort is the same dial's second axis, currently welded to the session on every harness.</p>
<h2>Proposed solution</h2>
<p>Add an effort dimension to SDD's dispatch guidance, implemented per-harness where mechanisms exist:</p>
<ul>
<li><strong>Claude Code</strong>: ship effort-pinned agent definitions in the plugin's <code>agents/</code> dir for the roles SDD uses (e.g. <code>implementer-sonnet-med</code>, <code>quality-reviewer-opus-high</code>), dispatched by name, with today's inline templates as the fallback path. Each file is ~15 lines of frontmatter plus the existing role prompt.</li>
<li><strong>Codex</strong>: set <code>model_reasoning_effort</code> in the role's custom-agent config alongside the model choice the cost-tiering work already makes.</li>
<li><strong>Other harnesses</strong>: no change until a per-role mechanism exists; optionally document the session-effort interaction ("SDD sessions at xhigh pay an effort tax on every micro-task").</li>
</ul>
<p>Deliberately a sketch, not a design — the problem statement is the contribution; the right shape (which roles get which defaults, whether the plan's complexity tags drive it) belongs to the eval harness.</p>
<h2>What alternatives did you consider?</h2>
<ul>
<li><strong>Session-effort guidance only</strong> (docs, zero mechanism): can't express "low for implementers, high for final review" simultaneously, which is the actual win.</li>
<li><strong>Per-invocation <code>effort</code> on Claude Code's Task tool</strong>: would dissolve the problem cleanly, and is already requested upstream (anthropics/claude-code#25669, #43083; #14321 is the thinking-flavored sibling) with no movement yet; agent files work today.</li>
<li><strong>Thinking-token caps</strong>: #1744 already measured these backfiring (E06, output tokens +80%). Effort is a different mechanism — the model's own target-depth control, not a truncation cap — so the E06 result shouldn't transfer a priori, but any cost claim here deserves the same eval treatment before being believed.</li>
<li><strong>Do nothing</strong>: every long autonomous session keeps paying session-effort prices on transcription-grade tasks.</li>
</ul>
<h2>Is this appropriate for core Superpowers?</h2>
<p>Yes with a caveat: the problem and guidance are harness-general and benefit any project type; the mechanisms are harness-conditional (first-class on Codex, agent files on Claude Code, unavailable elsewhere). Same shape as the existing per-harness model-selection guidance.</p>
<h2>Environment (required)</h2>

Field | Value
-- | --
Superpowers version | 5.1.0 (installed, currently disabled — see Context for why experience comes from an SDD-patterned plugin instead)
Harness (Claude Code, Cursor, etc.) | Claude Code
Harness version | 2.1.176
Your model + version | This issue was drafted by Claude Fable 5 (claude-fable-5, claude.ai), directed and reviewed by the human submitter. Sessions where the problem surfaced ran haiku, sonnet, opus, and fable subagents (agent-loops agent frontmatter); default session model claude-fable-5[1m] (1M-context variant)
All plugins installed | agent-loops@john-plugins v0.3.1 (enabled) · superpowers@claude-plugins-official v5.1.0 (disabled) · frontend-design@claude-plugins-official (disabled)
Human partner who reviewed this text | @GamingwithJJ


<h2>Context</h2>
<p>Drafting disclosure: written collaboratively — Claude Fable 5 drafted and verified mechanics (docs + issue-tracker sweep); the human submitter directed scope and reviewed every claim. Experience base, stated plainly: this comes from building and running a private Claude Code plugin patterned on SDD (fresh subagent per task, staged reviews) where we solved the effort gap with role×model×effort agent definitions — not from running superpowers' SDD skill itself. Mechanics above were verified against current Claude Code and Codex documentation; the Claude Code effort/thinking delivery claims were additionally measured directly at the API request level with a logging proxy. We have not yet run a controlled cost/quality comparison of effort-pinned vs uniform-effort roles — happy to run one (same plan, both rosters, n≈4 per arm, pre-registered prediction) and bring the numbers if that would help decide whether this is worth pursuing.</p></body></html>
</body>
</html>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

SDD has no effort-level dimension: subagents run at session effort regardless of task complexity #1747

What problem does this solve?

Proposed solution

What alternatives did you consider?

Is this appropriate for core Superpowers?

Environment (required)

Context

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Field	Value
Superpowers version	5.1.0 (installed, currently disabled — see Context for why experience comes from an SDD-patterned plugin instead)
Harness (Claude Code, Cursor, etc.)	Claude Code
Harness version	2.1.176
Your model + version	This issue was drafted by Claude Fable 5 (claude-fable-5, claude.ai), directed and reviewed by the human submitter. Sessions where the problem surfaced ran haiku, sonnet, opus, and fable subagents (agent-loops agent frontmatter); default session model claude-fable-5[1m] (1M-context variant)
All plugins installed	agent-loops@john-plugins v0.3.1 (enabled) · superpowers@claude-plugins-official v5.1.0 (disabled) · frontend-design@claude-plugins-official (disabled)
Human partner who reviewed this text	@GamingwithJJ

Uh oh!

SDD has no effort-level dimension: subagents run at session effort regardless of task complexity #1747

Description

What problem does this solve?

Proposed solution

What alternatives did you consider?

Is this appropriate for core Superpowers?

Environment (required)

Context

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions