Problem
The Copilot SDK does not expose a configuration path to route subagent and tool-call tasks through generic local inference endpoints (e.g., LM Studio, Ollama). BYOM providers are supported, but only Anthropic ones, so there is no mechanism for arbitrary local model routing.
What BYOM Does Today
- BYOM supports Anthropic provider configuration
- Subagents inherit the session model by design
- The multiplier guard prevents subagent model escalation (see #3565)
What BYOM Does Not Do
- Register generic local inference servers as providers
- Route subagent tasks through LM Studio or other local endpoints
- Support non-Anthropic external model providers
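To make the gap concrete, a provider entry for a generic local server might look roughly like the sketch below. Everything here is hypothetical: `LocalProviderConfig` and `isLoopbackEndpoint` are illustrative names, not existing Copilot SDK API, and the model name is a placeholder. The base URL assumes LM Studio's default OpenAI-compatible server on port 1234; Ollama's default port is 11434.

```typescript
// Hypothetical shape: nothing here exists in the Copilot SDK today.
interface LocalProviderConfig {
  name: string;
  baseUrl: string; // OpenAI-compatible endpoint exposed by the local server
  model: string;   // model identifier as known to the local server
}

// Returns true when the URL points at a loopback host,
// i.e. requests to it stay on this machine.
function isLoopbackEndpoint(baseUrl: string): boolean {
  const host = new URL(baseUrl).hostname;
  return host === "localhost" || host === "127.0.0.1" || host === "[::1]";
}

// Example entry, assuming LM Studio's default port (placeholder model name).
const lmStudio: LocalProviderConfig = {
  name: "lmstudio",
  baseUrl: "http://localhost:1234/v1",
  model: "example-local-model",
};

console.log(lmStudio.name, isLoopbackEndpoint(lmStudio.baseUrl));
```

A loopback check like this would let the SDK distinguish a truly local provider from a remote one that merely speaks the same protocol.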
Impact
- Privacy: workspace context is sent to the cloud when local routing is unavailable
- Security: sensitive code is transmitted to third-party inference services when no local path exists
- Cost: untracked cloud API usage from subagent tasks that could run locally
- Trust: silent cloud routing violates local-only expectations
Requested Behavior
- SDK configuration for generic local model routing in subagent/tool-call paths (not just Anthropic BYOM)
- Per-request cost/privacy warnings when cloud fallback triggers
- Tiered privacy levels with explicit opt-in for cloud routing
- Default to local-first routing where available
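The requested behavior could be sketched as a small routing decision, shown below. This is only an illustration under stated assumptions: `PrivacyLevel`, `RoutingPolicy`, `RouteDecision`, and `routeSubagentTask` are hypothetical names, not existing Copilot SDK API, and the two tiers are one possible way to split the privacy levels.

```typescript
// Hypothetical names throughout: none of this is existing Copilot SDK API.
type PrivacyLevel = "local-only" | "local-first";

interface RoutingPolicy {
  privacyLevel: PrivacyLevel;
  // Must be set explicitly before any request may leave the machine.
  cloudOptIn: boolean;
}

interface RouteDecision {
  target: "local" | "cloud" | "blocked";
  warning?: string; // surfaced per request when the local path is not used
}

function routeSubagentTask(
  policy: RoutingPolicy,
  localAvailable: boolean,
): RouteDecision {
  // Local-first: use the local endpoint whenever it is reachable.
  if (localAvailable) {
    return { target: "local" };
  }
  // Cloud fallback needs a permissive level and explicit opt-in.
  if (policy.privacyLevel === "local-only" || !policy.cloudOptIn) {
    return {
      target: "blocked",
      warning: "Local endpoint unavailable and cloud routing is not permitted.",
    };
  }
  return {
    target: "cloud",
    warning: "Falling back to cloud: workspace context leaves this machine and may incur cost.",
  };
}

// With the local server down and no opt-in, the task is blocked rather than silently rerouted.
console.log(routeSubagentTask({ privacyLevel: "local-first", cloudOptIn: false }, false));
```

The point of the design is that cloud routing never happens silently: it is either refused or accompanied by a warning on that request.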
Related
- copilot-cli#3565: multiplier guard silently downgrades subagent model
- copilot-cli#3301: local model feature request