Sensitive-path pattern guard for run_command arguments

## Background

The shell tool's allowlist + per-prefix risky-arg demotion (#257, #258) blocks destructive operations, but read-only commands like `cat`, `grep`, `find`, `head`, `tail` are still on the fast path with *any* path argument. The agent can `cat ~/.ssh/id_rsa` or `grep -r AWS_SECRET /etc` and pipe the contents straight into the LLM provider's chat context — a real exfiltration vector even on a local CLI.

OWASP AI Agent Cheat Sheet § 8 (Data Protection / Path traversal) and § 1's example config (`"blocked_patterns": ["*.env", "*.key", "*.pem", "*secret*"]`) frame this as the complement to per-tool least-privilege: not just *which* command, but *against which resources*.

## Proposal

Path-argument scanner on top of the existing allowlist gate. When a `run_command` call matches the allowlist **and** one of its argv tokens resolves into a configured sensitive prefix or matches a sensitive filename pattern, demote it to the confirm gate (same code path as `RISKY_ARGS`).

Default blocklist (suggested):

- **Path prefixes**: `~/.ssh`, `~/.aws`, `~/.gnupg`, `~/.kube`, `/etc/shadow`, `/etc/sudoers`
- **Filename patterns**: `*.env`, `*.env.*`, `*.key`, `*.pem`, `id_rsa*`, `id_ed25519*`, `*credentials*`, `*secret*`

User-configurable. Case-insensitive on Windows. Tilde expansion + cwd-relative resolution required.

## Acceptance criteria

- [ ] New `SENSITIVE_PATHS` config layer (default list + user override)
- [ ] Path-arg scanner in `src/tools/shell.ts` that flags any argv token resolving into the blocklist
- [ ] Demotion fires in the same code path as `RISKY_ARGS` (chains + confirm gate work uniformly)
- [ ] Tests: absolute paths, `~/`-relative paths, cwd-relative paths, glob patterns, case-insensitive on win32
- [ ] No false positives on legit project paths; `./src/.env.example` *should* still trigger (we do not want the agent reading `.env`-shaped files at all)

## Out of scope

- Output-side scanning (post-execution PII filter on stdout) — separate concern, much larger
- Sandboxing / seccomp / chroot — the local-CLI threat model assumes user trust of the binary, just not of the model

## References

- OWASP AI Agent Cheat Sheet § 1, § 8
- Sibling work: #257, #258

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sensitive-path pattern guard for run_command arguments #259

Background

Proposal

Acceptance criteria

Out of scope

References

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Sensitive-path pattern guard for run_command arguments #259

Description

Background

Proposal

Acceptance criteria

Out of scope

References

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions