Hi team,
ATR (Agent Threat Rules) maintains 108 open-source detection rules for AI agent threats — same rules that Cisco AI Defense ships in production (cisco-ai-defense/skill-scanner#79). We'd like to contribute these to Agent Governance Toolkit.
We noticed mcp-security.yaml has 51 detection patterns across 7 categories. ATR covers 9 categories with 87 high-confidence patterns tested on 53,577 real-world MCP skills (0% FP on clean content).
What ATR covers:
- Prompt injection (10 patterns)
- Tool poisoning (10 patterns)
- Credential exfiltration (10 patterns)
- Skill supply chain (10 patterns)
- Agent manipulation (10 patterns)
- Privilege escalation (10 patterns)
- Excessive autonomy (10 patterns)
- Data poisoning (10 patterns)
- Model abuse (7 patterns)
Questions before we submit a PR:
- Would you prefer we enhance mcp-security.yaml or add a separate policy file?
- Any format requirements beyond the existing detection_patterns schema?
- We'd love to discuss ongoing community rule maintenance — would you be open to ATR as an upstream source for detection patterns?
Happy to adjust to whatever approach works best for your architecture.
Hi team,
ATR (Agent Threat Rules) maintains 108 open-source detection rules for AI agent threats — same rules that Cisco AI Defense ships in production (cisco-ai-defense/skill-scanner#79). We'd like to contribute these to Agent Governance Toolkit.
We noticed mcp-security.yaml has 51 detection patterns across 7 categories. ATR covers 9 categories with 87 high-confidence patterns tested on 53,577 real-world MCP skills (0% FP on clean content).
What ATR covers:
Questions before we submit a PR:
Happy to adjust to whatever approach works best for your architecture.