Skip to content

Add opensource pipeline agents (forker, sanitizer, packager)#157

Closed
herakles-dev wants to merge 4 commits intoVoltAgent:mainfrom
herakles-dev:add-opensource-agents
Closed

Add opensource pipeline agents (forker, sanitizer, packager)#157
herakles-dev wants to merge 4 commits intoVoltAgent:mainfrom
herakles-dev:add-opensource-agents

Conversation

@herakles-dev
Copy link
Copy Markdown

Summary

This PR adds three agents that form a complete open-source release pipeline to the Quality & Security category:

  • opensource-forker — Stage 1: Copies a private project to a staging directory, strips 30+ secret patterns (API keys, DB credentials, private keys, tokens, OAuth secrets), replaces internal references (domains, home paths, private IPs), and generates .env.example from all extracted configuration. Produces a FORK_REPORT.md documenting every change.

  • opensource-sanitizer — Stage 2: Independent read-only auditor that verifies the fork is safe for release. Scans across 6 categories (secrets, PII, internal references, dangerous files, config completeness, git history) and produces a SANITIZATION_REPORT.md with a PASS / FAIL / PASS WITH WARNINGS verdict. Never trusts the forker's work.

  • opensource-packager — Stage 3: Generates professional open-source packaging after the sanitizer issues a PASS. Produces CLAUDE.md (Claude Code context file), setup.sh (one-command bootstrap with prerequisite checks), enhanced README.md (with "Using with Claude Code" section), LICENSE, CONTRIBUTING.md, and optional GitHub issue templates.

Pipeline: opensource-forkeropensource-sanitizeropensource-packager

Full pipeline source and orchestration skill: https://github.com/herakles-dev/opensource-pipeline

Changes

  • categories/04-quality-security/opensource-forker.md — new agent
  • categories/04-quality-security/opensource-sanitizer.md — new agent
  • categories/04-quality-security/opensource-packager.md — new agent
  • categories/04-quality-security/README.md — added agent descriptions, updated Quick Selection Guide and Common Patterns
  • README.md — added three entries in alphabetical order within the Quality & Security section

Test plan

  • All three agent files have valid YAML frontmatter (name, description, tools, model)
  • Agents are added to categories/04-quality-security/ matching the existing naming convention
  • Main README.md entries are in alphabetical order within the category section
  • Category README.md includes agent descriptions with Use when: guidance
  • Quick Selection Guide table updated with new rows
  • Pipeline source link is correct: https://github.com/herakles-dev/opensource-pipeline

🤖 Generated with Claude Code

herakles-dev and others added 4 commits March 31, 2026 06:56
Three agents forming a complete open-source release pipeline:
- opensource-forker: strips secrets, replaces internal refs, generates .env.example
- opensource-sanitizer: independent PASS/FAIL audit with 30+ secret patterns
- opensource-packager: generates CLAUDE.md, setup.sh, README, LICENSE, CONTRIBUTING

All three agents added to categories/04-quality-security/ with descriptions
updated in both the category README and main README in alphabetical order.

Pipeline source: https://github.com/herakles-dev/opensource-pipeline
…1/P2 review fixes)

- forker: rsync now excludes .env* (variants), .claude/, .secrets/, secrets/
- forker/sanitizer: AWS regex covers aws_secret alias + case-insensitive, min 20 chars
- forker/sanitizer: JWT pattern requires full 3-segment structure
- forker/sanitizer: GitHub tokens cover all token types (gh[pousr]_), min length 36+
- forker: generic env pattern increases min length 8→16, adds non-secret lookaheads
- sanitizer: high-entropy WARNING separated from CRITICAL scan categories
- sanitizer: internal path detection broadened to macOS /Users/ and Windows C:\Users\
- sanitizer: read-only rule clarified to "never modify source files — only generate reports"
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Addresses 28 review comments plus 5 previously-unresolved findings:

- Nuanced *.pem/*.key handling (test certs vs real private keys)
- Explicit guard against real secret values in .env.example
- RFC1918 172.16-31.x.x range in forker replacement table
- Windows/macOS path detection with multi-char usernames
- Sanitizer validates .env.example contains only placeholders
- Packager uses indented template blocks for nested fenced code

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@necatiozmen
Copy link
Copy Markdown
Member

Thanks for the submission. However, the project looks quite new at the moment. Please feel free to open a new PR once it has matured a bit more. In the meantime, we recommend reviewing the contribution guide.

@necatiozmen necatiozmen closed this Apr 1, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants