Skip to content

feat: Braintrust skill eval PoC#47

Closed
mattrossman wants to merge 5 commits intomainfrom
mattrossman/ai-468-braintrust-skill-eval-poc
Closed

feat: Braintrust skill eval PoC#47
mattrossman wants to merge 5 commits intomainfrom
mattrossman/ai-468-braintrust-skill-eval-poc

Conversation

@mattrossman
Copy link

@mattrossman mattrossman commented Mar 3, 2026

WIP playground for Braintrust evals.

Ref AI-468

- evals/main.ts: run claude -p Hello in container, capture output
- evals/Dockerfile: node:24-slim + @anthropic-ai/claude-code@2.1.63
- evals:build: build image (run after Dockerfile changes)
- evals:run: run evals (requires image built first)
- ANTHROPIC_API_KEY from .env via --env-file-if-exists
@coderabbitai
Copy link

coderabbitai bot commented Mar 3, 2026

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

  • 🔍 Trigger review

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@mattrossman mattrossman closed this Mar 9, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant