All

16 repositories

terminal-bench-science
Public
Terminal-Bench-Science: Evaluating AI Agents on Complex Real-World Scientific Workflows in the Terminal
ai4science ai-for-science agentic-ai
ai4science ai-for-science agentic-ai
Python
•
Apache License 2.0
•73•143•2•49•Updated Jun 17, 2026Jun 17, 2026
harbor
Public
Harbor is a framework for running agent evaluations and creating and using RL environments.
rl-environments evals terminal-bench
rl-environments evals terminal-bench
Python
•
Apache License 2.0
•1.2k•2.5k•138•288•Updated Jun 17, 2026Jun 17, 2026
terminal-bench-3
Public
Measuring agents' ability to get work done on a computer
Python
•289•241•3•112•Updated Jun 16, 2026Jun 16, 2026
harbor-adapters-experiments
Public
Python
•
Apache License 2.0
•12•6•0•0•Updated Jun 16, 2026Jun 16, 2026
t-bench-docs
Public
TypeScript
•14•7•2•0•Updated Jun 16, 2026Jun 16, 2026
terminal-bench-challenges
Public
Shell
•3•8•0•0•Updated Jun 16, 2026Jun 16, 2026
docs
Public
MDX
•
MIT License
•0•0•0•0•Updated Jun 3, 2026Jun 3, 2026
benchmark-template
Public template
Harbor Benchmark Template
Python
•10•12•7•7•Updated May 30, 2026May 30, 2026
awesome-harbor
Public
A curated list of awesome Harbor ecosystem projects
2•42•0•1•Updated May 29, 2026May 29, 2026
harbor-datasets
Public
114•33•6•20•Updated May 16, 2026May 16, 2026
skills
Public
Public agent skills catalog for Harbor
Apache License 2.0
•1•9•0•11•Updated May 12, 2026May 12, 2026
terminal-bench-2-1
Public
Terminal-Bench 2.1
Shell
•
Apache License 2.0
•6•21•1•7•Updated May 5, 2026May 5, 2026
terminal-bench-2
Public
Shell
•
Apache License 2.0
•87•288•17•19•Updated Apr 30, 2026Apr 30, 2026
harbor-cookbook
Public
Realistic examples of building evals and optimizing agents with Harbor
Python
•
Apache License 2.0
•10•104•0•1•Updated Apr 23, 2026Apr 23, 2026
harbor-docs
Public
MDX
•11•3•0•4•Updated Mar 31, 2026Mar 31, 2026
terminal-bench
Public
A benchmark for LLMs on complicated tasks in the terminal
Python
•
Apache License 2.0
•543•2.4k•112•190•Updated Jan 22, 2026Jan 22, 2026

ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Harbor

All

All

16 repositories

terminal-bench-science

harbor

terminal-bench-3

harbor-adapters-experiments

t-bench-docs

terminal-bench-challenges

docs

benchmark-template

awesome-harbor

harbor-datasets

skills

terminal-bench-2-1

terminal-bench-2

harbor-cookbook

harbor-docs

terminal-bench

All

All

Repositories list

16 repositories