
Terminal-Bench 2.0

Emerging
2 papers using it
13,527 HF downloads
39 HF likes
First seen: 2026

Warning: The leaderboard above is unofficial. The official leaderboard is https://www.tbench.ai/leaderboard/terminal-bench/2.0, where entries are audited for correct configuration, results show which agent harness was used, and verified trajectories are publicly viewable.

Warning: The dataset is a read-only mirror. The primary source for this dataset is on GitHub: https://github.com/harbor-framework/terminal-bench-2. Please open issues and pull requests there.

How this mirror was created… See the full description on the dataset page: https://huggingface.co/datasets/harborframework/terminal-bench-2.0.

Papers using Terminal-Bench 2.0 (1)
