← all datasets

Terminal-Bench 2.0

Emerging
2papers using it
13,597HF downloads
39HF likes
2026first seen

Warning: The leaderboard above is unofficial. The official leaderboard is https://www.tbench.ai/leaderboard/terminal-bench/2.0, in which entires are audited for correct configuration, results show which agent harness is used, and verified trajectories are publicly viewable. Warning: The dataset is a read-only mirror. T

Papers using Terminal-Bench 2.0 (2)

Terminal-Bench 2.0 β€” datasets β€” reinforcement-learning