Awesome Similarity Search
π
Papers
π§
Topics
π₯
Trending
πΊοΈ
Map
π
Leaderboards
π
Learn
π€
Ask AI
β―
More
π₯
Authors
π
Reading Packs
π οΈ
Tools
π
Blogs
βοΈ
Newsletter
π
Saved
+ Add Paper
βΎ
β
β authors
Β·
overview
Yuxuan Zhu
4
papers Β·
0
citations
Most-cited papers
Terminal-bench: Benchmarking Agents On Hard, Realistic Tasks In Command Line Interfaces
2026
Establishing Best Practices For Building Rigorous Agentic Benchmarks
2025
Topics
Benchmarks
Code Agents
π€
Ask AI