Awesome Papers
Tong Yang
5 papers · 0 citations
Most-cited papers
Claw-eval: Towards Trustworthy Evaluation Of Autonomous Agents (2026)
Nl2repo-bench: Towards Long-horizon Repository Generation Evaluation Of Coding Agents (2025)
Top co-authors
Ge Zhang · 1
He Zhu · 1
Jiaheng Liu · 1
Jian Yang · 1
Lei Li · 1
Lei Yu · 1
Lingpeng Kong · 1
Minghao Liu · 1
Qian Liu · 1
Wenhao Huang · 1
Xiang Gao · 1
Yujia Qin · 1
Topics
Evaluation
Code Agents
Multi-Agent
Browser Agents
Safety
Benchmarks