Awesome Papers
LLMs
Quantum
SimSearch
AI4Code
Agents
CV
Robotics
Cyber
AI4Sci
Speech
RL
MM
GenAI
Graph
TS
RecSys
FL
☾
☀
← authors
·
overview
Yan Wang
36
papers ·
3
citations
Most-cited papers
Ml-bench: Evaluating Large Language Models And Agents For Machine Learning Tasks On Repository-level Code
2023 · 49 citations
Topics
Code
Evaluation
🤖
Ask AI