Awesome Papers
LLMsQuantumSimSearchAI4CodeAgentsCVRoboticsCyberAI4SciSpeechRLMMGenAIGraphTSRecSysFL

← authors · overview

Dawn Song

14 papers · 8928 citations
Most-cited papers
  • VERINA: Benchmarking Verifiable Code Generation
    2025
  • CUBE: A Standard For Unifying Agent Benchmarks
    2026
  • Opensage: Self-programming Agent Generation Engine
    2026
  • Devops-gym: Benchmarking AI Agents In Software Devops Cycle
    2026
Top co-authors
Hongwei Li · 2Kaijie Zhu · 2Wenbo Guo · 2Graham Neubig · 1Tao Yu · 1William Yang Wang · 1Yu Su · 1
Topics
BenchmarksCode AgentsEvaluationBrowser AgentsMulti-Agent

Privacy · Terms

© 2026 Awesome Papers.