Awesome Similarity Search
πŸ“„Papers🧭TopicsπŸ‘₯AuthorsπŸ”₯TrendingπŸ—ΊοΈMapπŸ†LeaderboardsπŸ“šPacksπŸ› οΈToolsπŸ“BlogsπŸ€–Ask AIβœ‰οΈNewsletterπŸš€Pro
+ Add Paper

← authors Β· overview

Lin Qiu

6 papers Β· 4 citations
Most-cited papers
  • Automatically Benchmarking LLM Code Agents Through Agent-driven Annotation And Evaluation
    2025
  • Catarena: Evaluating Evolutionary Capabilities Of Code Agents Via Iterative Tournaments
    2025
  • Amemgym: Interactive Memory Benchmarking For Assistants In Long-horizon Conversations
    2026
Topics
EvaluationCode AgentsBenchmarksMulti-AgentMemory

Stay Updated

E-Mail Digest

Submit a paper Β· Privacy Β· Terms

Β© 2026 Awesome Papers.