Lin Qiu
6 papers · 4 citations
Most-cited papers
- Automatically Benchmarking LLM Code Agents Through Agent-driven Annotation And Evaluation (2025)
- Catarena: Evaluating Evolutionary Capabilities Of Code Agents Via Iterative Tournaments (2025)
- Amemgym: Interactive Memory Benchmarking For Assistants In Long-horizon Conversations (2026)
Topics