Xuezhi Cao
7 papers Β· 4 citations
Most-cited papers
- Automatically Benchmarking LLM Code Agents Through Agent-driven Annotation And Evaluation2025
- Amemgym: Interactive Memory Benchmarking For Assistants In Long-horizon Conversations2026
- Longcat-flash-prover: Advancing Native Formal Reasoning Via Agentic Tool-integrated Reinforcement Learning2026
Topics