Haitao Mi
12 papers Β· 619 citations
Most-cited papers
- Scaling Synthetic Data Creation With 1,000,000,000 Personas2024 Β· 345 citations
- Toward Self-improvement Of Llms Via Imagination, Searching, And Criticizing2024 Β· 138 citations
- Iterative Nash Policy Optimization: Aligning Llms With General Preferences Via No-regret Learning2024 Β· 40 citations
- The Trickle-down Impact Of Reward (in-)consistency On RLHF2023 Β· 29 citations
- Verified Critical Step Optimization For LLM Agents2026
- Inference-time Scaling Of Verification: Self-evolving Deep Research Agents Via Test-time Rubric-guided Verification2026
- Webaggregator: Enhancing Compositional Reasoning Capabilities Of Deep Research Agent Foundation Models2026
Topics