Yi Wu
12 papers · 737 citations
Most-cited papers
- Is DPO Superior To PPO For LLM Alignment? A Comprehensive Study (2024) · 273 citations
- BitNet: Scaling 1-bit Transformers For Large Language Models (2023) · 220 citations
- Language Agents With Reinforcement Learning For Strategic Play In The Werewolf Game (2023) · 144 citations
- LLM-powered Hierarchical Language Agent For Real-time Human-AI Coordination (2023) · 66 citations
- ReaL: Efficient RLHF Training Of Large Language Models With Parameter Reallocation (2024) · 29 citations
- Beyond Ten Turns: Unlocking Long-horizon Agentic Search With Large-scale Asynchronous RL (2025)
- Focus On The Core: Empowering Diffusion Large Language Models By Self-contrast (2026)
- Learning Design And Construction With Varying-sized Materials Via Prioritized Memory Resets (2022)
Topics