Zhuoran Li
4 papers · 1 citation
Most-cited papers
- Offline-to-online Multi-agent Reinforcement Learning With Offline Value Function Memory And Sequential Exploration · 2024 · 1 citation
- Scoring, Reasoning, And Selecting The Best! Ensembling Large Language Models Via A Peer-review Process · 2026
- OM2P: Offline Multi-agent Mean-flow Policy · 2025
- Reparameterization Proximal Policy Optimization · 2025
Topics