Longbo Huang
4 papers Β· 1 citations
Most-cited papers
- Offline-to-online Multi-agent Reinforcement Learning With Offline Value Function Memory And Sequential Exploration2024 Β· 1 citations
- OM2P: Offline Multi-agent Mean-flow Policy2025
- Multi-path Policy Optimization2019
- Reparameterization Proximal Policy Optimization2025
Topics