Evolutionary Reinforcement Learning For Sample-efficient Multiagent Coordination
2019 Β· Shauharda Khadka, Somdeb Majumdar, Santiago Miret, et al.
Abstract
Many cooperative multiagent reinforcement learning environments provide agents with a sparse team-based reward, as well as a dense agent-specific reward that incentivizes learning basic skills. Training policies solely on the team-based reward is often difficult due to its sparsity. Furthermore, relying solely on the agent-specific reward is sub-optimal because it usually does not capture the team coordination objective. A common approach is to use reward shaping to construct a proxy reward by combining the individual rewards. However, this requires manual tuning for each environment. We introduce Multiagent Evolutionary Reinforcement Learning (MERL), a split-level training platform that handles the two objectives separately through two optimization processes. An evolutionary algorithm maximizes the sparse team-based objective through neuroevolution on a population of teams. Concurrently, a gradient-based optimizer trains policies to only maximize the dense agent-specific rewards. The
Authors
(none)
Tags
Stats
Related papers
- Coordinated Exploration Via Intrinsic Rewards For Multi-agent Reinforcement Learning (2019)0.00
- Individual Specialization In Multi-task Environments With Multiagent Reinforcement Learners (2019)0.00
- LERO: Llm-driven Evolutionary Framework With Hybrid Rewards And Enhanced Observation For Multi-agent Reinforcement Learning (2025)3.58
- Strategic Coordination For Evolving Multi-agent Systems: A Hierarchical Reinforcement And Collective Learning Approach (2025)0.00
- Robust Multi-agent Reinforcement Learning With Social Empowerment For Coordination And Communication (2020)0.00
- Collaborative Evolutionary Reinforcement Learning (2019)0.00
- Influence-based Reinforcement Learning For Intrinsically-motivated Agents (2021)0.00
- Promoting Coordination Through Policy Regularization In Multi-agent Deep Reinforcement Learning (2019)0.00