Evolutionary Population Curriculum For Scaling Multi-agent Reinforcement Learning
2020 Β· Qian Long, Zihan Zhou, Abhibav Gupta, et al.
Abstract
In multi-agent games, the complexity of the environment can grow exponentially as the number of agents increases, so it is particularly challenging to learn good policies when the agent population is large. In this paper, we introduce Evolutionary Population Curriculum (EPC), a curriculum learning paradigm that scales up Multi-Agent Reinforcement Learning (MARL) by progressively increasing the population of training agents in a stage-wise manner. Furthermore, EPC uses an evolutionary approach to fix an objective misalignment issue throughout the curriculum: agents successfully trained in an early stage with a small population are not necessarily the best candidates for adapting to later stages with scaled populations. Concretely, EPC maintains multiple sets of agents in each stage, performs mix-and-match and fine-tuning over these sets and promotes the sets of agents with the best adaptability to the next stage. We implement EPC on a popular MARL algorithm, MADDPG, and empirically show
Authors
(none)
Tags
Stats
Related papers
- Towards Skilled Population Curriculum For Multi-agent Reinforcement Learning (2023)0.00
- Evolution Of Societies Via Reinforcement Learning (2024)0.00
- Learning Progress Driven Multi-agent Curriculum (2022)0.00
- Generative Evolutionary Meta-solver (GEMS): Scalable Surrogate-free Multi-agent Reinforcement Learning (2025)0.00
- Stackelberg Games For Learning Emergent Behaviors During Competitive Autocurricula (2023)5.84
- Policyevolve: Evolving Programmatic Policies By Llms For Multi-player Games Via Population-based Training (2025)0.00
- Discovering Multiagent Learning Algorithms With Large Language Models (2026)2.05
- Game Theory And Multi-agent Reinforcement Learning : From Nash Equilibria To Evolutionary Dynamics (2024)0.00