Malthusian Reinforcement Learning
2018 Β· Joel Z. Leibo, Julien Perolat, Edward Hughes, et al.
Abstract
Here we explore a new algorithmic framework for multi-agent reinforcement learning, called Malthusian reinforcement learning, which extends self-play to include fitness-linked population size dynamics that drive ongoing innovation. In Malthusian RL, increases in a subpopulation's average return drive subsequent increases in its size, just as Thomas Malthus argued in 1798 was the relationship between preindustrial income levels and population growth. Malthusian reinforcement learning harnesses the competitive pressures arising from growing and shrinking population size to drive agents to explore regions of state and policy spaces that they could not otherwise reach. Furthermore, in environments where there are potential gains from specialization and division of labor, we show that Malthusian reinforcement learning is better positioned to take advantage of such synergies than algorithms based on self-play.
Authors
(none)
Tags
Stats
Related papers
- Evolution Of Societies Via Reinforcement Learning (2024)0.00
- Evolutionary Population Curriculum For Scaling Multi-agent Reinforcement Learning (2020)0.00
- Inclusive Fitness As A Key Step Towards More Advanced Social Behaviors In Multi-agent Reinforcement Learning Settings (2025)0.00
- Population-aware Online Mirror Descent For Mean-field Games By Deep Reinforcement Learning (2024)0.00
- Malib: A Parallel Framework For Population-based Multi-agent Reinforcement Learning (2021)0.00
- The Evolutionary Dynamics Of Independent Learning Agents In Population Games (2020)0.00
- Neural Auto-curricula (2021)0.00
- Algorithms In Multi-agent Systems: A Holistic Perspective From Reinforcement Learning And Game Theory (2020)0.00