Malib: A Parallel Framework For Population-based Multi-agent Reinforcement Learning
2021 Β· Ming Zhou, Ziyu Wan, Hanjing Wang, et al.
Abstract
Population-based multi-agent reinforcement learning (PB-MARL) refers to the series of methods nested with reinforcement learning (RL) algorithms, which produces a self-generated sequence of tasks arising from the coupled population dynamics. By leveraging auto-curricula to induce a population of distinct emergent strategies, PB-MARL has achieved impressive success in tackling multi-agent tasks. Despite remarkable prior arts of distributed RL frameworks, PB-MARL poses new challenges for parallelizing the training frameworks due to the additional complexity of multiple nested workloads between sampling, training and evaluation involved with heterogeneous policy interactions. To solve these problems, we present MALib, a scalable and efficient computing framework for PB-MARL. Our framework is comprised of three key components: (1) a centralized task dispatching model, which supports the self-generated tasks and scalable training with heterogeneous policy combinations; (2) a programming arc
Authors
(none)
Tags
Stats
Related papers
- Marllib: A Scalable And Efficient Multi-agent Reinforcement Learning Library (2022)0.00
- Benchmarking Multi-agent Deep Reinforcement Learning Algorithms In Cooperative Tasks (2020)0.00
- MARL-LNS: Cooperative Multi-agent Reinforcement Learning Via Large Neighborhoods Search (2024)0.00
- Efficient Distributed Framework For Collaborative Multi-agent Reinforcement Learning (2022)0.00
- Towards Skilled Population Curriculum For Multi-agent Reinforcement Learning (2023)0.00
- Evolution Of Societies Via Reinforcement Learning (2024)0.00
- Characterizing Speed Performance Of Multi-agent Reinforcement Learning (2023)4.52
- Hypermarl: Adaptive Hypernetworks For Multi-agent RL (2024)0.00