Matrixworld: A Pursuit-evasion Platform For Safe Multi-agent Coordination And Autocurricula
2023 Β· Lijun Sun, Yu-Cheng Chang, Chao Lyu, et al.
Abstract
Multi-agent reinforcement learning (MARL) achieves encouraging performance in solving complex tasks. However, the safety of MARL policies is one critical concern that impedes their real-world applications. Popular multi-agent benchmarks focus on diverse tasks yet provide limited safety support. Therefore, this work proposes a safety-constrained multi-agent environment: MatrixWorld, based on the general pursuit-evasion game. Particularly, a safety-constrained multi-agent action execution model is proposed for the software implementation of safe multi-agent environments based on diverse safety definitions. It (1) extends the vertex conflict among homogeneous / cooperative agents to heterogeneous / adversarial settings, and (2) proposes three types of resolutions for each type of conflict, aiming at providing rational and unbiased feedback for safe MARL. Besides, MatrixWorld is also a lightweight co-evolution framework for the learning of pursuit tasks, evasion tasks, or both, where more
Authors
(none)
Tags
Stats
Related papers
- Safe Multi-agent Reinforcement Learning With Convergence To Generalized Nash Equilibrium (2024)0.00
- Multi-agent Constrained Policy Optimisation (2021)0.00
- ROMAX: Certifiably Robust Deep Multiagent Reinforcement Learning Via Convex Relaxation (2021)0.00
- SUB-PLAY: Adversarial Policies Against Partially Observed Multi-agent Reinforcement Learning Systems (2024)0.00
- Environment Complexity And Nash Equilibria In A Sequential Social Dilemma (2024)0.00
- MESA: Cooperative Meta-exploration In Multi-agent Learning Through Exploiting State-action Space Structure (2024)2.26
- Scalable Reinforcement Learning Policies For Multi-agent Control (2020)10.21
- Marllib: A Scalable And Efficient Multi-agent Reinforcement Learning Library (2022)0.00