Safe Multi-agent Reinforcement Learning With Convergence To Generalized Nash Equilibrium
2024 Β· Zeyang Li, Navid Azizan
Abstract
Multi-agent reinforcement learning (MARL) has achieved notable success in cooperative tasks, demonstrating impressive performance and scalability. However, deploying MARL agents in real-world applications presents critical safety challenges. Current safe MARL algorithms are largely based on the constrained Markov decision process (CMDP) framework, which enforces constraints only on discounted cumulative costs and lacks an all-time safety assurance. Moreover, these methods often overlook the feasibility issue (the system will inevitably violate state constraints within certain regions of the constraint set), resulting in either suboptimal performance or increased constraint violations. To address these challenges, we propose a novel theoretical framework for safe MARL with \(\textit\{state-wise\}\) constraints, where safety requirements are enforced at every state the agents visit. To resolve the feasibility issue, we leverage a control-theoretic notion of the feasible region, the contr
Authors
(none)
Tags
Stats
Related papers
- Multi-agent Constrained Policy Optimisation (2021)0.00
- Deepsafempc: Deep Learning-based Model Predictive Control For Safe Multi-agent Reinforcement Learning (2024)0.00
- Multi-agent Reinforcement Learning In Stochastic Networked Systems (2020)0.00
- Matrixworld: A Pursuit-evasion Platform For Safe Multi-agent Coordination And Autocurricula (2023)0.00
- Risk-sensitive Multi-agent Reinforcement Learning In Network Aggregative Markov Games (2024)0.00
- Diffusion Models For Offline Multi-agent Reinforcement Learning With Safety Constraints (2024)0.00
- Certified Policy Smoothing For Cooperative Multi-agent Reinforcement Learning (2022)5.24
- Risk-aware Distributed Multi-agent Reinforcement Learning (2023)3.58