Multi-agent Actor-critic For Mixed Cooperative-competitive Environments
2017 · Ryan Lowe, Yi Wu, Aviv Tamar, et al.
Abstract
We explore deep reinforcement learning methods for multi-agent domains. We begin by analyzing the difficulty of traditional algorithms in the multi-agent case: Q-learning is challenged by an inherent non-stationarity of the environment, while policy gradient suffers from a variance that increases as the number of agents grows. We then present an adaptation of actor-critic methods that considers action policies of other agents and is able to successfully learn policies that require complex multi-agent coordination. Additionally, we introduce a training regimen utilizing an ensemble of policies for each agent that leads to more robust multi-agent policies. We show the strength of our approach compared to existing methods in cooperative as well as competitive scenarios, where agent populations are able to discover various physical and informational coordination strategies.
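The abstract's idea of an actor-critic method that "considers action policies of other agents" can be illustrated with a toy sketch. The Python/NumPy code below is only an assumption-laden illustration, not the authors' implementation: each actor sees just its own observation, while each critic scores the joint observations and actions of all agents. All names and sizes (`N_AGENTS`, `act`, `q_value`, the linear critic) are hypothetical, and the linear maps stand in for the neural networks a real system would use.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical problem sizes chosen only for illustration.
N_AGENTS, OBS_DIM, ACT_DIM = 3, 4, 2

# Decentralized actors: each maps only its own observation to an action.
actor_weights = [rng.normal(size=(OBS_DIM, ACT_DIM)) for _ in range(N_AGENTS)]

def act(agent, obs):
    """Deterministic action of one agent from its local observation."""
    return np.tanh(obs @ actor_weights[agent])

# Centralized critics: each one sees every agent's observation and action.
JOINT_DIM = N_AGENTS * (OBS_DIM + ACT_DIM)
critic_weights = [rng.normal(size=JOINT_DIM) for _ in range(N_AGENTS)]

def q_value(agent, all_obs, all_actions):
    """Linear stand-in for one agent's critic over the joint input."""
    joint = np.concatenate([np.concatenate(all_obs), np.concatenate(all_actions)])
    return float(joint @ critic_weights[agent])

# One evaluation step with random observations.
all_obs = [rng.normal(size=OBS_DIM) for _ in range(N_AGENTS)]
all_actions = [act(i, all_obs[i]) for i in range(N_AGENTS)]
q_values = [q_value(i, all_obs, all_actions) for i in range(N_AGENTS)]
```

Because every critic receives the other agents' actions as input, a change in another agent's behavior shows up as a change in the critic's input rather than as unexplained drift in the environment, which is one way to read the non-stationarity concern raised in the abstract.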
Related papers
- Actor-attention-critic For Multi-agent Reinforcement Learning (2018) 0.00
- Local Advantage Actor-critic For Robust Multi-agent Deep Reinforcement Learning (2021) 7.81
- Bi-level Actor-critic For Multi-agent Coordination (2019) 0.00
- A Multi-agent Off-policy Actor-critic Algorithm For Distributed Reinforcement Learning (2019) 11.39
- SA-MATD3: Self-attention-based Multi-agent Continuous Control Method In Cooperative Environments (2021) 11.76
- Actor-critic Algorithms For Constrained Multi-agent Reinforcement Learning (2019) 0.00
- Actor-critic Policy Optimization In Partially Observable Multiagent Environments (2018) 0.00
- Decomposed Soft Actor-critic Method For Cooperative Multi-agent Reinforcement Learning (2021) 0.00