A Survey Of Learning In Multiagent Environments: Dealing With Non-stationarity
2017 Β· Pablo Hernandez-Leal, Michael Kaisers, Tim Baarslag, et al.
Abstract
The key challenge in multiagent learning is learning a best response to the behaviour of other agents, which may be non-stationary: if the other agents adapt their strategy as well, the learning target moves. Disparate streams of research have approached non-stationarity from several angles, which make a variety of implicit assumptions that make it hard to keep an overview of the state of the art and to validate the innovation and significance of new works. This survey presents a coherent overview of work that addresses opponent-induced non-stationarity with tools from game theory, reinforcement learning and multi-armed bandits. Further, we reflect on the principle approaches how algorithms model and cope with this non-stationarity, arriving at a new framework and five categories (in increasing order of sophistication): ignore, forget, respond to target models, learn models, and theory of mind. A wide range of state-of-the-art algorithms is classified into a taxonomy, using these categ
Authors
(none)
Tags
Stats
Related papers
- A Black-box Approach For Non-stationary Multi-agent Reinforcement Learning (2023)0.00
- Non-cooperative Multi-agent Systems With Exploring Agents (2020)0.00
- Developing, Evaluating And Scaling Learning Agents In Multi-agent Environments (2022)2.26
- Independent Learning In Stochastic Games (2021)6.77
- A Policy Gradient Algorithm For Learning To Learn In Multiagent Reinforcement Learning (2020)0.00
- Deep Multiagent Reinforcement Learning: Challenges And Directions (2021)0.00
- Learning To Model Opponent Learning (2020)0.00
- Algorithms In Multi-agent Systems: A Holistic Perspective From Reinforcement Learning And Game Theory (2020)0.00