Learning Meta Representations For Agents In Multi-agent Reinforcement Learning
2021 Β· Shenao Zhang, Li Shen, Lei Han, et al.
Abstract
In multi-agent reinforcement learning, the behaviors that agents learn in a single Markov Game (MG) are typically confined to the given agent number. Every single MG induced by varying the population may possess distinct optimal joint strategies and game-specific knowledge, which are modeled independently in modern multi-agent reinforcement learning algorithms. In this work, our focus is on creating agents that can generalize across population-varying MGs. Instead of learning a unimodal policy, each agent learns a policy set comprising effective strategies across a variety of games. To achieve this, we propose Meta Representations for Agents (MRA) that explicitly models the game-common and game-specific strategic knowledge. By representing the policy sets with multi-modal latent policies, the game-common strategic knowledge and diverse strategic modes are discovered through an iterative optimization procedure. We prove that by approximately maximizing the resulting constrained mutual i
Authors
(none)
Tags
Stats
Related papers
- Learning Policy Representations In Multiagent Systems (2018)0.00
- Extended Markov Games To Learn Multiple Tasks In Multi-agent Reinforcement Learning (2020)3.58
- Minimax-optimal Multi-agent RL In Markov Games With A Generative Model (2022)2.26
- Breaking The Curse Of Multiagency In Robust Multi-agent Reinforcement Learning (2024)0.00
- Meta-value Learning: A General Framework For Learning With Learning Awareness (2023)0.00
- Incentivize Without Bonus: Provably Efficient Model-based Online Multi-agent RL For Markov Games (2025)0.00
- A Policy Gradient Algorithm For Learning To Learn In Multiagent Reinforcement Learning (2020)0.00
- Generative Evolutionary Meta-solver (GEMS): Scalable Surrogate-free Multi-agent Reinforcement Learning (2025)0.00