Role Play: Learning Adaptive Role-specific Strategies In Multi-agent Interactions
2024 Β· Weifan Long, Wen Wen, Peng Zhai, et al.
Abstract
Zero-shot coordination problem in multi-agent reinforcement learning (MARL), which requires agents to adapt to unseen agents, has attracted increasing attention. Traditional approaches often rely on the Self-Play (SP) framework to generate a diverse set of policies in a policy pool, which serves to improve the generalization capability of the final agent. However, these frameworks may struggle to capture the full spectrum of potential strategies, especially in real-world scenarios that demand agents balance cooperation with competition. In such settings, agents need strategies that can adapt to varying and often conflicting goals. Drawing inspiration from Social Value Orientation (SVO)-where individuals maintain stable value orientations during interactions with others-we propose a novel framework called *Role Play* (RP). RP employs role embeddings to transform the challenge of policy diversity into a more manageable diversity of roles. It trains a common policy with role embedding obs
Authors
(none)
Tags
Stats
Related papers
- Heterogeneous Multi-agent Reinforcement Learning For Zero-shot Scalable Collaboration (2024)6.34
- Attention-guided Contrastive Role Representations For Multi-agent Reinforcement Learning (2023)3.64
- Hypermarl: Adaptive Hypernetworks For Multi-agent RL (2024)0.00
- Policy Diagnosis Via Measuring Role Diversity In Cooperative Multi-agent RL (2022)0.00
- Learning Zero-shot Cooperation With Humans, Assuming Humans Are Biased (2023)0.00
- "other-play" For Zero-shot Coordination (2020)0.00
- RODE: Learning Roles To Decompose Multi-agent Tasks (2020)0.00
- Spacer: Self-play Anchoring With Centralized Reference Models (2025)0.00