Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In The Game Of Hanabi
2023 Β· Hadi Nekoei, Xutong Zhao, Janarthanan Rajendran, et al.
Abstract
Cooperative Multi-agent Reinforcement Learning (MARL) algorithms with Zero-Shot Coordination (ZSC) have gained significant attention in recent years. ZSC refers to the ability of agents to coordinate zero-shot (without additional interaction experience) with independently trained agents. While ZSC is crucial for cooperative MARL agents, it might not be possible for complex tasks and changing environments. Agents also need to adapt and improve their performance with minimal interaction with other agents. In this work, we show empirically that state-of-the-art ZSC algorithms have poor performance when paired with agents trained with different learning methods, and they require millions of interaction samples to adapt to these new partners. To investigate this issue, we formally defined a framework based on a popular cooperative multi-agent game called Hanabi to evaluate the adaptability of MARL methods. In particular, we created a diverse set of pre-trained agents and defined a new metri
Authors
(none)
Tags
Stats
Related papers
- Zero Shot Coordination For Sparse Reward Tasks With Diverse Reward Shapings (2026)0.00
- "other-play" For Zero-shot Coordination (2020)0.00
- Heterogeneous Multi-agent Zero-shot Coordination By Coevolution (2022)5.24
- Tackling Cooperative Incompatibility For Zero-shot Human-ai Coordination (2023)0.00
- Cross-environment Cooperation Enables Zero-shot Multi-agent Coordination (2025)0.00
- Knowpc: Knowledge-driven Programmatic Reinforcement Learning For Zero-shot Coordination (2024)0.00
- Noisy Zero-shot Coordination: Breaking The Common Knowledge Assumption In Zero-shot Coordination Games (2024)0.00
- Heterogeneous Multi-agent Reinforcement Learning For Zero-shot Scalable Collaboration (2024)6.34