The Yokai Learning Environment: Tracking Beliefs Over Space And Time
2025 · Constantin Ruhdorfer, Matteo Bortoletto, Johannes Forkel, et al.
Abstract
The ability to cooperate with unknown partners is a central challenge in cooperative AI and widely studied in the form of zero-shot coordination (ZSC), which evaluates an algorithm by measuring the performance of independently trained agents when paired. The Hanabi Learning Environment (HLE) has become the dominant benchmark for ZSC, but recent work has achieved near-perfect inter-seed cross-play performance, limiting its ability to track algorithmic progress. We introduce the Yokai Learning Environment (YLE) - an open-source multi-agent RL benchmark in which effective collaboration requires building common ground by tracking and updating beliefs over moving cards, reasoning under ambiguous hints, and deciding when to terminate the game based on inferred shared knowledge - features absent in the HLE, where beliefs are tied to hand slots and hints are truthful by rule. We evaluate the leading ZSC methods, including High-Entropy IPPO, Other-Play, and Off-Belief Learning, which achieve ne
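The ZSC evaluation described in the abstract, pairing independently trained agents and measuring their joint performance, is commonly summarized as a cross-play matrix. The following is a minimal sketch of that summary, not the authors' evaluation code: the function name `cross_play_summary` is hypothetical, and it assumes you already have a square matrix in which entry (i, j) is the mean return when the agent from training seed i is paired with the agent from training seed j.

```python
import numpy as np


def cross_play_summary(returns):
    """Summarize a square matrix of pairwise mean returns.

    returns[i][j] is assumed to be the mean episode return when the
    agent from training seed i plays with the agent from seed j.
    """
    returns = np.asarray(returns, dtype=float)
    if returns.ndim != 2 or returns.shape[0] != returns.shape[1]:
        raise ValueError("expected a square matrix of pairwise returns")
    n = returns.shape[0]
    if n < 2:
        raise ValueError("need at least two seeds to measure cross-play")

    diagonal = np.eye(n, dtype=bool)
    # Diagonal entries: each agent paired with its own training partner.
    self_play = float(returns[diagonal].mean())
    # Off-diagonal entries: agents from different seeds paired together.
    cross_play = float(returns[~diagonal].mean())
    return {
        "self_play": self_play,
        "cross_play": cross_play,
        "gap": self_play - cross_play,
    }


# Illustrative numbers only; these are not results from the paper.
example = [
    [10.0, 4.0, 5.0],
    [4.0, 12.0, 3.0],
    [5.0, 3.0, 11.0],
]
summary = cross_play_summary(example)
```

A large gap between the self-play and cross-play means indicates that agents have learned seed-specific conventions that do not transfer to independently trained partners. A benchmark where that gap is already close to zero for existing methods leaves little room to distinguish new ones.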
Related papers
- Generalized Beliefs For Cooperative AI (2022)
- "other-play" For Zero-shot Coordination (2020)
- Human-ai Coordination Via Human-regularized Search And Learning (2022)
- Evaluation Of Human-ai Teams For Learned And Rule-based Agents In Hanabi (2021)
- Simplified Action Decoder For Deep Multi-agent Reinforcement Learning (2019)
- Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In The Game Of Hanabi (2023)
- Tackling Cooperative Incompatibility For Zero-shot Human-ai Coordination (2023)
- Behavioral Differences Is The Key Of Ad-hoc Team Cooperation In Multiplayer Games Hanabi (2023)