Communicating Plans, Not Percepts: Scalable Multi-agent Coordination With Embodied World Models
2025 Β· Brennen A. Hill, Mant Koh En Wei, Thangavel Jishnuanandh
Abstract
Robust coordination is critical for effective decision-making in multi-agent systems, especially under partial observability. A central question in Multi-Agent Reinforcement Learning (MARL) is whether to engineer communication protocols or learn them end-to-end. We investigate this dichotomy using embodied world models. We propose and compare two communication strategies for a cooperative task-allocation problem. The first, Learned Direct Communication (LDC), learns a protocol end-to-end. The second, Intention Communication, uses an engineered inductive bias: a compact, learned world model, the Imagined Trajectory Generation Module (ITGM), which uses the agent's own policy to simulate future states. A Message Generation Network (MGN) then compresses this plan into a message. We evaluate these approaches on goal-directed interaction in a grid world, a canonical abstraction for embodied AI problems, while scaling environmental complexity. Our experiments reveal that while emergent commun
Authors
(none)
Tags
Stats
Related papers
- Learning Emergent Discrete Message Communication For Cooperative Reinforcement Learning (2021)5.24
- Robust Multi-agent Reinforcement Learning With Social Empowerment For Coordination And Communication (2020)0.00
- Language-driven Coordination And Learning In Multi-agent Simulation Environments (2025)0.00
- Contextual Knowledge Sharing In Multi-agent Reinforcement Learning With Decentralized Communication And Coordination (2025)0.00
- On The Role Of Emergent Communication For Social Learning In Multi-agent Reinforcement Learning (2023)0.00
- AC2C: Adaptively Controlled Two-hop Communication For Multi-agent Reinforcement Learning (2023)0.00
- Modeling Sensorimotor Coordination As Multi-agent Reinforcement Learning With Differentiable Communication (2019)0.00
- Coordinating Policies Among Multiple Agents Via An Intelligent Communication Channel (2022)0.00