RMIO: A Model-based MARL Framework For Scenarios With Observation Loss In Some Agents
2024 Β· Zifeng Shi, Meiqin Liu, Senlin Zhang, et al.
Abstract
In recent years, model-based reinforcement learning (MBRL) has emerged as a solution to address sample complexity in multi-agent reinforcement learning (MARL) by modeling agent-environment dynamics to improve sample efficiency. However, most MBRL methods assume complete and continuous observations from each agent during the inference stage, which can be overly idealistic in practical applications. A novel model-based MARL approach called RMIO is introduced to address this limitation, specifically designed for scenarios where observation is lost in some agent. RMIO leverages the world model to reconstruct missing observations, and further reduces reconstruction errors through inter-agent information integration to ensure stable multi-agent decision-making. Secondly, unlike CTCE methods such as MAMBA, RMIO adopts the CTDE paradigm in standard environment, and enabling limited communication only when agents lack observation data, thereby reducing reliance on communication. Additionally, R
Authors
(none)
Tags
Stats
Related papers
- Inducing Cooperation Via Team Regret Minimization Based Multi-agent Deep Reinforcement Learning (2019)0.00
- LERO: Llm-driven Evolutionary Framework With Hybrid Rewards And Enhanced Observation For Multi-agent Reinforcement Learning (2025)3.58
- Model-based Multi-agent Reinforcement Learning: Recent Progress And Prospects (2022)0.00
- Cooperative Multi-agent RL With Communication Constraints (2026)0.00
- Incentivize Without Bonus: Provably Efficient Model-based Online Multi-agent RL For Markov Games (2025)0.00
- Remembering The Markov Property In Cooperative MARL (2025)0.00
- Causal Model-based Reinforcement Learning For Sample-efficient Iot Channel Access (2025)0.00
- An Analysis Of Model-based Reinforcement Learning From Abstracted Observations (2022)0.00