MABL: Bi-level Latent-variable World Model For Sample-efficient Multi-agent Reinforcement Learning
2023 · Aravind Venugopal, Stephanie Milani, Fei Fang, et al.
Abstract
Multi-agent reinforcement learning (MARL) methods often suffer from high sample complexity, limiting their use in real-world problems where data is sparse or expensive to collect. Although latent-variable world models have been employed to address this issue by generating abundant synthetic data for MARL training, most of these models cannot encode vital global information available during training into their latent states, which hampers learning efficiency. The few exceptions that incorporate global information assume centralized execution of their learned policies, which is impractical in many applications with partial observability. We propose a novel model-based MARL algorithm, MABL (Multi-Agent Bi-Level world model), that learns a bi-level latent-variable world model from high-dimensional inputs. Unlike existing models, MABL is capable of encoding essential global information into the latent states during training while guaranteeing the decentralized execution of learned policies.
Related papers
- GAWM: Global-aware World Model For Multi-agent Reinforcement Learning (2025)
- Decentralized Transformers With Centralized Aggregation Are Sample-efficient Multi-agent World Models (2024)
- Hierarchical Deep Multiagent Reinforcement Learning With Temporal Abstraction (2018)
- Model-based Multi-agent Reinforcement Learning: Recent Progress And Prospects (2022)
- Representation Learning For Efficient Deep Multi-agent Reinforcement Learning (2024)
- Efficient Model-based Multi-agent Reinforcement Learning Via Optimistic Equilibrium Computation (2022)
- Malib: A Parallel Framework For Population-based Multi-agent Reinforcement Learning (2021)
- World Models As An Intermediary Between Agents And The Real World (2026)