MIR: Efficient Exploration In Episodic Multi-agent Reinforcement Learning Via Mutual Intrinsic Reward
2025 Β· Kesheng Chen, Wenjian Luo, Bang Zhang, et al.
Abstract
Episodic rewards present a significant challenge in reinforcement learning. While intrinsic reward methods have demonstrated effectiveness in single-agent rein-forcement learning scenarios, their application to multi-agent reinforcement learn-ing (MARL) remains problematic. The primary difficulties stem from two fac-tors: (1) the exponential sparsity of joint action trajectories that lead to rewards as the exploration space expands, and (2) existing methods often fail to account for joint actions that can influence team states. To address these challenges, this paper introduces Mutual Intrinsic Reward (MIR), a simple yet effective enhancement strategy for MARL with extremely sparse rewards like episodic rewards. MIR incentivizes individual agents to explore actions that affect their teammates, and when combined with original strategies, effectively stimulates team exploration and improves algorithm performance. For comprehensive experimental valida-tion, we extend the representative si
Authors
(none)
Tags
Stats
Related papers
- Episodic Multi-agent Reinforcement Learning With Curiosity-driven Exploration (2021)0.00
- DEIR: Efficient And Robust Exploration Through Discriminative-model-based Episodic Intrinsic Rewards (2023)0.00
- Coordinated Exploration Via Intrinsic Rewards For Multi-agent Reinforcement Learning (2019)0.00
- Individual Contributions As Intrinsic Exploration Scaffolds For Multi-agent Reinforcement Learning (2024)2.80
- Influence-based Reinforcement Learning For Intrinsically-motivated Agents (2021)0.00
- REMAX: Relational Representation For Multi-agent Exploration (2020)2.26
- On Feasible Rewards In Multi-agent Inverse Reinforcement Learning (2024)0.00
- Never Explore Repeatedly In Multi-agent Reinforcement Learning (2023)0.00