Wonder Wins Ways: Curiosity-driven Exploration Through Multi-agent Contextual Calibration
2025 Β· Yiyuan Pan, Zhe Liu, Hesheng Wang
Abstract
Autonomous exploration in complex multi-agent reinforcement learning (MARL) with sparse rewards critically depends on providing agents with effective intrinsic motivation. While artificial curiosity offers a powerful self-supervised signal, it often confuses environmental stochasticity with meaningful novelty. Moreover, existing curiosity mechanisms exhibit a uniform novelty bias, treating all unexpected observations equally. However, peer behavior novelty, which encode latent task dynamics, are often overlooked, resulting in suboptimal exploration in decentralized, communication-free MARL settings. To this end, inspired by how human children adaptively calibrate their own exploratory behaviors via observing peers, we propose a novel approach to enhance multi-agent exploration. We introduce CERMIC, a principled framework that empowers agents to robustly filter noisy surprise signals and guide exploration by dynamically calibrating their intrinsic curiosity with inferred multi-agent con
Authors
(none)
Tags
Stats
Related papers
- Curiosity-driven Multi-agent Exploration With Mixed Objectives (2022)0.00
- Episodic Multi-agent Reinforcement Learning With Curiosity-driven Exploration (2021)0.00
- Curiosity-driven Exploration Via Latent Bayesian Surprise (2021)0.00
- Settling Decentralized Multi-agent Coordinated Exploration By Novelty Sharing (2024)4.52
- A Unified Strategy For Implementing Curiosity And Empowerment Driven Reinforcement Learning (2018)0.00
- Is Curiosity All You Need? On The Utility Of Emergent Behaviours From Curious Exploration (2021)0.00
- Dynamic Memory-based Curiosity: A Bootstrap Approach For Exploration (2022)0.00
- Prototyping Three Key Properties Of Specific Curiosity In Computational Reinforcement Learning (2022)0.00