Discovering Individual Rewards In Collective Behavior Through Inverse Multi-agent Reinforcement Learning
2023 · Daniel Waelchli, Pascal Weber, Petros Koumoutsakos
Abstract
The discovery of individual objectives in collective behavior of complex dynamical systems such as fish schools and bacteria colonies is a long-standing challenge. Inverse reinforcement learning is a potent approach for addressing this challenge but its applicability to dynamical systems, involving continuous state-action spaces and multiple interacting agents, has been limited. In this study, we tackle this challenge by introducing an off-policy inverse multi-agent reinforcement learning algorithm (IMARL). Our approach combines the ReF-ER techniques with guided cost learning. By leveraging demonstrations, our algorithm automatically uncovers the reward function and learns an effective policy for the agents. Through extensive experimentation, we demonstrate that the proposed policy captures the behavior observed in the provided data, and achieves promising results across problem domains including single agent models in the OpenAI gym and multi-agent models of schooling behavior. The pr
Related papers
- Influence-based Reinforcement Learning For Intrinsically-motivated Agents (2021)
- Inverse Reinforcement Learning In Swarm Systems (2016)
- Coordinated Exploration Via Intrinsic Rewards For Multi-agent Reinforcement Learning (2019)
- DIFFER: Decomposing Individual Reward For Fair Experience Replay In Multi-agent Reinforcement Learning (2023)
- Multi-agent Inverse Reinforcement Learning: Suboptimal Demonstrations And Alternative Solution Concepts (2021)
- Social Influence As Intrinsic Motivation For Multi-agent Deep Reinforcement Learning (2018)
- LIGS: Learnable Intrinsic-reward Generation Selection For Multi-agent Learning (2021)
- On Feasible Rewards In Multi-agent Inverse Reinforcement Learning (2024)