LIGS: Learnable Intrinsic-reward Generation Selection For Multi-agent Learning
2021 Β· David Henry Mguni, Taher Jafferjee, Jianhong Wang, et al.
Abstract
Efficient exploration is important for reinforcement learners to achieve high rewards. In multi-agent systems, coordinated exploration and behaviour is critical for agents to jointly achieve optimal outcomes. In this paper, we introduce a new general framework for improving coordination and performance of multi-agent reinforcement learners (MARL). Our framework, named Learnable Intrinsic-Reward Generation Selection algorithm (LIGS) introduces an adaptive learner, Generator that observes the agents and learns to construct intrinsic rewards online that coordinate the agents' joint exploration and joint behaviour. Using a novel combination of MARL and switching controls, LIGS determines the best states to learn to add intrinsic rewards which leads to a highly efficient learning process. LIGS can subdivide complex tasks making them easier to solve and enables systems of MARL agents to quickly solve environments with sparse rewards. LIGS can seamlessly adopt existing MARL algorithms and, ou
Authors
(none)
Tags
Stats
Related papers
- Coordinated Exploration Via Intrinsic Rewards For Multi-agent Reinforcement Learning (2019)0.00
- Individual Contributions As Intrinsic Exploration Scaffolds For Multi-agent Reinforcement Learning (2024)2.80
- Influence-based Reinforcement Learning For Intrinsically-motivated Agents (2021)0.00
- Towards Agentic Self-learning Llms In Search Environment (2025)0.00
- LERO: Llm-driven Evolutionary Framework With Hybrid Rewards And Enhanced Observation For Multi-agent Reinforcement Learning (2025)3.58
- Individual Specialization In Multi-task Environments With Multiagent Reinforcement Learners (2019)0.00
- REMAX: Relational Representation For Multi-agent Exploration (2020)2.26
- Discovering Individual Rewards In Collective Behavior Through Inverse Multi-agent Reinforcement Learning (2023)0.00