Hierarchical Reinforcement Learning With Opponent Modeling For Distributed Multi-agent Cooperation
2022 Β· Zhixuan Liang, Jiannong Cao, Shan Jiang, et al.
Abstract
Many real-world applications can be formulated as multi-agent cooperation problems, such as network packet routing and coordination of autonomous vehicles. The emergence of deep reinforcement learning (DRL) provides a promising approach for multi-agent cooperation through the interaction of the agents and environments. However, traditional DRL solutions suffer from the high dimensions of multiple agents with continuous action space during policy search. Besides, the dynamicity of agents' policies makes the training non-stationary. To tackle the issues, we propose a hierarchical reinforcement learning approach with high-level decision-making and low-level individual control for efficient policy search. In particular, the cooperation of multiple agents can be learned in high-level discrete action space efficiently. At the same time, the low-level individual control can be reduced to single-agent reinforcement learning. In addition to hierarchical reinforcement learning, we propose an opp
Authors
(none)
Tags
Stats
Related papers
- Hierarchical Reinforcement Learning For Optimal Agent Grouping In Cooperative Systems (2025)0.00
- HAVEN: Hierarchical Cooperative Multi-agent Reinforcement Learning With Dual Coordination Mechanism (2021)0.00
- Subgoal-based Hierarchical Reinforcement Learning For Multi-agent Collaboration (2024)0.00
- Deep Multi-agent Reinforcement Learning With Discrete-continuous Hybrid Action Spaces (2019)12.47
- Optimization For Reinforcement Learning: From Single Agent To Cooperative Agents (2019)14.62
- Fully Decentralized Cooperative Multi-agent Reinforcement Learning: A Survey (2024)0.00
- A New Framework For Multi-agent Reinforcement Learning -- Centralized Training And Exploration With Decentralized Execution Via Policy Distillation (2019)0.00
- Strategic Coordination For Evolving Multi-agent Systems: A Hierarchical Reinforcement And Collective Learning Approach (2025)0.00