Hierarchical Reinforcement Learning With Advantage-based Auxiliary Rewards
2019 Β· Siyuan Li, Rui Wang, Minxue Tang, et al.
Abstract
Hierarchical Reinforcement Learning (HRL) is a promising approach to solving long-horizon problems with sparse and delayed rewards. Many existing HRL algorithms either use pre-trained low-level skills that are unadaptable, or require domain-specific information to define low-level rewards. In this paper, we aim to adapt low-level skills to downstream tasks while maintaining the generality of reward design. We propose an HRL framework which sets auxiliary rewards for low-level skill training based on the advantage function of the high-level policy. This auxiliary reward enables efficient, simultaneous learning of the high-level policy and low-level skills without using task-specific knowledge. In addition, we also theoretically prove that optimizing low-level skills with this auxiliary reward will increase the task return for the joint policy. Experimental results show that our algorithm dramatically outperforms other state-of-the-art HRL methods in Mujoco domains. We also find both low
Authors
(none)
Tags
Stats
Related papers
- Boosting Hierarchical Reinforcement Learning With Meta-learning For Complex Task Adaptation (2024)0.00
- Hierarchical Reinforcement Learning Via Advantage-weighted Information Maximization (2019)0.00
- Bidirectional-reachable Hierarchical Reinforcement Learning With Mutually Responsive Policies (2024)0.00
- MENTOR: Guiding Hierarchical Reinforcement Learning With Human Feedback And Dynamic Distance Constraint (2024)6.34
- A Provably Efficient Option-based Algorithm For Both High-level And Low-level Learning (2024)0.00
- Deep Reinforcement Learning From Hierarchical Preference Design (2023)2.00
- Subgoal-based Hierarchical Reinforcement Learning For Multi-agent Collaboration (2024)0.00
- Learning Representations In Model-free Hierarchical Reinforcement Learning (2018)11.49