State-conditioned Adversarial Subgoal Generation
2022 · Vivienne Huiling Wang, Joni Pajarinen, Tinghuai Wang, et al.
Abstract
Hierarchical reinforcement learning (HRL) proposes to solve difficult tasks by performing decision-making and control at successively higher levels of temporal abstraction. However, off-policy HRL often suffers from a non-stationary high-level policy, because the low-level policy is constantly changing. In this paper, we propose a novel HRL approach that mitigates this non-stationarity by adversarially training the high-level policy to generate subgoals compatible with the current instantiation of the low-level policy. In practice, the adversarial learning is implemented by training a simple state-conditioned discriminator network, which determines the compatibility level of subgoals, concurrently with the high-level policy. Comparison to state-of-the-art algorithms shows that our approach improves both learning efficiency and performance in challenging continuous control tasks.
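The adversarial scheme described in the abstract can be illustrated with a toy sketch. The abstract does not specify the discriminator architecture, the loss, or how compatible subgoals are sampled, so everything below is an assumption for illustration only: a one-dimensional state, a logistic discriminator over hypothetical features of the subgoal offset `g - s`, subgoals the low-level policy actually reached labelled as compatible, and a hypothetical high-level policy `g = s + theta` with a single parameter. It is not the authors' implementation.

```python
import math


def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def features(s, g):
    # Assumed state-conditioned input: the subgoal is seen relative to the state.
    d = g - s
    return [1.0, d, d * d]


def discriminator(w, s, g):
    # Estimated probability that subgoal g is compatible with the low level at state s.
    return sigmoid(sum(wi * xi for wi, xi in zip(w, features(s, g))))


def disc_step(w, s, g_reached, g_proposed, lr=0.1):
    # One gradient step on binary cross-entropy:
    # a subgoal the low level reached is labelled 1, a proposed subgoal 0.
    grad = [0.0, 0.0, 0.0]
    for g, label in ((g_reached, 1.0), (g_proposed, 0.0)):
        p = discriminator(w, s, g)
        for i, xi in enumerate(features(s, g)):
            grad[i] += (p - label) * xi
    return [wi - lr * gi for wi, gi in zip(w, grad)]


def high_level_step(theta, w, s, lr=0.1):
    # One gradient step on the adversarial loss -log D(s, s + theta) w.r.t. theta.
    p = discriminator(w, s, s + theta)
    grad = -(1.0 - p) * (w[1] + 2.0 * w[2] * theta)
    return theta - lr * grad


# One alternating update: the low level reached s + 0.5, the high level proposed s + 2.0.
w, theta, s = [0.0, 0.0, 0.0], 2.0, 0.0
w = disc_step(w, s, g_reached=s + 0.5, g_proposed=s + theta)
theta = high_level_step(theta, w, s)
```

In this sketch the discriminator update makes the reached subgoal score higher than the proposed one, and the high-level update then moves `theta` toward the region the discriminator rates as compatible. In a full method this adversarial term would presumably be combined with the usual high-level reinforcement learning objective; the sketch omits that part and makes no claim about convergence.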
Related papers
- Generating Adjacency-constrained Subgoals In Hierarchical Reinforcement Learning (2020)
- Learning Representations In Model-free Hierarchical Reinforcement Learning (2018)
- Subgoal-based Hierarchical Reinforcement Learning For Multi-agent Collaboration (2024)
- Learning And Exploiting Multiple Subgoals For Fast Exploration In Hierarchical Reinforcement Learning (2019)
- Bidirectional-reachable Hierarchical Reinforcement Learning With Mutually Responsive Policies (2024)
- Goal Space Abstraction In Hierarchical Reinforcement Learning Via Reachability Analysis (2023)
- MENTOR: Guiding Hierarchical Reinforcement Learning With Human Feedback And Dynamic Distance Constraint (2024)
- Near-optimal Representation Learning For Hierarchical Reinforcement Learning (2018)