HMRL: Hyper-meta Learning For Sparse Reward Reinforcement Learning Problem
2020 Β· Yun Hua, Xiangfeng Wang, Bo Jin, et al.
Abstract
In spite of the success of existing meta reinforcement learning methods, they still have difficulty in learning a meta policy effectively for RL problems with sparse reward. In this respect, we develop a novel meta reinforcement learning framework called Hyper-Meta RL(HMRL), for sparse reward RL problems. It is consisted with three modules including the cross-environment meta state embedding module which constructs a common meta state space to adapt to different environments; the meta state based environment-specific meta reward shaping which effectively extends the original sparse reward trajectory by cross-environmental knowledge complementarity and as a consequence the meta policy achieves better generalization and efficiency with the shaped meta reward. Experiments with sparse-reward environments show the superiority of HMRL on both transferability and policy learning efficiency.
Authors
(none)
Tags
Stats
Related papers
- Exploration In Approximate Hyper-state Space For Meta Reinforcement Learning (2020)0.00
- Boosting Hierarchical Reinforcement Learning With Meta-learning For Complex Task Adaptation (2024)0.00
- Learning Action Translator For Meta Reinforcement Learning On Sparse-reward Tasks (2022)4.52
- A Tutorial On Meta-reinforcement Learning (2023)10.85
- RL\(^3\): Boosting Meta Reinforcement Learning Via RL Inside RL\(^2\) (2023)0.00
- Learning To Reinforcement Learn (2016)0.00
- Context Meta-reinforcement Learning Via Neuromodulation (2021)6.34
- Guided Meta-policy Search (2019)0.00