JueWu-MC: Playing Minecraft With Sample-efficient Hierarchical Reinforcement Learning
2021 · Zichuan Lin, Junyou Li, Jianing Shi, et al.
Abstract
Learning rational behaviors in open-world games like Minecraft remains challenging for Reinforcement Learning (RL) research due to the compound challenge of partial observability, high-dimensional visual perception, and delayed reward. To address this, we propose JueWu-MC, a sample-efficient hierarchical RL approach equipped with representation learning and imitation learning to deal with perception and exploration. Specifically, our approach includes two levels of hierarchy, where the high-level controller learns a policy over options and the low-level workers learn to solve each sub-task. To boost the learning of sub-tasks, we propose a combination of techniques including 1) action-aware representation learning, which captures underlying relations between action and representation, 2) discriminator-based self-imitation learning for efficient exploration, and 3) ensemble behavior cloning with consistency filtering for policy robustness. Extensive experiments show that J
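The two-level hierarchy described in the abstract can be illustrated with a toy control loop. This is only a minimal sketch under stated assumptions, not the authors' implementation: the option names, the rule-based `high_level_controller`, and the fixed-length workers built by `make_worker` are all hypothetical stand-ins for policies that the paper learns.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical sub-task names; the abstract does not list the actual options.
OPTIONS = ["gather_wood", "craft_tool", "mine_ore"]


def high_level_controller(done: Dict[str, bool]) -> str:
    """Stand-in for the learned high-level policy: pick the first unfinished option."""
    for option in OPTIONS:
        if not done[option]:
            return option
    raise ValueError("all options are already complete")


def make_worker(steps_needed: int) -> Callable[[int], Tuple[str, bool]]:
    """Stand-in for a learned low-level worker that finishes after a fixed number of steps."""
    def worker(t: int) -> Tuple[str, bool]:
        finished = t + 1 >= steps_needed
        return "noop", finished  # (placeholder action, sub-task finished?)
    return worker


def run_episode(workers: Dict[str, Callable[[int], Tuple[str, bool]]],
                max_steps: int = 100) -> List[str]:
    """Alternate between choosing an option and running its worker until it terminates."""
    done = {name: False for name in OPTIONS}
    trace: List[str] = []
    steps = 0
    while not all(done.values()) and steps < max_steps:
        option = high_level_controller(done)
        trace.append(option)
        t, finished = 0, False
        while not finished and steps < max_steps:
            _action, finished = workers[option](t)
            t += 1
            steps += 1
        done[option] = finished
    return trace


# Hypothetical workers with arbitrary sub-task lengths.
WORKERS = {
    "gather_wood": make_worker(3),
    "craft_tool": make_worker(1),
    "mine_ore": make_worker(2),
}
```

Calling `run_episode(WORKERS)` returns the sequence of options chosen by the controller. In the approach the abstract describes, both levels would be learned rather than hand-written as they are here.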
Related papers
- Improving Deep Reinforcement Learning In Minecraft With Action Advice (2019) · 9.03
- Open-world Multi-task Control Through Goal-aware Representation Learning And Adaptive Horizon Prediction (2023) · 8.09
- Multi-task Curriculum Learning In A Complex, Visual, Hard-exploration Domain: Minecraft (2021) · 0.00
- Explore, Exploit Or Listen: Combining Human Feedback And Policy Model To Speed Up Deep Reinforcement Learning In 3D Worlds (2017) · 0.00
- Hierarchical Reinforcement Learning In Complex 3D Environments (2023) · 0.00
- Learning Representations In Model-free Hierarchical Reinforcement Learning (2018) · 11.49
- Exploratory Gradient Boosting For Reinforcement Learning In Complex Domains (2016) · 0.00
- Hypothesis-driven Skill Discovery For Hierarchical Deep Reinforcement Learning (2019) · 2.26