Dynamic Subgoal-based Exploration Via Bayesian Optimization
2019 · Yijia Wang, Matthias Poloczek, Daniel R. Jiang
Abstract
Reinforcement learning in sparse-reward navigation environments with expensive and limited interactions is challenging and poses a need for effective exploration. Motivated by complex navigation tasks that require real-world training (when cheap simulators are not available), we consider an agent that faces an unknown distribution of environments and must decide on an exploration strategy. It may leverage a series of training environments to improve its policy before it is evaluated in a test environment drawn from the same environment distribution. Most existing approaches focus on fixed exploration strategies, while the few that view exploration as a meta-optimization problem tend to ignore the need for cost-efficient exploration. We propose a cost-aware Bayesian optimization approach that efficiently searches over a class of dynamic subgoal-based exploration strategies. The algorithm adjusts a variety of levers -- the locations of the subgoals, the length of each episode, and the nu
Related papers
- Fast Active Learning For Pure Exploration In Reinforcement Learning (2020)
- Never Give Up: Learning Directed Exploration Strategies (2020)
- Multi-objective Model-based Policy Search For Data-efficient Learning With Sparse Rewards (2018)
- Long-term Visitation Value For Deep Exploration In Sparse Reward Reinforcement Learning (2020)
- Exploration Conscious Reinforcement Learning Revisited (2018)
- Coordinated Exploration Via Intrinsic Rewards For Multi-agent Reinforcement Learning (2019)
- Model-based Active Exploration (2018)
- DQN With Model-based Exploration: Efficient Learning On Environments With Sparse Rewards (2019)