Automatic Tuning Of Hyper-parameters Of Reinforcement Learning Algorithms Using Bayesian Optimization With Behavioral Cloning
2021 · Juan Cruz Barsce, Jorge A. Palombarini, Ernesto C. Martínez
Abstract
Optimal setting of several hyper-parameters in machine learning algorithms is key to make the most of available data. To this aim, several methods such as evolutionary strategies, random search, Bayesian optimization and heuristic rules of thumb have been proposed. In reinforcement learning (RL), the information content of data gathered by the learning agent while interacting with its environment is heavily dependent on the setting of many hyper-parameters. Therefore, the user of an RL algorithm has to rely on search-based optimization methods, such as grid search or the Nelder-Mead simplex algorithm, that are very inefficient for most RL tasks, slows down significantly the learning curve and leaves to the user the burden of purposefully biasing data gathering. In this work, in order to make an RL algorithm more user-independent, a novel approach for autonomous hyper-parameter setting using Bayesian optimization is proposed. Data from past episodes and different hyper-parameter values
Authors
(none)
Tags
Stats
Related papers
- Hyperparameter Tuning For Deep Reinforcement Learning Applications (2022)0.00
- A Hierarchical Two-tier Approach To Hyper-parameter Optimization In Reinforcement Learning (2019)0.00
- Generalized Population-based Training For Hyperparameter Optimization In Reinforcement Learning (2024)9.59
- Sample-efficient Automated Deep Reinforcement Learning (2020)0.00
- A Framework For History-aware Hyperparameter Optimisation In Reinforcement Learning (2023)0.00
- Data Efficient Training For Reinforcement Learning With Adaptive Behavior Policy Sharing (2020)0.00
- On Hyper-parameter Tuning For Stochastic Optimization Algorithms (2020)0.00
- Robust Model-free Reinforcement Learning With Multi-objective Bayesian Optimization (2019)11.08