Sample-efficient Automated Deep Reinforcement Learning
2020 · Jörg K. H. Franke, Gregor Köhler, André Biedenkapp, et al.
Abstract
Despite significant progress in challenging problems across various domains, applying state-of-the-art deep reinforcement learning (RL) algorithms remains challenging due to their sensitivity to the choice of hyperparameters. This sensitivity can partly be attributed to the non-stationarity of the RL problem, potentially requiring different hyperparameter settings at various stages of the learning process. Additionally, in the RL setting, hyperparameter optimization (HPO) requires a large number of environment interactions, hindering the transfer of the successes in RL to real-world applications. In this work, we tackle the issues of sample-efficient and dynamic HPO in RL. We propose a population-based automated RL (AutoRL) framework to meta-optimize arbitrary off-policy RL algorithms. In this framework, we optimize the hyperparameters and also the neural architecture while simultaneously training the agent. By sharing the collected experience across the population, we substantially in
Related papers
- ARLBench: Flexible And Efficient Benchmarking For Hyperparameter Optimization In Reinforcement Learning (2024)
- Hyperparameter Tuning For Deep Reinforcement Learning Applications (2022)
- Human-inspired Framework To Accelerate Reinforcement Learning (2023)
- Data Efficient Training For Reinforcement Learning With Adaptive Behavior Policy Sharing (2020)
- Automatic Tuning Of Hyper-parameters Of Reinforcement Learning Algorithms Using Bayesian Optimization With Behavioral Cloning (2021)
- Adaptive Q-network: On-the-fly Target Selection For Deep Reinforcement Learning (2024)
- Generalized Population-based Training For Hyperparameter Optimization In Reinforcement Learning (2024)
- Automated Reinforcement Learning: An Overview (2022)