Generalized Population-based Training For Hyperparameter Optimization In Reinforcement Learning
2024 Β· Hui Bai, Ran Cheng
Abstract
Hyperparameter optimization plays a key role in the machine learning domain. Its significance is especially pronounced in reinforcement learning (RL), where agents continuously interact with and adapt to their environments, requiring dynamic adjustments in their learning trajectories. To cater to this dynamicity, the Population-Based Training (PBT) was introduced, leveraging the collective intelligence of a population of agents learning simultaneously. However, PBT tends to favor high-performing agents, potentially neglecting the explorative potential of agents on the brink of significant advancements. To mitigate the limitations of PBT, we present the Generalized Population-Based Training (GPBT), a refined framework designed for enhanced granularity and flexibility in hyperparameter adaptation. Complementing GPBT, we further introduce Pairwise Learning (PL). Instead of merely focusing on elite agents, PL employs a comprehensive pairwise strategy to identify performance differentials a
Authors
(none)
Tags
Stats
Related papers
- Hyperparameter Tuning For Deep Reinforcement Learning Applications (2022)0.00
- Simultaneous Training Of First- And Second-order Optimizers In Population-based Reinforcement Learning (2024)0.00
- Automatic Tuning Of Hyper-parameters Of Reinforcement Learning Algorithms Using Bayesian Optimization With Behavioral Cloning (2021)0.00
- A Hierarchical Two-tier Approach To Hyper-parameter Optimization In Reinforcement Learning (2019)0.00
- A Method For Evaluating Hyperparameter Sensitivity In Reinforcement Learning (2024)0.00
- Data Efficient Training For Reinforcement Learning With Adaptive Behavior Policy Sharing (2020)0.00
- Sample-efficient Automated Deep Reinforcement Learning (2020)0.00
- Hyperparameter Optimisation With Practical Interpretability And Explanation Methods In Probabilistic Curriculum Learning (2025)0.00