Adaptive \(q\)-network: On-the-fly Target Selection For Deep Reinforcement Learning
2024 Β· ThΓ©o Vincent, Fabian Wahren, Jan Peters, et al.
Abstract
Deep Reinforcement Learning (RL) is well known for being highly sensitive to hyperparameters, requiring practitioners substantial efforts to optimize them for the problem at hand. This also limits the applicability of RL in real-world scenarios. In recent years, the field of automated Reinforcement Learning (AutoRL) has grown in popularity by trying to address this issue. However, these approaches typically hinge on additional samples to select well-performing hyperparameters, hindering sample-efficiency and practicality. Furthermore, most AutoRL methods are heavily based on already existing AutoML methods, which were originally developed neglecting the additional challenges inherent to RL due to its non-stationarities. In this work, we propose a new approach for AutoRL, called Adaptive \(Q\)-Network (AdaQN), that is tailored to RL to take into account the non-stationarity of the optimization procedure without requiring additional samples. AdaQN learns several \(Q\)-functions, each one
Authors
(none)
Tags
Stats
Related papers
- Sample-efficient Automated Deep Reinforcement Learning (2020)0.00
- Deep Transfer \(q\)-learning For Offline Non-stationary Reinforcement Learning (2025)0.00
- Deep Q-networks For Accelerating The Training Of Deep Neural Networks (2016)0.00
- Eau De \(q\)-network: Adaptive Distillation Of Neural Networks In Deep Reinforcement Learning (2025)0.00
- An Integrated Approach To Neural Architecture Search For Deep Q-networks (2025)0.00
- Iterated \(q\)-network: Beyond One-step Bellman Updates In Deep Reinforcement Learning (2024)0.00
- Training A Resilient Q-network Against Observational Interference (2021)0.00
- Automaton Distillation: Neuro-symbolic Transfer Learning For Deep Reinforcement Learning (2023)0.00