Qf-tuner: Breaking Tradition In Reinforcement Learning
2024 Β· Mahmood A. Jumaah, Yossra H. Ali, Tarik A. Rashid
Abstract
In reinforcement learning algorithms, the hyperparameters tuning method refers to choosing the optimal parameters that may increase the overall performance. Manual or random hyperparameter tuning methods can lead to different results in the reinforcement learning algorithms. In this paper, we propose a new method called QF-tuner for automatic hyperparameter tuning in the Q learning algorithm using the FOX optimization algorithm (FOX). Furthermore, a new objective function has been employed within FOX that prioritizes reward over learning error and time. QF tuner starts by running the FOX and tries to minimize the fitness value derived from observations at each iteration by executing the Q-learning algorithm. The proposed method has been evaluated using two control tasks from the OpenAI Gym: CartPole and FrozenLake. The empirical results indicate that the QF-tuner outperforms other optimization algorithms, such as particle swarm optimization (PSO), bees algorithm (BA), genetic algorithm
Authors
(none)
Tags
Stats
Related papers
- Simultaneous Training Of First- And Second-order Optimizers In Population-based Reinforcement Learning (2024)0.00
- On Hyper-parameter Tuning For Stochastic Optimization Algorithms (2020)0.00
- Hyperparameter Tuning For Deep Reinforcement Learning Applications (2022)0.00
- The Role Of Target Update Frequencies In Q-learning (2026)0.00
- Hyperparameter Selection Methods For Fitted Q-evaluation With Error Guarantee (2022)0.00
- Adaptive \(q\)-network: On-the-fly Target Selection For Deep Reinforcement Learning (2024)0.00
- Aggressive Q-learning With Ensembles: Achieving Both High Sample Efficiency And High Asymptotic Performance (2021)0.00
- XQC: Well-conditioned Optimization Accelerates Deep Reinforcement Learning (2025)0.00