VQC-based Reinforcement Learning With Data Re-uploading: Performance And Trainability
2024 · Rodrigo Coelho, André Sequeira, Luís Paulo Santos
Abstract
Reinforcement Learning (RL) consists of designing agents that make intelligent decisions without human supervision. When used alongside function approximators such as Neural Networks (NNs), RL is capable of solving extremely complex problems. Deep Q-Learning, an RL algorithm that uses Deep NNs, has achieved super-human performance in some specific tasks. It is also possible to use Variational Quantum Circuits (VQCs) as function approximators in RL algorithms. This work empirically studies the performance and trainability of such VQC-based Deep Q-Learning models in classic control benchmark environments. More specifically, we study how data re-uploading affects both of these metrics. We show that the magnitude and the variance of the gradients of these models remain substantial throughout training due to the moving targets of Deep Q-Learning. Moreover, we empirically show that increasing the number of qubits does not lead to an exponential vanishing behavior of the magnitude and
Related papers
- Quantum Reinforcement Learning By Adaptive Non-local Observables (2025)
- Variational Quantum Circuits For Deep Reinforcement Learning (2019)
- Efficient Quantum Recurrent Reinforcement Learning Via Quantum Reservoir Computing (2023)
- An Introduction To Quantum Reinforcement Learning (QRL) (2024)
- Hybrid Quantum-classical Policy Gradient For Adaptive Control Of Cyber-physical Systems: A Comparative Study Of VQC Vs. MLP (2025)
- Quantum Natural Policy Gradients: Towards Sample-efficient Reinforcement Learning (2023)
- Benchmarking Quantum Reinforcement Learning (2025)
- Curriculum-based Deep Reinforcement Learning For Quantum Control (2020)