A Comparative Study Of Deep Reinforcement Learning Models: DQN Vs PPO Vs A2C
2024 Β· Neil de La Fuente, Daniel A. Vidal Guerra
Abstract
This study conducts a comparative analysis of three advanced Deep Reinforcement Learning models: Deep Q-Networks (DQN), Proximal Policy Optimization (PPO), and Advantage Actor-Critic (A2C), within the BreakOut Atari game environment. Our research assesses the performance and effectiveness of these models in a controlled setting. Through rigorous experimentation, we examine each model's learning efficiency, strategy development, and adaptability under dynamic game conditions. The findings provide critical insights into the practical applications of these models in game-based learning environments and contribute to the broader understanding of their capabilities. The code is publicly available at github.com/Neilus03/DRL_comparative_study.
Authors
(none)
Tags
Stats
Related papers
- A2C Is A Special Case Of PPO (2022)0.00
- On The Mistaken Assumption Of Interchangeable Deep Reinforcement Learning Implementations (2025)0.00
- Does DQN Really Learn? Exploring Adversarial Training Schemes In Pong (2022)0.00
- A Human Mixed Strategy Approach To Deep Reinforcement Learning (2018)7.50
- A Practical Introduction To Deep Reinforcement Learning (2025)0.00
- Transforming Game Play: A Comparative Study Of DCQN And DTQN Architectures In Reinforcement Learning (2024)0.00
- On Improving Deep Reinforcement Learning For Pomdps (2017)0.00
- Efficient Deep Reinforcement Learning With Predictive Processing Proximal Policy Optimization (2022)0.00