On The Mistaken Assumption Of Interchangeable Deep Reinforcement Learning Implementations
2025 Β· Rajdeep Singh Hundal, Yan Xiao, Xiaochun Cao, et al.
Abstract
Deep Reinforcement Learning (DRL) is a paradigm of artificial intelligence where an agent uses a neural network to learn which actions to take in a given environment. DRL has recently gained traction from being able to solve complex environments like driving simulators, 3D robotic control, and multiplayer-online-battle-arena video games. Numerous implementations of the state-of-the-art algorithms responsible for training these agents, like the Deep Q-Network (DQN) and Proximal Policy Optimization (PPO) algorithms, currently exist. However, studies make the mistake of assuming implementations of the same algorithm to be consistent and thus, interchangeable. In this paper, through a differential testing lens, we present the results of studying the extent of implementation inconsistencies, their effect on the implementations' performance, as well as their impact on the conclusions of prior studies under the assumption of interchangeable implementations. The outcomes of our differential te
Authors
(none)
Tags
Stats
Related papers
- GDI: Rethinking What Makes Reinforcement Learning Different From Supervised Learning (2021)0.00
- Modern Deep Reinforcement Learning Algorithms (2019)0.00
- Control-optimized Deep Reinforcement Learning For Artificially Intelligent Autonomous Systems (2025)0.00
- A Practical Introduction To Deep Reinforcement Learning (2025)0.00
- Deterministic Implementations For Reproducibility In Deep Reinforcement Learning (2018)0.00
- Evaluating The Progress Of Deep Reinforcement Learning In The Real World: Aligning Domain-agnostic And Domain-specific Research (2021)0.00
- A Comparative Study Of Deep Reinforcement Learning Models: DQN Vs PPO Vs A2C (2024)0.00
- Dopamine: A Research Framework For Deep Reinforcement Learning (2018)0.00