Accelerating Distributed Deep Reinforcement Learning By In-network Experience Sampling
2021 Β· Masaki Furukawa, Hiroki Matsutani
Abstract
A computing cluster that interconnects multiple compute nodes is used to accelerate distributed reinforcement learning based on DQN (Deep Q-Network). In distributed reinforcement learning, Actor nodes acquire experiences by interacting with a given environment and a Learner node optimizes their DQN model. Since data transfer between Actor and Learner nodes increases depending on the number of Actor nodes and their experience size, communication overhead between them is one of major performance bottlenecks. In this paper, their communication is accelerated by DPDK-based network optimizations, and DPDK-based low-latency experience replay memory server is deployed between Actor and Learner nodes interconnected with a 40GbE (40Gbit Ethernet) network. Evaluation results show that, as a network optimization technique, kernel bypassing by DPDK reduces network access latencies to a shared memory server by 32.7% to 58.9%. As another network optimization technique, an in-network experience repla
Authors
(none)
Tags
Stats
Related papers
- Bootstrapping A DQN Replay Memory With Synthetic Experiences (2020)5.84
- Acceleration For Deep Reinforcement Learning Using Parallel And Distributed Computing: A Survey (2024)8.82
- Associative Memory Based Experience Replay For Deep Reinforcement Learning (2022)6.34
- DNS: Determinantal Point Process Based Neural Network Sampler For Ensemble Reinforcement Learning (2022)0.00
- Stratified Experience Replay: Correcting Multiplicity Bias In Off-policy Reinforcement Learning (2021)0.00
- Deep Q-networks For Accelerating The Training Of Deep Neural Networks (2016)0.00
- Stabilising Experience Replay For Deep Multi-agent Reinforcement Learning (2017)0.00
- Accelerated Methods For Deep Reinforcement Learning (2018)0.00