Accerl: Policy Acceleration Framework For Deep Reinforcement Learning
2022 Β· Hongjie Zhang
Abstract
Deep reinforcement learning has achieved great success in various fields with its super decision-making ability. However, the policy learning process requires a large amount of training time, causing energy consumption. Inspired by the redundancy of neural networks, we propose a lightweight parallel training framework based on neural network compression, AcceRL, to accelerate the policy learning while ensuring policy quality. Specifically, AcceRL speeds up the experience collection by flexibly combining various neural network compression methods. Overall, the AcceRL consists of five components, namely Actor, Learner, Compressor, Corrector, and Monitor. The Actor uses the Compressor to compress the Learner's policy network to interact with the environment. And the generated experiences are transformed by the Corrector with Off-Policy methods, such as V-trace, Retrace and so on. Then the corrected experiences are feed to the Learner for policy learning. We believe this is the first gener
Authors
(none)
Tags
Stats
Related papers
- Acme: A Research Framework For Distributed Reinforcement Learning (2020)0.00
- Data Efficient Training For Reinforcement Learning With Adaptive Behavior Policy Sharing (2020)0.00
- Human-inspired Framework To Accelerate Reinforcement Learning (2023)0.00
- Accelerated Methods For Deep Reinforcement Learning (2018)0.00
- Adaptable Automation With Modular Deep Reinforcement Learning And Policy Transfer (2020)0.00
- Policy Augmentation: An Exploration Strategy For Faster Convergence Of Deep Reinforcement Learning Algorithms (2021)2.26
- Policy Agnostic RL: Offline RL And Online RL Fine-tuning Of Any Class And Backbone (2024)0.00
- A3C-S: Automated Agent Accelerator Co-search Towards Efficient Deep Reinforcement Learning (2021)0.00