Efficientzero V2: Mastering Discrete And Continuous Control With Limited Data
2024 Β· Shengjie Wang, Shaohuai Liu, Weirui Ye, et al.
Abstract
Sample efficiency remains a crucial challenge in applying Reinforcement Learning (RL) to real-world tasks. While recent algorithms have made significant strides in improving sample efficiency, none have achieved consistently superior performance across diverse domains. In this paper, we introduce EfficientZero V2, a general framework designed for sample-efficient RL algorithms. We have expanded the performance of EfficientZero to multiple domains, encompassing both continuous and discrete actions, as well as visual and low-dimensional inputs. With a series of improvements we propose, EfficientZero V2 outperforms the current state-of-the-art (SOTA) by a significant margin in diverse tasks under the limited data setting. EfficientZero V2 exhibits a notable advancement over the prevailing general algorithm, DreamerV3, achieving superior outcomes in 50 of 66 evaluated tasks across diverse benchmarks, such as Atari 100k, Proprio Control, and Vision Control.
Authors
(none)
Tags
Stats
Related papers
- Human-inspired Framework To Accelerate Reinforcement Learning (2023)0.00
- Measuring Progress In Deep Reinforcement Learning Sample Efficiency (2021)0.00
- Off-policy RL Algorithms Can Be Sample-efficient For Continuous Control Via Sample Multiple Reuse (2023)0.00
- Drm: Mastering Visual Reinforcement Learning Through Dormant Ratio Minimization (2023)0.00
- Zero-shot Reinforcement Learning From Low Quality Data (2023)0.00
- Importance Of Using Appropriate Baselines For Evaluation Of Data-efficiency In Deep Reinforcement Learning For Atari (2020)0.00
- Towards Sample-efficiency And Generalization Of Transfer And Inverse Reinforcement Learning: A Comprehensive Literature Review (2024)0.00
- On Zero-shot Reinforcement Learning (2025)0.00