Surreal-system: Fully-integrated Stack For Distributed Deep Reinforcement Learning
2019 Β· Linxi Fan, Yuke Zhu, Jiren Zhu, et al.
Abstract
We present an overview of SURREAL-System, a reproducible, flexible, and scalable framework for distributed reinforcement learning (RL). The framework consists of a stack of four layers: Provisioner, Orchestrator, Protocol, and Algorithms. The Provisioner abstracts away the machine hardware and node pools across different cloud providers. The Orchestrator provides a unified interface for scheduling and deploying distributed algorithms by high-level description, which is capable of deploying to a wide range of hardware from a personal laptop to full-fledged cloud clusters. The Protocol provides network communication primitives optimized for RL. Finally, the SURREAL algorithms, such as Proximal Policy Optimization (PPO) and Evolution Strategies (ES), can easily scale to 1000s of CPU cores and 100s of GPUs. The learning performances of our distributed algorithms establish new state-of-the-art on OpenAI Gym and Robotics Suites tasks.
Authors
(none)
Tags
Stats
Related papers
- SRL: Scaling Distributed Reinforcement Learning To Over Ten Thousand Cores (2023)0.00
- Rllib: Abstractions For Distributed Reinforcement Learning (2017)0.00
- The AI Arena: A Framework For Distributed Multi-agent Reinforcement Learning (2021)0.00
- MSRL: Distributed Reinforcement Learning With Dataflow Fragments (2022)0.00
- SLM Lab: A Comprehensive Benchmark And Modular Software Framework For Reproducible Deep Reinforcement Learning (2019)0.00
- A Scalable And Reproducible System-on-chip Simulation For Reinforcement Learning (2021)0.00
- Computerrl: Scaling End-to-end Online Reinforcement Learning For Computer Use Agents (2025)0.00
- Integrating Distributed Architectures In Highly Modular RL Libraries (2020)0.00