STORM: Efficient Stochastic Transformer Based World Models For Reinforcement Learning
2023 Β· Weipu Zhang, Gang Wang, Jian Sun, et al.
Abstract
Recently, model-based reinforcement learning algorithms have demonstrated remarkable efficacy in visual input environments. These approaches begin by constructing a parameterized simulation world model of the real environment through self-supervised learning. By leveraging the imagination of the world model, the agent's policy is enhanced without the constraints of sampling from the real environment. The performance of these algorithms heavily relies on the sequence modeling and generation capabilities of the world model. However, constructing a perfectly accurate model of a complex unknown environment is nearly impossible. Discrepancies between the model and reality may cause the agent to pursue virtual goals, resulting in subpar performance in the real environment. Introducing random noise into model-based reinforcement learning has been proven beneficial. In this work, we introduce Stochastic Transformer-based wORld Model (STORM), an efficient world model architecture that combines
Authors
(none)
Tags
Stats
Related papers
- Transformers Are Sample-efficient World Models (2022)0.00
- Object-centric World Models For Causality-aware Reinforcement Learning (2025)0.00
- Decentralized Transformers With Centralized Aggregation Are Sample-efficient Multi-agent World Models (2024)0.00
- Do Transformer World Models Give Better Policy Gradients? (2024)0.00
- Smaller World Models For Reinforcement Learning (2020)0.00
- Recurrent World Models Facilitate Policy Evolution (2018)0.00
- The Effectiveness Of World Models For Continual Reinforcement Learning (2022)0.00
- Learning To Play Atari In A World Of Tokens (2024)0.00