The Effectiveness Of World Models For Continual Reinforcement Learning
2022 · Samuel Kessler, Mateusz Ostaszewski, Michał Bortkiewicz, et al.
Abstract
World models power some of the most efficient reinforcement learning algorithms. In this work, we show that they can be harnessed for continual learning, a setting in which the agent faces changing environments. World models typically employ a replay buffer for training, which can be naturally extended to continual learning. We systematically study how different selective experience replay methods affect performance, forgetting, and transfer. We also provide recommendations regarding various modeling options for using world models. We call the best set of choices Continual-Dreamer; it is task-agnostic and uses the world model for continual exploration. Continual-Dreamer is sample-efficient and outperforms state-of-the-art task-agnostic continual reinforcement learning methods on the Minigrid and Minihack benchmarks.
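To make the idea of selective experience replay concrete, the sketch below shows reservoir sampling, one widely used way to keep a fixed-size buffer that stays a uniform sample of everything seen so far. It is an illustration only: the abstract does not say which replay strategies were compared, and the class name `ReservoirReplayBuffer`, its parameters, and the `(task_id, step)` stand-in for an episode are all hypothetical.

```python
import random


class ReservoirReplayBuffer:
    """Fixed-capacity buffer holding a uniform sample of all items seen.

    Hypothetical illustration of one selective replay strategy; not taken
    from the paper.
    """

    def __init__(self, capacity, seed=0):
        self.capacity = capacity
        self.items = []
        self.num_seen = 0
        self.rng = random.Random(seed)

    def add(self, episode):
        self.num_seen += 1
        if len(self.items) < self.capacity:
            # Buffer not yet full: always store the new episode.
            self.items.append(episode)
            return
        # Buffer full: keep the new episode with probability
        # capacity / num_seen, overwriting a uniformly chosen slot.
        slot = self.rng.randrange(self.num_seen)
        if slot < self.capacity:
            self.items[slot] = episode

    def sample(self, batch_size):
        # Draw a training batch without replacement.
        return self.rng.sample(self.items, min(batch_size, len(self.items)))


# Simulate three tasks arriving one after another, with no task labels
# used by the buffer itself (the task id is stored only for inspection).
buffer = ReservoirReplayBuffer(capacity=100)
for task_id in range(3):
    for step in range(1000):
        buffer.add((task_id, step))

tasks_in_buffer = {task_id for task_id, _ in buffer.items}
```

Because every item is retained with equal probability regardless of when it arrived, data from earlier tasks remains in the buffer after the environment changes, which is the property that makes such a buffer a candidate for limiting forgetting without task labels.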
Related papers
- Augmenting Replay In World Models For Continual Reinforcement Learning (2024)
- Continual Learning Using World Models For Pseudo-rehearsal (2019)
- Continual World: A Robotic Benchmark For Continual Reinforcement Learning (2021)
- Continual Visual Reinforcement Learning With A Life-long World Model (2023)
- Smaller World Models For Reinforcement Learning (2020)
- Recurrent World Models Facilitate Policy Evolution (2018)
- World Model Agents With Change-based Intrinsic Motivation (2025)
- Continual Reinforcement Learning By Planning With Online World Models (2025)