Augmenting Replay In World Models For Continual Reinforcement Learning
2024 · Luke Yang, Levin Kuhlmann, Gideon Kowadlo
Abstract
Continual RL requires an agent to learn new tasks without forgetting previous ones, while improving on both past and future tasks. The most common approaches use model-free algorithms, where replay buffers can help mitigate catastrophic forgetting but often struggle with scalability due to large memory requirements. Biologically inspired replay suggests replaying to a world model, which aligns with model-based RL rather than the common setting of replay in model-free algorithms. Model-based RL offers benefits for continual RL by leveraging knowledge of the environment that is independent of the policy. We introduce WMAR (World Models with Augmented Replay), a model-based RL algorithm with a memory-efficient distribution-matching replay buffer. WMAR extends the well-known DreamerV3 algorithm, which employs a simple FIFO buffer and was not tested in continual RL. We evaluated WMAR and DreamerV3 with replay buffers of the same size. They were tested on two scenarios: tasks with shared structure using Ope
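The contrast the abstract draws between a simple FIFO buffer and a distribution-matching buffer can be illustrated with a minimal sketch. Reservoir sampling is used here as one standard way to keep a fixed-size buffer whose contents approximate the distribution of everything seen so far; this is an assumption made for illustration, not a statement of how WMAR implements its buffer. The class and function names (`FIFOBuffer`, `ReservoirBuffer`, `run_demo`) and the toy `(task_id, step)` transitions are hypothetical.

```python
import random
from collections import deque


class FIFOBuffer:
    """Keeps only the most recent `capacity` items; the oldest are evicted first."""

    def __init__(self, capacity):
        self.items = deque(maxlen=capacity)

    def add(self, item):
        self.items.append(item)


class ReservoirBuffer:
    """Keeps a uniform random subset of all items seen so far (reservoir sampling).

    Illustrative stand-in for a distribution-matching buffer (assumption).
    """

    def __init__(self, capacity, seed=0):
        self.capacity = capacity
        self.items = []
        self.seen = 0
        self.rng = random.Random(seed)

    def add(self, item):
        self.seen += 1
        if len(self.items) < self.capacity:
            self.items.append(item)
            return
        # The new item replaces a stored one with probability capacity / seen.
        slot = self.rng.randrange(self.seen)
        if slot < self.capacity:
            self.items[slot] = item


def task_counts(buffer):
    """Count how many stored transitions belong to each task."""
    counts = {}
    for task_id, _ in buffer.items:
        counts[task_id] = counts.get(task_id, 0) + 1
    return counts


def run_demo(capacity=100, steps_per_task=1000, num_tasks=3):
    """Feed tasks one after another into both buffers and report what each retains."""
    fifo = FIFOBuffer(capacity)
    reservoir = ReservoirBuffer(capacity)
    for task_id in range(num_tasks):
        for step in range(steps_per_task):
            transition = (task_id, step)  # toy placeholder for real experience
            fifo.add(transition)
            reservoir.add(transition)
    return task_counts(fifo), task_counts(reservoir)


if __name__ == "__main__":
    fifo_counts, reservoir_counts = run_demo()
    print("FIFO buffer per-task counts:     ", fifo_counts)
    print("Reservoir buffer per-task counts:", reservoir_counts)
```

In this toy setting the FIFO buffer ends up holding only transitions from the last task, while the reservoir buffer of the same size retains a mix of all tasks. That is the kind of property that makes a same-size buffer more useful against forgetting.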
Related papers
- The Effectiveness Of World Models For Continual Reinforcement Learning (2022)
- Augmented Replay Memory In Reinforcement Learning With Continuous Control (2019)
- Continual Learning Using World Models For Pseudo-rehearsal (2019)
- Replay-enhanced Continual Reinforcement Learning (2023)
- Replay Buffer With Local Forgetting For Adapting To Local Environment Changes In Deep Model-based Reinforcement Learning (2023)
- Map-based Experience Replay: A Memory-efficient Solution To Catastrophic Forgetting In Reinforcement Learning (2023)
- Task-agnostic Continual Reinforcement Learning: Gaining Insights And Overcoming Challenges (2022)
- Continual World: A Robotic Benchmark For Continual Reinforcement Learning (2021)