General Policy Mapping: Online Continual Reinforcement Learning Inspired On The Insect Brain
2022 Β· Angel Yanguas-Gil, Sandeep Madireddy
Abstract
We have developed a model for online continual or lifelong reinforcement learning (RL) inspired on the insect brain. Our model leverages the offline training of a feature extraction and a common general policy layer to enable the convergence of RL algorithms in online settings. Sharing a common policy layer across tasks leads to positive backward transfer, where the agent continuously improved in older tasks sharing the same underlying general policy. Biologically inspired restrictions to the agent's network are key for the convergence of RL algorithms. This provides a pathway towards efficient online RL in resource-constrained scenarios.
Authors
(none)
Tags
Stats
Related papers
- Learning Off-policy With Model-based Intrinsic Motivation For Active Online Exploration (2024)0.00
- Policy Agnostic RL: Offline RL And Online RL Fine-tuning Of Any Class And Backbone (2024)0.00
- Learning A Subspace Of Policies For Online Adaptation In Reinforcement Learning (2021)0.00
- Online Reinforcement Learning In Non-stationary Context-driven Environments (2023)0.00
- Continual Reinforcement Learning By Planning With Online World Models (2025)0.00
- Deep RL With Information Constrained Policies: Generalization In Continuous Control (2020)0.00
- PROTO: Iterative Policy Regularized Offline-to-online Reinforcement Learning (2023)0.00
- AWAC: Accelerating Online Reinforcement Learning With Offline Datasets (2020)0.00