Can Learned Optimization Make Reinforcement Learning Less Difficult?
2024 · Alexander David Goldie, Chris Lu, Matthew Thomas Jackson, et al.
Abstract
While reinforcement learning (RL) holds great potential for decision making in the real world, it suffers from a number of unique difficulties which often need specific consideration. In particular: it is highly non-stationary; suffers from high degrees of plasticity loss; and requires exploration to prevent premature convergence to local optima and maximize return. In this paper, we consider whether learned optimization can help overcome these problems. Our method, Learned Optimization for Plasticity, Exploration and Non-stationarity (OPEN), meta-learns an update rule whose input features and output structure are informed by previously proposed solutions to these difficulties. We show that our parameterization is flexible enough to enable meta-learning in diverse learning contexts, including the ability to use stochasticity for exploration. Our experiments demonstrate that when meta-trained on single and small sets of environments, OPEN outperforms or equals traditionally used optimiz
Related papers
- Learning To Optimize For Reinforcement Learning (2023)
- Learning Curricula In Open-ended Worlds (2023)
- Is Exploration Or Optimization The Problem For Deep Reinforcement Learning? (2025)
- First-explore, Then Exploit: Meta-learning To Solve Hard Exploration-exploitation Trade-offs (2023)
- Discovering Reinforcement Learning Algorithms (2020)
- Decoupling Exploration And Exploitation For Meta-reinforcement Learning Without Sacrifices (2020)
- Discovering General Reinforcement Learning Algorithms With Adversarial Environment Design (2023)
- Towards An Adaptable And Generalizable Optimization Engine In Decision And Control: A Meta Reinforcement Learning Approach (2024)