Improving Generalization In Meta-rl With Imaginary Tasks From Latent Dynamics Mixture
2021 Β· Suyoung Lee, Sae-Young Chung
Abstract
The generalization ability of most meta-reinforcement learning (meta-RL) methods is largely limited to test tasks that are sampled from the same distribution used to sample training tasks. To overcome the limitation, we propose Latent Dynamics Mixture (LDM) that trains a reinforcement learning agent with imaginary tasks generated from mixtures of learned latent dynamics. By training a policy on mixture tasks along with original training tasks, LDM allows the agent to prepare for unseen test tasks during training and prevents the agent from overfitting the training tasks. LDM significantly outperforms standard meta-RL methods in test returns on the gridworld navigation and MuJoCo tasks where we strictly separate the training task distribution and the test task distribution.
Authors
(none)
Tags
Stats
Related papers
- Procedural Generation Of Meta-reinforcement Learning Tasks (2023)0.00
- A Tutorial On Meta-reinforcement Learning (2023)10.85
- Improving Generalization In Reinforcement Learning With Mixture Regularization (2020)0.00
- Meta Reinforcement Learning With Latent Variable Gaussian Processes (2018)0.00
- Meta Reinforcement Learning With Finite Training Tasks -- A Density Estimation Approach (2022)0.00
- Improving Generalization In Meta Reinforcement Learning Using Learned Objectives (2019)0.00
- Guided Meta-policy Search (2019)0.00
- Model-based Adversarial Meta-reinforcement Learning (2020)0.00