Task Aware Dreamer For Task Generalization In Reinforcement Learning
2023 Β· Chengyang Ying, Xinning Zhou, Zhongkai Hao, et al.
Abstract
A long-standing goal of reinforcement learning is to acquire agents that can learn on training tasks and generalize well on unseen tasks that may share a similar dynamic but with different reward functions. The ability to generalize across tasks is important as it determines an agent's adaptability to real-world scenarios where reward mechanisms might vary. In this work, we first show that training a general world model can utilize similar structures in these tasks and help train more generalizable agents. Extending world models into the task generalization setting, we introduce a novel method named Task Aware Dreamer (TAD), which integrates reward-informed features to identify consistent latent characteristics across tasks. Within TAD, we compute the variational lower bound of sample data log-likelihood, which introduces a new term designed to differentiate tasks using their states, as the optimization objective of our reward-informed world models. To demonstrate the advantages of the
Authors
(none)
Tags
Stats
Related papers
- Harmonydream: Task Harmonization Inside World Models (2023)3.46
- Do Agents Dream Of Electric Sheep?: Improving Generalization In Reinforcement Learning Through Generative Learning (2024)0.00
- Powderworld: A Platform For Understanding Generalization Via Rich Task Distributions (2022)0.00
- Improving Generalization In Meta-rl With Imaginary Tasks From Latent Dynamics Mixture (2021)0.00
- A Model-based Approach For Sample-efficient Multi-task Reinforcement Learning (2019)0.00
- Unifying Task Specification In Reinforcement Learning (2016)0.00
- Dream To Control: Learning Behaviors By Latent Imagination (2019)0.00
- Towards Unraveling And Improving Generalization In World Models (2024)0.00