Hypernetworks For Zero-shot Transfer In Reinforcement Learning
2022 Β· Sahand Rezaei-Shoshtari, Charlotte Morissette, Francois Robert Hogan, et al.
Abstract
In this paper, hypernetworks are trained to generate behaviors across a range of unseen task conditions, via a novel TD-based training objective and data from a set of near-optimal RL solutions for training tasks. This work relates to meta RL, contextual RL, and transfer learning, with a particular focus on zero-shot performance at test time, enabled by knowledge of the task parameters (also known as context). Our technical approach is based upon viewing each RL algorithm as a mapping from the MDP specifics to the near-optimal value function and policy and seek to approximate it with a hypernetwork that can generate near-optimal value functions and policies, given the parameters of the MDP. We show that, under certain conditions, this mapping can be considered as a supervised learning problem. We empirically evaluate the effectiveness of our method for zero-shot transfer to new reward and transition dynamics on a series of continuous control tasks from DeepMind Control Suite. Our metho
Authors
(none)
Tags
Stats
Related papers
- On Zero-shot Reinforcement Learning (2025)0.00
- A Unified Framework For Zero-shot Reinforcement Learning (2025)0.00
- Does Zero-shot Reinforcement Learning Exist? (2022)0.00
- Cross-trajectory Representation Learning For Zero-shot Generalization In RL (2021)0.00
- HMRL: Hyper-meta Learning For Sparse Reward Reinforcement Learning Problem (2020)0.00
- Zero-shot Reinforcement Learning Via Function Encoders (2024)0.00
- Exploration In Approximate Hyper-state Space For Meta Reinforcement Learning (2020)0.00
- Inferring Behavior-specific Context Improves Zero-shot Generalization In Reinforcement Learning (2024)0.95