On Zero-shot Reinforcement Learning
2025 Β· Scott Jeen
Abstract
Modern reinforcement learning (RL) systems capture deep truths about general, human problem-solving. In domains where new data can be simulated cheaply, these systems uncover sequential decision-making policies that far exceed the ability of any human. Society faces many problems whose solutions require this skill, but they are often in domains where new data cannot be cheaply simulated. In such scenarios, we can learn simulators from existing data, but these will only ever be approximately correct, and can be pathologically incorrect when queried outside of their training distribution. As a result, a misalignment between the environments in which we train our agents and the real-world in which we wish to deploy our agents is inevitable. Dealing with this misalignment is the primary concern of zero-shot reinforcement learning, a problem setting where the agent must generalise to a new task or domain with zero practice shots. Whilst impressive progress has been made on methods that perf
Authors
(none)
Tags
Stats
Related papers
- Does Zero-shot Reinforcement Learning Exist? (2022)0.00
- A Unified Framework For Zero-shot Reinforcement Learning (2025)0.00
- Zero-shot Reinforcement Learning From Low Quality Data (2023)0.00
- DARLA: Improving Zero-shot Transfer In Reinforcement Learning (2017)0.00
- Statistical Reinforcement Learning In The Real World: A Survey Of Challenges And Future Directions (2026)0.00
- DRED: Zero-shot Transfer In Reinforcement Learning Via Data-regularised Environment Design (2024)1.81
- Zero-shot Reinforcement Learning Via Function Encoders (2024)0.00
- Cross-trajectory Representation Learning For Zero-shot Generalization In RL (2021)0.00