A Unified Framework For Zero-shot Reinforcement Learning
2025 · Jacopo di Ventura, Jan Felix Kleuker, Aske Plaat, et al.
Abstract
Zero-shot reinforcement learning (RL) has emerged as a setting for developing general agents, capable of solving downstream tasks without additional training or planning at test-time. While conventional RL optimizes policies for fixed rewards, zero-shot RL requires learning representations that enable immediate adaptation to arbitrary reward functions. As the field matures, the growing diversity of approaches demands a foundational framework reconciling different perspectives under a common unifying structure. In this work, we introduce a formal, unified framework for zero-shot RL, allowing for rigorous comparisons across methods. We propose a taxonomy organizing the algorithmic landscape along two levels: representation, distinguishing between compositional and direct methods based on their exploitation of action-value function decompositions; and learning paradigm, differentiating between reward-free and pseudo reward-free training. Additionally, we propose a unified view of existing
Related papers
- Does Zero-shot Reinforcement Learning Exist? (2022) 0.00
- On Zero-shot Reinforcement Learning (2025) 0.00
- Cross-trajectory Representation Learning For Zero-shot Generalization In RL (2021) 0.00
- Zero-shot Reinforcement Learning Via Function Encoders (2024) 0.00
- Zero-shot Reinforcement Learning From Low Quality Data (2023) 0.00
- Inferring Behavior-specific Context Improves Zero-shot Generalization In Reinforcement Learning (2024) 0.95
- Zero-shot Policy Learning With Spatial Temporal Reward Decomposition On Contingency-aware Observation (2019) 0.00
- Ancestral Reinforcement Learning: Unifying Zeroth-order Optimization And Genetic Algorithms For Reinforcement Learning (2024) 0.00