Diversity For Contingency: Learning Diverse Behaviors For Efficient Adaptation And Transfer
2023 Β· Finn Rietz, Johannes Andreas Stork
Abstract
Discovering all useful solutions for a given task is crucial for transferable RL agents, to account for changes in the task or transition dynamics. This is not considered by classical RL algorithms that are only concerned with finding the optimal policy, given the current task and dynamics. We propose a simple method for discovering all possible solutions of a given task, to obtain an agent that performs well in the transfer setting and adapts quickly to changes in the task or transition dynamics. Our method iteratively learns a set of policies, while each subsequent policy is constrained to yield a solution that is unlikely under all previous policies. Unlike prior methods, our approach does not require learning additional models for novelty detection and avoids balancing task and novelty reward signals, by directly incorporating the constraint into the action selection and optimization steps.
Authors
(none)
Tags
Stats
Related papers
- Open-ended Diverse Solution Discovery With Regulated Behavior Patterns For Cross-domain Adaptation (2022)0.00
- One Solution Is Not All You Need: Few-shot Extrapolation Via Structured Maxent RL (2020)0.00
- An Advantage Based Policy Transfer Algorithm For Reinforcement Learning With Measures Of Transferability (2023)0.00
- MULTIPOLAR: Multi-source Policy Aggregation For Transfer Reinforcement Learning Between Diverse Environmental Dynamics (2019)7.81
- Post-convergence Sim-to-real Policy Transfer: A Principled Alternative To Cherry-picking (2025)0.00
- Single Episode Policy Transfer In Reinforcement Learning (2019)0.00
- Toward Robust Long Range Policy Transfer (2021)0.00
- Adarl: What, Where, And How To Adapt In Transfer Reinforcement Learning (2021)0.00