Distributionally Robust Self Paced Curriculum Reinforcement Learning
2025 Β· Anirudh Satheesh, Keenan Powell, Vaneet Aggarwal
Abstract
A central challenge in reinforcement learning is that policies trained in controlled environments often fail under distribution shifts at deployment into real-world environments. Distributionally Robust Reinforcement Learning (DRRL) addresses this by optimizing for worst-case performance within an uncertainty set defined by a robustness budget \(\epsilon\). However, fixing \(\epsilon\) results in a tradeoff between performance and robustness: small values yield high nominal performance but weak robustness, while large values can result in instability and overly conservative policies. We propose Distributionally Robust Self-Paced Curriculum Reinforcement Learning (DR-SPCRL), a method that overcomes this limitation by treating \(\epsilon\) as a continuous curriculum. DR-SPCRL adaptively schedules the robustness budget according to the agent's progress, enabling a balance between nominal and robust performance. Empirical results across multiple environments demonstrate that DR-SPCRL not o
Authors
(none)
Tags
Stats
Related papers
- On The Foundation Of Distributionally Robust Reinforcement Learning (2023)0.00
- Single-trajectory Distributionally Robust Reinforcement Learning (2023)0.00
- Improving Robustness Via Risk Averse Distributional Reinforcement Learning (2020)0.00
- DR-SAC: Distributionally Robust Soft Actor-critic For Reinforcement Learning Under Uncertainty (2025)0.00
- The Curious Price Of Distributional Robustness In Reinforcement Learning With A Generative Model (2023)0.00
- A Risk-sensitive Approach To Policy Optimization (2022)3.58
- Robust Deep Reinforcement Learning Through Bootstrapped Opportunistic Curriculum (2022)0.00
- Robust Domain Randomised Reinforcement Learning Through Peer-to-peer Distillation (2020)0.00