Robust Transfer Learning With Side Information
2026 Β· Akram S. Awad, Shihab Ahmed, Yue Wang, et al.
Abstract
Robust Markov Decision Processes (MDPs) address environmental shift through distributionally robust optimization (DRO) by finding an optimal worst-case policy within an uncertainty set of transition kernels. However, standard DRO approaches require enlarging the uncertainty set under large shifts, which leads to overly conservative and pessimistic policies. In this paper, we propose a framework for transfer under environment shift that derives a robust target-domain policy via estimate-centered uncertainty sets, constructed through constrained estimation that integrates limited target samples with side information about the source-target dynamics. The side information includes bounds on feature moments, distributional distances, and density ratios, yielding improved kernel estimates and tighter uncertainty sets. The side information includes bounds on feature moments, distributional distances, and density ratios, yielding improved kernel estimates and tighter uncertainty sets. Er
Authors
(none)
Tags
Stats
Related papers
- Robust Anytime Learning Of Markov Decision Processes (2022)0.00
- Bring Your Own (non-robust) Algorithm To Solve Robust Mdps By Estimating The Worst Kernel (2023)0.00
- Linear Mixture Distributionally Robust Markov Decision Processes (2025)0.00
- Policy Learning For Robust Markov Decision Process With A Mismatched Generative Model (2022)0.00
- On The Foundation Of Distributionally Robust Reinforcement Learning (2023)0.00
- A Bayesian Approach To Robust Reinforcement Learning (2019)0.00
- Sample Complexity Of Robust Reinforcement Learning With A Generative Model (2021)0.00
- Online MDP With Transition Prototypes: A Robust Adaptive Approach (2024)0.00