The Curious Price Of Distributional Robustness In Reinforcement Learning With A Generative Model
2023 Β· Laixi Shi, Gen Li, Yuting Wei, et al.
Abstract
This paper investigates model robustness in reinforcement learning (RL) to reduce the sim-to-real gap in practice. We adopt the framework of distributionally robust Markov decision processes (RMDPs), aimed at learning a policy that optimizes the worst-case performance when the deployed environment falls within a prescribed uncertainty set around the nominal MDP. Despite recent efforts, the sample complexity of RMDPs remained mostly unsettled regardless of the uncertainty set in use. It was unclear if distributional robustness bears any statistical consequences when benchmarked against standard RL. Assuming access to a generative model that draws samples based on the nominal MDP, we provide a near-optimal characterization of the sample complexity of RMDPs when the uncertainty set is specified via either the total variation (TV) distance or chi-squared divergence. The algorithm studied here is a model-based method called distributionally robust value iteration, which is shown to be near-
Authors
(none)
Tags
Stats
Related papers
- Sample Complexity Of Robust Reinforcement Learning With A Generative Model (2021)0.00
- On The Foundation Of Distributionally Robust Reinforcement Learning (2023)0.00
- Improving Robustness Via Risk Averse Distributional Reinforcement Learning (2020)0.00
- Sample Complexity Of Offline Distributionally Robust Linear Markov Decision Processes (2024)0.00
- Distributionally Robust Model-based Offline Reinforcement Learning With Near-optimal Sample Complexity (2022)0.00
- Sample-efficient Robust Multi-agent Reinforcement Learning In The Face Of Environmental Uncertainty (2024)0.00
- Distributionally Robust Model-based Reinforcement Learning With Large State Spaces (2023)0.00
- Distributional Robustness And Regularization In Reinforcement Learning (2020)0.00