Doubly Inhomogeneous Reinforcement Learning
2022 Β· Liyuan Hu, Mengbing Li, Chengchun Shi, et al.
Abstract
This paper studies reinforcement learning (RL) in doubly inhomogeneous environments under temporal non-stationarity and subject heterogeneity. In a number of applications, it is commonplace to encounter datasets generated by system dynamics that may change over time and population, challenging high-quality sequential decision making. Nonetheless, most existing RL solutions require either temporal stationarity or subject homogeneity, which would result in sub-optimal policies if both assumptions were violated. To address both challenges simultaneously, we propose an original algorithm to determine the ``best data chunks" that display similar dynamics over time and across individuals for policy learning, which alternates between most recent change point detection and cluster identification. Our method is general, and works with a wide range of clustering and change point detection algorithms. It is multiply robust in the sense that it takes multiple initial estimators as input and only r
Authors
(none)
Tags
Stats
Related papers
- Off-policy Evaluation In Doubly Inhomogeneous Environments (2023)7.16
- Reinforcement Learning For Individual Optimal Policy From Heterogeneous Data (2025)0.00
- Demystifying Reinforcement Learning In Time-varying Systems (2022)0.00
- Model-agnostic Solutions For Deep Reinforcement Learning In Non-ergodic Contexts (2026)0.00
- Reinforcement Learning With Non-ergodic Reward Increments: Robustness Via Ergodicity Transformations (2023)0.00
- Reinforcement Learning In Presence Of Discrete Markovian Context Evolution (2022)0.00
- Online Reinforcement Learning In Non-stationary Context-driven Environments (2023)0.00
- Federated Reinforcement Learning With Constraint Heterogeneity (2024)0.00