Confounded Causal Imitation Learning With Instrumental Variables
2025 Β· Yan Zeng, Shenglan Nie, Feng Xie, et al.
Abstract
Imitation learning from demonstrations usually suffers from the confounding effects of unmeasured variables (i.e., unmeasured confounders) on the states and actions. If ignoring them, a biased estimation of the policy would be entailed. To break up this confounding gap, in this paper, we take the best of the strong power of instrumental variables (IV) and propose a Confounded Causal Imitation Learning (C2L) model. This model accommodates confounders that influence actions across multiple timesteps, rather than being restricted to immediate temporal dependencies. We develop a two-stage imitation learning framework for valid IV identification and policy optimization. In particular, in the first stage, we construct a testing criterion based on the defined pseudo-variable, with which we achieve identifying a valid IV for the C2L models. Such a criterion entails the sufficient and necessary identifiability conditions for IV validity. In the second stage, with the identified IV, we propose t
Authors
(none)
Tags
Stats
Related papers
- Causal Imitation Learning With Unobserved Confounders (2022)0.00
- Causal Imitation Learning Under Expert-observable And Expert-unobservable Confounding (2025)0.00
- Sequential Causal Imitation Learning With Unobserved Confounders (2022)0.00
- Causal Imitation Learning Under Temporally Correlated Noise (2022)0.00
- Invariant Causal Imitation Learning For Generalizable Policies (2023)0.00
- Causal Imitation Learning Under Measurement Error And Distribution Shift (2026)0.00
- Instrumental Variable Value Iteration For Causal Offline Reinforcement Learning (2021)0.00
- Causal Confusion In Imitation Learning (2019)0.00