Causal Imitation Learning Under Expert-observable And Expert-unobservable Confounding
2025 Β· Daqian Shao, Thomas Kleine Buening, Marta Kwiatkowska
Abstract
We propose a general framework for causal Imitation Learning (IL) with hidden confounders, which subsumes several existing settings. Our framework accounts for two types of hidden confounders: (a) variables observed by the expert but not by the imitator, and (b) confounding noise hidden from both. By leveraging trajectory histories as instruments, we reformulate causal IL in our framework into a Conditional Moment Restriction (CMR) problem. We propose DML-IL, an algorithm that solves this CMR problem via instrumental variable regression, and upper bound its imitation gap. Empirical evaluation on continuous state-action environments, including Mujoco tasks, demonstrates that DML-IL outperforms existing causal IL baselines.
Authors
(none)
Tags
Stats
Related papers
- Causal Imitation Learning With Unobserved Confounders (2022)0.00
- Confounded Causal Imitation Learning With Instrumental Variables (2025)0.00
- Causal Imitation Learning Under Measurement Error And Distribution Shift (2026)0.00
- Sequential Causal Imitation Learning With Unobserved Confounders (2022)0.00
- Invariant Causal Imitation Learning For Generalizable Policies (2023)0.00
- Causal Confusion In Imitation Learning (2019)0.00
- Causal Imitation Learning Under Temporally Correlated Noise (2022)0.00
- Toward The Fundamental Limits Of Imitation Learning (2020)0.00