Causal Imitation Learning Under Temporally Correlated Noise
2022 Β· Gokul Swamy, Sanjiban Choudhury, J. Andrew Bagnell, et al.
Abstract
We develop algorithms for imitation learning from policy data that was corrupted by temporally correlated noise in expert actions. When noise affects multiple timesteps of recorded data, it can manifest as spurious correlations between states and actions that a learner might latch on to, leading to poor policy performance. To break up these spurious correlations, we apply modern variants of the instrumental variable regression (IVR) technique of econometrics, enabling us to recover the underlying policy without requiring access to an interactive expert. In particular, we present two techniques, one of a generative-modeling flavor (DoubIL) that can utilize access to a simulator, and one of a game-theoretic flavor (ResiduIL) that can be run entirely offline. We find both of our algorithms compare favorably to behavioral cloning on simulated control tasks.
Authors
(none)
Tags
Stats
Related papers
- Causal Imitation Learning Under Measurement Error And Distribution Shift (2026)0.00
- Invariant Causal Imitation Learning For Generalizable Policies (2023)0.00
- Causal Confusion In Imitation Learning (2019)0.00
- Confounded Causal Imitation Learning With Instrumental Variables (2025)0.00
- Causal Imitation Learning With Unobserved Confounders (2022)0.00
- Causal Imitation Learning Under Expert-observable And Expert-unobservable Confounding (2025)0.00
- Feedback In Imitation Learning: The Three Regimes Of Covariate Shift (2021)0.00
- The Pitfalls Of Imitation Learning When Actions Are Continuous (2025)0.00