DEALIO: Data-efficient Adversarial Learning For Imitation From Observation
2021 Β· Faraz Torabi, Garrett Warnell, Peter Stone
Abstract
In imitation learning from observation IfO, a learning agent seeks to imitate a demonstrating agent using only observations of the demonstrated behavior without access to the control signals generated by the demonstrator. Recent methods based on adversarial imitation learning have led to state-of-the-art performance on IfO problems, but they typically suffer from high sample complexity due to a reliance on data-inefficient, model-free reinforcement learning algorithms. This issue makes them impractical to deploy in real-world settings, where gathering samples can incur high costs in terms of time, energy, and risk. In this work, we hypothesize that we can incorporate ideas from model-based reinforcement learning with adversarial methods for IfO in order to increase the data efficiency of these methods without sacrificing performance. Specifically, we consider time-varying linear Gaussian policies, and propose a method that integrates the linear-quadratic regulator with path integral po
Authors
(none)
Tags
Stats
Related papers
- Imitation From Observation With Bootstrapped Contrastive Learning (2023)0.00
- On Discovering Algorithms For Adversarial Imitation Learning (2025)0.00
- Provably Efficient Imitation Learning From Observation Alone (2019)0.00
- Imitating Opponent To Win: Adversarial Policy Imitation Learning In Two-player Competitive Games (2022)0.00
- Discriminator-actor-critic: Addressing Sample Inefficiency And Reward Bias In Adversarial Imitation Learning (2018)0.00
- Imitation Learning From Observations By Minimizing Inverse Dynamics Disagreement (2019)0.00
- A Dual Approach To Imitation Learning From Observations With Offline Datasets (2024)0.00
- Non-adversarial Imitation Learning And Its Connections To Adversarial Methods (2020)0.00