Active Exploration For Inverse Reinforcement Learning
2022 · David Lindner, Andreas Krause, Giorgia Ramponi
Abstract
Inverse Reinforcement Learning (IRL) is a powerful paradigm for inferring a reward function from expert demonstrations. Many IRL algorithms require a known transition model and sometimes even a known expert policy, or they at least require access to a generative model. However, these assumptions are too strong for many real-world applications, where the environment can be accessed only through sequential interaction. We propose a novel IRL algorithm: Active exploration for Inverse Reinforcement Learning (AceIRL), which actively explores an unknown environment and expert policy to quickly learn the expert's reward function and identify a good policy. AceIRL uses previous observations to construct confidence intervals that capture plausible reward functions and find exploration policies that focus on the most informative regions of the environment. AceIRL is the first approach to active IRL with sample-complexity bounds that does not require a generative model of the environment.
Related papers
- Inverse Reinforcement Learning Without Reinforcement Learning (2023)
- Active Learning For Risk-sensitive Inverse Reinforcement Learning (2019)
- Towards Theoretical Understanding Of Inverse Reinforcement Learning (2023)
- Is Inverse Reinforcement Learning Harder Than Standard Reinforcement Learning? A Theoretical Perspective (2023)
- Offline Inverse RL: New Solution Concepts And Provably Efficient Algorithms (2024)
- A Survey Of Inverse Reinforcement Learning: Challenges, Methods And Progress (2018)
- Provably Efficient Exploration In Inverse Constrained Reinforcement Learning (2024)
- Inverse Reinforcement Learning With Simultaneous Estimation Of Rewards And Dynamics (2016)