CWAE-IRL: Formulating A Supervised Approach To Inverse Reinforcement Learning Problem
2019 Β· Arpan Kusari
Abstract
Inverse reinforcement learning (IRL) is used to infer the reward function from the actions of an expert running a Markov Decision Process (MDP). A novel approach using variational inference for learning the reward function is proposed in this research. Using this technique, the intractable posterior distribution of the continuous latent variable (the reward function in this case) is analytically approximated to appear to be as close to the prior belief while trying to reconstruct the future state conditioned on the current state and action. The reward function is derived using a well-known deep generative model known as Conditional Variational Auto-encoder (CVAE) with Wasserstein loss function, thus referred to as Conditional Wasserstein Auto-encoder-IRL (CWAE-IRL), which can be analyzed as a combination of the backward and forward inference. This can then form an efficient alternative to the previous approaches to IRL while having no knowledge of the system dynamics of the agent. Expe
Authors
(none)
Tags
Stats
Related papers
- Towards Theoretical Understanding Of Inverse Reinforcement Learning (2023)0.00
- Inverse Reinforcement Learning With Simultaneous Estimation Of Rewards And Dynamics (2016)0.00
- A Survey Of Inverse Reinforcement Learning: Challenges, Methods And Progress (2018)0.00
- Active Exploration For Inverse Reinforcement Learning (2022)0.00
- Maximum-likelihood Inverse Reinforcement Learning With Finite-time Guarantees (2022)0.00
- Inverse Reinforcement Learning Without Reinforcement Learning (2023)0.00
- Inverse Reinforcement Learning With Explicit Policy Estimates (2021)2.26
- Offline Inverse RL: New Solution Concepts And Provably Efficient Algorithms (2024)0.00