Inforl: Interpretable Reinforcement Learning Using Information Maximization
2019 Β· Aadil Hayat, Utsav Singh, Vinay P. Namboodiri
Abstract
Recent advances in reinforcement learning have proved that given an environment we can learn to perform a task in that environment if we have access to some form of a reward function (dense, sparse or derived from IRL). But most of the algorithms focus on learning a single best policy to perform a given set of tasks. In this paper, we focus on an algorithm that learns to not just perform a task but different ways to perform the same task. As we know when the environment is complex enough there always exists multiple ways to perform a task. We show that using the concept of information maximization it is possible to learn latent codes for discovering multiple ways to perform any given task in an environment.
Authors
(none)
Tags
Stats
Related papers
- Maxinforl: Boosting Exploration In Reinforcement Learning Through Information Gain Maximization (2024)0.00
- Maximum-likelihood Inverse Reinforcement Learning With Finite-time Guarantees (2022)0.00
- Information Directed Reward Learning For Reinforcement Learning (2021)0.00
- Inverse Reinforcement Learning With Explicit Policy Estimates (2021)2.26
- Towards Theoretical Understanding Of Inverse Reinforcement Learning (2023)0.00
- Task-guided Inverse Reinforcement Learning Under Partial Information (2021)0.00
- Basis For Intentions: Efficient Inverse Reinforcement Learning Using Past Experience (2022)0.00
- Inverse Reinforcement Learning With Simultaneous Estimation Of Rewards And Dynamics (2016)0.00