FP-IRL: Fokker-Planck Inverse Reinforcement Learning -- A Physics-Constrained Approach to Markov Decision Processes
2023 · Chengyang Huang, Siddhartha Srivastava, Kenneth K. Y. Ho, et al.
Abstract
Inverse reinforcement learning (IRL) is a powerful paradigm for uncovering the incentive structure that drives agent behavior, by inferring an unknown reward function from observed trajectories within a Markov decision process (MDP). However, most existing IRL methods require access to the transition function, either prescribed or estimated a priori, which poses significant challenges when the underlying dynamics are unknown, unobservable, or not easily sampled. We propose Fokker-Planck inverse reinforcement learning (FP-IRL), a novel physics-constrained IRL framework tailored for systems governed by Fokker-Planck (FP) dynamics. FP-IRL simultaneously infers both the reward and transition functions directly from trajectory data, without requiring access to sampled transitions. Our method leverages a conjectured equivalence between MDPs and the FP equation, linking reward maximization in MDPs with free energy minimization in FP dynamics. This connection enables inference o
Related papers
- Inverse Reinforcement Learning With Simultaneous Estimation Of Rewards And Dynamics (2016)
- Task-guided IRL In POMDPs That Scales (2022)
- Towards Theoretical Understanding Of Inverse Reinforcement Learning (2023)
- Distributional Inverse Reinforcement Learning (2025)
- Task-guided Inverse Reinforcement Learning Under Partial Information (2021)
- CWAE-IRL: Formulating A Supervised Approach To Inverse Reinforcement Learning Problem (2019)
- Inverse Reinforcement Learning Without Reinforcement Learning (2023)
- Efficient Inference For Inverse Reinforcement Learning And Dynamic Discrete Choice Models (2025)