Inverse Reinforcement Learning Using Revealed Preferences And Passive Stochastic Optimization
2025 · Vikram Krishnamurthy
Abstract
This monograph, spanning three chapters, explores inverse reinforcement learning (IRL). The first two chapters view IRL through the lens of revealed preferences from microeconomics, while the third chapter studies adaptive IRL via Langevin dynamics stochastic gradient algorithms. Chapter 1 uses classical revealed preference theory (Afriat's theorem and extensions) to identify constrained utility maximizers based on observed agent actions. This allows for the reconstruction of set-valued estimates of an agent's utility. We illustrate this procedure by identifying the presence of a cognitive radar and reconstructing its utility function. The chapter also addresses the construction of a statistical detector for utility maximization behavior when agent actions are corrupted by noise. Chapter 2 studies Bayesian IRL. It investigates how an analyst can determine if an observed agent is a rationally inattentive Bayesian utility maximizer (i.e., simultaneously
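As a rough illustration of the revealed-preference test mentioned in the abstract, the sketch below checks the Generalized Axiom of Revealed Preference (GARP). By Afriat's theorem, GARP is equivalent to the existence of a utility function that rationalizes the observed choices. The sketch assumes the classical textbook setup of linear budgets with price vectors `prices[t]` and chosen bundles `bundles[t]`. The function name `satisfies_garp` and the tolerance `tol` are hypothetical choices made for this example and are not taken from the monograph.

```python
def satisfies_garp(prices, bundles, tol=1e-9):
    """Return True if the (price, bundle) observations satisfy GARP.

    Assumption: classical linear budgets, so bundle s is affordable at
    observation t whenever prices[t] . bundles[s] <= prices[t] . bundles[t].
    """
    n = len(prices)
    # cost[t][s] = price vector of observation t dotted with bundle s
    cost = [
        [sum(p * x for p, x in zip(prices[t], bundles[s])) for s in range(n)]
        for t in range(n)
    ]
    # rel[t][s]: bundle t is directly revealed preferred to bundle s
    rel = [[cost[t][t] >= cost[t][s] - tol for s in range(n)] for t in range(n)]
    # Warshall's algorithm: transitive closure of the direct relation
    for k in range(n):
        for i in range(n):
            if rel[i][k]:
                for j in range(n):
                    if rel[k][j]:
                        rel[i][j] = True
    # GARP is violated if t is revealed preferred to s while
    # s is strictly directly revealed preferred to t
    for t in range(n):
        for s in range(n):
            if rel[t][s] and cost[s][s] > cost[s][t] + tol:
                return False
    return True


# Two observations where neither bundle was affordable at the other's prices
consistent = satisfies_garp([(1, 2), (2, 1)], [(4, 1), (1, 4)])
# Two observations where each bundle was strictly cheaper at the other's prices
inconsistent = satisfies_garp([(1, 2), (2, 1)], [(1, 4), (4, 1)])
```

When the check passes, Afriat's theorem guarantees that a feasible set of utility levels and multipliers exists, which is what yields a set-valued rather than a point estimate of the utility.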
Related papers
- Inverse Reinforcement Learning With Explicit Policy Estimates (2021)
- Langevin Dynamics For Adaptive Inverse Reinforcement Learning Of Stochastic Gradient Algorithms (2020)
- Inverse-inverse Reinforcement Learning. How To Hide Strategy From An Adversarial Inverse Reinforcement Learner (2022)
- Towards Theoretical Understanding Of Inverse Reinforcement Learning (2023)
- A Survey Of Inverse Reinforcement Learning: Challenges, Methods And Progress (2018)
- Efficient Inference For Inverse Reinforcement Learning And Dynamic Discrete Choice Models (2025)
- Inverse Reinforcement Learning Using Just Classification And A Few Regressions (2025)
- Accounting For Human Learning When Inferring Human Preferences (2020)