Data-driven Inverse Reinforcement Learning For Expert-learner Zero-sum Games
2023 · Wenqian Xue, Bosen Lian, Jialu Fan, et al.
Abstract
In this paper, we formulate inverse reinforcement learning (IRL) as an expert-learner interaction in which the optimal performance intent of an expert or target agent is unknown to a learner agent. The learner observes the states and controls of the expert and seeks to reconstruct the expert's cost function, and thereby to mimic the expert's optimal response. Next, we add non-cooperative disturbances that seek to disrupt the learning and stability of the learner agent. This leads to the formulation of a new interaction we call zero-sum game IRL. We develop a framework to solve the zero-sum game IRL problem that extends RL policy iteration (PI) so that unknown expert performance intentions can be computed and non-cooperative disturbances rejected. The framework has two parts: a value function and control action update based on an extension of PI, and a cost function update based on standard inverse optimal control. Then, we eventually develop an off-policy
Related papers
- Inverse Reinforcement Learning Without Reinforcement Learning (2023)
- Active Exploration For Inverse Reinforcement Learning (2022)
- Towards Theoretical Understanding Of Inverse Reinforcement Learning (2023)
- Is Inverse Reinforcement Learning Harder Than Standard Reinforcement Learning? A Theoretical Perspective (2023)
- Offline Inverse RL: New Solution Concepts And Provably Efficient Algorithms (2024)
- Task-guided Inverse Reinforcement Learning Under Partial Information (2021)
- Non-adversarial Inverse Reinforcement Learning Via Successor Feature Matching (2024)
- Maximum-likelihood Inverse Reinforcement Learning With Finite-time Guarantees (2022)