Multi-agent Inverse Reinforcement Learning: Suboptimal Demonstrations And Alternative Solution Concepts
2021 Β· Sage Bergerson
Abstract
Multi-agent inverse reinforcement learning (MIRL) can be used to learn reward functions from agents in social environments. To model realistic social dynamics, MIRL methods must account for suboptimal human reasoning and behavior. Traditional formalisms of game theory provide computationally tractable behavioral models, but assume agents have unrealistic cognitive capabilities. This research identifies and compares mechanisms in MIRL methods which a) handle noise, biases and heuristics in agent decision making and b) model realistic equilibrium solution concepts. MIRL research is systematically reviewed to identify solutions for these challenges. The methods and results of these studies are analyzed and compared based on factors including performance accuracy, efficiency, and descriptive quality. We found that the primary methods for handling noise, biases and heuristics in MIRL were extensions of Maximum Entropy (MaxEnt) IRL to multi-agent settings. We also found that many successful
Authors
(none)
Tags
Stats
Related papers
- Multi-agent Inverse Reinforcement Learning For Certain General-sum Stochastic Games (2018)10.97
- On Feasible Rewards In Multi-agent Inverse Reinforcement Learning (2024)0.00
- Adversarial Inverse Reinforcement Learning For Mean Field Games (2021)0.00
- Discovering Individual Rewards In Collective Behavior Through Inverse Multi-agent Reinforcement Learning (2023)0.00
- Learning Human Rewards By Inferring Their Latent Intelligence Levels In Multi-agent Games: A Theory-of-mind Approach With Application To Driving Data (2021)0.00
- Understanding The World To Solve Social Dilemmas Using Multi-agent Reinforcement Learning (2023)0.00
- Emergent Social Learning Via Multi-agent Reinforcement Learning (2020)0.00
- Learning Behavioral Soft Constraints From Demonstrations (2022)0.00