Learning Human Rewards By Inferring Their Latent Intelligence Levels In Multi-agent Games: A Theory-of-mind Approach With Application To Driving Data
2021 Β· Ran Tian, Masayoshi Tomizuka, Liting Sun
Abstract
Reward function, as an incentive representation that recognizes humans' agency and rationalizes humans' actions, is particularly appealing for modeling human behavior in human-robot interaction. Inverse Reinforcement Learning is an effective way to retrieve reward functions from demonstrations. However, it has always been challenging when applying it to multi-agent settings since the mutual influence between agents has to be appropriately modeled. To tackle this challenge, previous work either exploits equilibrium solution concepts by assuming humans as perfectly rational optimizers with unbounded intelligence or pre-assigns humans' interaction strategies a priori. In this work, we advocate that humans are bounded rational and have different intelligence levels when reasoning about others' decision-making process, and such an inherent and latent characteristic should be accounted for in reward learning algorithms. Hence, we exploit such insights from Theory-of-Mind and propose a new mu
Authors
(none)
Tags
Stats
Related papers
- Theory Of Mind As Intrinsic Motivation For Multi-agent Reinforcement Learning (2023)0.00
- Multi-agent Inverse Reinforcement Learning: Suboptimal Demonstrations And Alternative Solution Concepts (2021)0.00
- Improving Multimodal Interactive Agents With Reinforcement Learning From Human Feedback (2022)0.00
- Human-level Reinforcement Learning Through Theory-based Modeling, Exploration, And Planning (2021)0.00
- Scalable Agent Alignment Via Reward Modeling: A Research Direction (2018)0.00
- Evaluating Agents Without Rewards (2020)0.00
- Bounded Risk-sensitive Markov Games: Forward Policy Design And Inverse Reward Learning With Iterative Reasoning And Cumulative Prospect Theory (2020)0.00
- Reward Shaping For Human Learning Via Inverse Reinforcement Learning (2020)0.00