Incorporating Human Flexibility Through Reward Preferences In Human-ai Teaming
2023 Β· Siddhant Bhambri, Mudit Verma, Upasana Biswas, et al.
Abstract
Preference-based Reinforcement Learning (PbRL) has made significant strides in single-agent settings, but has not been studied for multi-agent frameworks. On the other hand, modeling cooperation between multiple agents, specifically, Human-AI Teaming settings while ensuring successful task completion is a challenging problem. To this end, we perform the first investigation of multi-agent PbRL by extending single-agent PbRL to the two-agent teaming settings and formulate it as a Human-AI PbRL Cooperation Game, where the RL agent queries the human-in-the-loop to elicit task objective and human's preferences on the joint team behavior. Under this game formulation, we first introduce the notion of Human Flexibility to evaluate team performance based on if humans prefer to follow a fixed policy or adapt to the RL agent on the fly. Secondly, we study the RL agent's varying access to the human policy. We highlight a special case along these two dimensions, which we call Specified Orchestratio
Authors
(none)
Tags
Stats
Related papers
- Ra-pbrl: Provably Efficient Risk-aware Preference-based Reinforcement Learning (2024)0.00
- In Pursuit Of Predictive Models Of Human Preferences Toward AI Teammates (2025)0.00
- Enhancing Human Experience In Human-agent Collaboration: A Human-centered Modeling Approach Based On Positive Human Gain (2024)0.00
- Human-ai Coordination Via Human-regularized Search And Learning (2022)0.00
- Collaborating With Humans Without Human Data (2021)0.00
- Symbol Guided Hindsight Priors For Reward Learning From Human Preferences (2022)0.00
- Preference-based Multi-agent Reinforcement Learning: Data Coverage And Algorithmic Techniques (2024)0.00
- Improving Multimodal Interactive Agents With Reinforcement Learning From Human Feedback (2022)0.00