Collaborating With Humans Without Human Data
2021 Β· Dj Strouse, Kevin R. McKee, Matt Botvinick, et al.
Abstract
Collaborating with humans requires rapidly adapting to their individual strengths, weaknesses, and preferences. Unfortunately, most standard multi-agent reinforcement learning techniques, such as self-play (SP) or population play (PP), produce agents that overfit to their training partners and do not generalize well to humans. Alternatively, researchers can collect human data, train a human model using behavioral cloning, and then use that model to train "human-aware" agents ("behavioral cloning play", or BCP). While such an approach can improve the generalization of agents to new human co-players, it involves the onerous and expensive step of collecting large amounts of human data first. Here, we study the problem of how to train agents that collaborate well with human partners without using human data. We argue that the crux of the problem is to produce a diverse set of training partners. Drawing inspiration from successful multi-agent approaches in competitive domains, we find that
Authors
(none)
Tags
Stats
Related papers
- Learning Zero-shot Cooperation With Humans, Assuming Humans Are Biased (2023)0.00
- A Hierarchical Approach To Population Training For Human-ai Collaboration (2023)0.00
- Human-ai Coordination Via Human-regularized Search And Learning (2022)0.00
- Reinforcement Learning On Human Decision Models For Uniquely Collaborative AI Teammates (2021)0.00
- Enhancing Human Experience In Human-agent Collaboration: A Human-centered Modeling Approach Based On Positive Human Gain (2024)0.00
- Maximum Entropy Population-based Training For Zero-shot Human-ai Coordination (2021)0.00
- Collaboration Of AI Agents Via Cooperative Multi-agent Deep Reinforcement Learning (2019)0.00
- Training Generalizable Collaborative Agents Via Strategic Risk Aversion (2026)0.00