N-agent Ad Hoc Teamwork
2024 Β· Caroline Wang, Arrasy Rahman, Ishan Durugkar, et al.
Abstract
Current approaches to learning cooperative multi-agent behaviors assume relatively restrictive settings. In standard fully cooperative multi-agent reinforcement learning, the learning algorithm controls \(\textit\{all\}\) agents in the scenario, while in ad hoc teamwork, the learning algorithm usually assumes control over only a \(\textit\{single\}\) agent in the scenario. However, many cooperative settings in the real world are much less restrictive. For example, in an autonomous driving scenario, a company might train its cars with the same learning algorithm, yet once on the road, these cars must cooperate with cars from another company. Towards expanding the class of scenarios that cooperative learning methods may optimally address, we introduce \(N\)-agent ad hoc teamwork (NAHT), where a set of autonomous agents must interact and cooperate with dynamically varying numbers and types of teammates. This paper formalizes the problem, and proposes the Policy Optimization with Agent Mod
Authors
(none)
Tags
Stats
Related papers
- Open Ad Hoc Teamwork With Cooperative Game Theory (2024)1.40
- A General Learning Framework For Open Ad Hoc Teamwork Using Graph-based Policy Learning (2022)0.00
- Generating Teammates For Training Robust Ad Hoc Teamwork Agents Via Best-response Diversity (2022)0.00
- Learning To Coordinate With Anyone (2023)0.00
- Zero-shot Coordination In Ad Hoc Teams With Generalized Policy Improvement And Difference Rewards (2025)0.00
- Learning Heterogeneous Agent Cooperation Via Multiagent League Training (2022)7.16
- Coordinating Fully-cooperative Agents Using Hierarchical Learning Anticipation (2023)0.00
- Behavioral Differences Is The Key Of Ad-hoc Team Cooperation In Multiplayer Games Hanabi (2023)0.00