HARP: Human-assisted Regrouping With Permutation Invariant Critic For Multi-agent Reinforcement Learning
2024 · Huawen Hu, Enze Shi, Chenxi Yue, et al.
Abstract
Human-in-the-loop reinforcement learning integrates human expertise to accelerate agent learning and provide critical guidance and feedback in complex fields. However, many existing approaches focus on single-agent tasks and require continuous human involvement during the training process, significantly increasing the human workload and limiting scalability. In this paper, we propose HARP (Human-Assisted Regrouping with Permutation Invariant Critic), a multi-agent reinforcement learning framework designed for group-oriented tasks. HARP integrates automatic agent regrouping with strategic human assistance during deployment, enabling non-experts to offer effective guidance with minimal intervention. During training, agents dynamically adjust their groupings to optimize collaborative task completion. When deployed, they actively seek human assistance and utilize the Permutation Invariant Group Critic to evaluate and refine human-proposed groupings, allowing non-expert users t
Related papers
- Heterogeneous-agent Reinforcement Learning (2023) · 0.00
- Halypo: Heterogeneous-agent Lyapunov Policy Optimization For Human-robot Collaboration (2026) · 0.00
- Hierarchical Reinforcement Learning For Optimal Agent Grouping In Cooperative Systems (2025) · 0.00
- GHQ: Grouped Hybrid Q Learning For Heterogeneous Cooperative Multi-agent Reinforcement Learning (2023) · 6.34
- Maximum Entropy Heterogeneous-agent Reinforcement Learning (2023) · 0.00
- Learning Heterogeneous Agent Cooperation Via Multiagent League Training (2022) · 7.16
- Heterogeneous Multi-robot Reinforcement Learning (2023) · 6.77
- Hierarchical Multi-agent Reinforcement Learning For Air Combat Maneuvering (2023) · 8.82