Knowpc: Knowledge-driven Programmatic Reinforcement Learning For Zero-shot Coordination
2024 Β· Yin Gu, Qi Liu, Zhi Li, et al.
Abstract
Zero-shot coordination (ZSC) remains a major challenge in the cooperative AI field, which aims to learn an agent to cooperate with an unseen partner in training environments or even novel environments. In recent years, a popular ZSC solution paradigm has been deep reinforcement learning (DRL) combined with advanced self-play or population-based methods to enhance the neural policy's ability to handle unseen partners. Despite some success, these approaches usually rely on black-box neural networks as the policy function. However, neural networks typically lack interpretability and logic, making the learned policies difficult for partners (e.g., humans) to understand and limiting their generalization ability. These shortcomings hinder the application of reinforcement learning methods in diverse cooperative scenarios.We suggest to represent the agent's policy with an interpretable program. Unlike neural networks, programs contain stable logic, but they are non-differentiable and difficult
Authors
(none)
Tags
Stats
Related papers
- Noisy Zero-shot Coordination: Breaking The Common Knowledge Assumption In Zero-shot Coordination Games (2024)0.00
- Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In The Game Of Hanabi (2023)0.00
- Cross-environment Cooperation Enables Zero-shot Multi-agent Coordination (2025)0.00
- Tackling Cooperative Incompatibility For Zero-shot Human-ai Coordination (2023)0.00
- A New Formalism, Method And Open Issues For Zero-shot Coordination (2021)0.00
- Zero Shot Coordination For Sparse Reward Tasks With Diverse Reward Shapings (2026)0.00
- Heterogeneous Multi-agent Zero-shot Coordination By Coevolution (2022)5.24
- "other-play" For Zero-shot Coordination (2020)0.00