Inducing Stackelberg Equilibrium Through Spatio-temporal Sequential Decision-making In Multi-agent Reinforcement Learning
2023 Β· Bin Zhang, Lijuan Li, Zhiwei Xu, et al.
Abstract
In multi-agent reinforcement learning (MARL), self-interested agents attempt to establish equilibrium and achieve coordination depending on game structure. However, existing MARL approaches are mostly bound by the simultaneous actions of all agents in the Markov game (MG) framework, and few works consider the formation of equilibrium strategies via asynchronous action coordination. In view of the advantages of Stackelberg equilibrium (SE) over Nash equilibrium, we construct a spatio-temporal sequential decision-making structure derived from the MG and propose an N-level policy model based on a conditional hypernetwork shared by all agents. This approach allows for asymmetric training with symmetric execution, with each agent responding optimally conditioned on the decisions made by superior agents. Agents can learn heterogeneous SE policies while still maintaining parameter sharing, which leads to reduced cost for learning and storage and enhanced scalability as the number of agents in
Authors
(none)
Tags
Stats
Related papers
- Stackelberg Decision Transformer For Asynchronous Action Coordination In Multi-agent Systems (2023)0.00
- Equilibrium Selection For Multi-agent Reinforcement Learning: A Unified Framework (2024)0.00
- Stackelberg Games For Learning Emergent Behaviors During Competitive Autocurricula (2023)5.84
- Oracles & Followers: Stackelberg Equilibria In Deep Multi-agent Reinforcement Learning (2022)0.00
- Dealing With Non-stationarity In Decentralized Cooperative Multi-agent Deep Reinforcement Learning Via Multi-timescale Learning (2023)0.00
- Maximum Entropy Heterogeneous-agent Reinforcement Learning (2023)0.00
- Multi-agent Reinforcement Learning In Stochastic Networked Systems (2020)0.00
- Hierarchical Deep Multiagent Reinforcement Learning With Temporal Abstraction (2018)0.00