Decision-making With Speculative Opponent Models
2022 · Jing Sun, Shuo Chen, Cong Zhang, et al.
Abstract
Opponent modelling has proven effective in enhancing the decision-making of the controlled agent by constructing models of opponent agents. However, existing methods often rely on access to the observations and actions of opponents, a requirement that is infeasible when such information is either unobservable or challenging to obtain. To address this issue, we introduce Distributional Opponent-aided Multi-agent Actor-Critic (DOMAC), the first speculative opponent modelling algorithm that relies solely on local information (i.e., the controlled agent's observations, actions, and rewards). Specifically, the actor maintains a speculated belief about the opponents using the tailored speculative opponent models that predict the opponents' actions using only local information. Moreover, DOMAC features distributional critic models that estimate the return distribution of the actor's policy, yielding a more fine-grained assessment of the actor's quality. This thus more effectively guides the t
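The two components named in the abstract can be illustrated with a minimal sketch. It makes several assumptions that are not taken from the paper: linear-softmax models stand in for neural networks, the distributional critic is represented by a fixed set of quantile estimates trained with a quantile-regression (pinball-loss) step, and every function name here (`speculate_opponent`, `actor_policy`, `quantile_update`) is hypothetical. It is meant only to show how an opponent belief can be formed from local observations and how a return distribution, rather than a single expected value, can be tracked.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, x):
    """Apply a weight matrix (list of rows) to an input vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def speculate_opponent(opp_weights, local_obs):
    """Hypothetical speculative opponent model.

    Returns a belief over the opponent's actions computed from the
    controlled agent's own observation only (no opponent data).
    """
    return softmax(linear(opp_weights, local_obs))


def actor_policy(actor_weights, local_obs, opp_belief):
    """Hypothetical actor: conditions on the local observation
    concatenated with the speculated opponent belief."""
    return softmax(linear(actor_weights, local_obs + opp_belief))


def quantile_update(quantiles, sampled_return, lr=0.05):
    """One quantile-regression step (assumed critic representation).

    Each entry estimates the return at quantile level tau_i = (i + 0.5) / n
    and moves along the negative subgradient of the pinball loss.
    """
    n = len(quantiles)
    updated = []
    for i, q in enumerate(quantiles):
        tau = (i + 0.5) / n
        step = tau if sampled_return > q else tau - 1.0
        updated.append(q + lr * step)
    return updated
```

In this sketch, calling `speculate_opponent` and then `actor_policy` on the same local observation yields an action distribution that depends on the speculated belief, while repeated calls to `quantile_update` with sampled returns move the quantile estimates toward the empirical return distribution.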
Related papers
- Model-based Opponent Modeling (2021) 0.00
- Option-critic In Cooperative Multi-agent Systems (2019) 0.00
- Opponent Learning Awareness And Modelling In Multi-objective Normal Form Games (2020) 7.16
- Variational Autoencoders For Opponent Modeling In Multi-agent Systems (2020) 0.00
- Adaptive Opponent Policy Detection In Multi-agent MDPs: Real-time Strategy Switch Identification Using Running Error Estimation (2024) 0.00
- Consistent Opponent Modeling In Imperfect-information Games (2025) 0.00
- Metric Policy Representations For Opponent Modeling (2021) 0.00
- Model-based Multi-agent Policy Optimization With Adaptive Opponent-wise Rollouts (2021) 0.00