Efficient Episodic Learning Of Nonstationary And Unknown Zero-sum Games Using Expert Game Ensembles
2021 · Yunian Pan, Quanyan Zhu
Abstract
Game theory provides essential analysis in many applications of strategic interactions. However, the questions of how to construct a game model and what its fidelity is are seldom addressed. In this work, we consider learning in a class of repeated zero-sum games with an unknown, time-varying payoff matrix and noisy feedback, making use of an ensemble of benchmark game models. These models can be pre-trained and collected dynamically during sequential plays. They serve as prior side information and imperfectly underpin the unknown true game model. We propose OFULinMat, an episodic learning algorithm that integrates the adaptive estimation of game models and the learning of the strategies. The proposed algorithm is shown to achieve a sublinear bound on the saddle-point regret. We show that this algorithm is provably efficient through both theoretical analysis and numerical examples. We use a dynamic honeypot allocation game as a case study to illustrate and corroborate our results. We a
Related papers
- Bayesian Learning In Episodic Zero-sum Games (2026)
- Model-free Learning For Two-player Zero-sum Partially Observable Markov Games With Perfect Recall (2021)
- Efficient Exploration Of Zero-sum Stochastic Games (2020)
- Online Learning In Unknown Markov Games (2020)
- Last-iterate Convergence Of Payoff-based Independent Learning In Zero-sum Stochastic Games (2024)
- Learning With Episodic Hypothesis Testing In General Games: A Framework For Equilibrium Selection (2025)
- Learning In Zero-sum Markov Games: Relaxing Strong Reachability And Mixing Time Assumptions (2023)
- Convergence Of Heterogeneous Learning Dynamics In Zero-sum Stochastic Games (2023)