Logit-Q Dynamics For Efficient Learning In Stochastic Teams
2023 · Ahmed Said Donmez, Onur Unlu, Muhammed O. Sayin
Abstract
We present a new family of logit-Q dynamics for efficient learning in stochastic games. The dynamics combine log-linear learning (also known as logit dynamics) for the repeated play of normal-form games with Q-learning for unknown Markov decision processes within the auxiliary stage-game framework. In this framework, we view a stochastic game as agents repeatedly playing a stage game associated with the current state of the underlying game, while the agents' Q-functions determine the payoffs of these stage games. We show that the logit-Q dynamics reach a (near-)efficient equilibrium in stochastic teams with unknown dynamics, and we quantify the approximation error. We also show that the logit-Q dynamics are rational against agents following pure stationary strategies and, beyond stochastic teams, that they converge in stochastic games where the stage payoffs induce potential games and only a single agent controls the state transitions. The key idea is to approximate the dy
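The two ingredients named in the abstract can be sketched in their generic textbook forms: a logit (softmax) choice over action payoffs, and a tabular Q-learning update. This is a minimal single-agent sketch, not the authors' logit-Q dynamics. The function names, the `temperature`, `step_size`, and `discount` parameters, and the use of a max over next-state Q-values are all assumptions made for illustration.

```python
import math
import random


def logit_probs(payoffs, temperature):
    """Logit (softmax) distribution over actions for the given payoffs.

    A lower temperature puts more weight on the highest-payoff action.
    """
    m = max(payoffs)  # subtract the max for numerical stability
    weights = [math.exp((p - m) / temperature) for p in payoffs]
    total = sum(weights)
    return [w / total for w in weights]


def logit_choice(payoffs, temperature, rng):
    """Sample an action index from the logit distribution."""
    probs = logit_probs(payoffs, temperature)
    return rng.choices(range(len(payoffs)), weights=probs, k=1)[0]


def q_update(q, state, action, reward, next_state, step_size, discount):
    """Textbook tabular Q-learning step (assumed form, not the paper's)."""
    target = reward + discount * max(q[next_state])
    q[state][action] += step_size * (target - q[state][action])
    return q[state][action]


if __name__ == "__main__":
    rng = random.Random(0)
    q = [[0.0, 0.0], [1.0, 2.0]]  # hypothetical 2-state, 2-action table
    action = logit_choice(q[0], temperature=0.5, rng=rng)
    q_update(q, state=0, action=action, reward=1.0, next_state=1,
             step_size=0.5, discount=0.9)
    print(q)
```

In the stage-game view described above, the payoffs passed to the logit choice would come from the agent's Q-function at the current state. How the paper couples the two updates across agents is not specified in this excerpt.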
Related papers
- Episodic Logit-Q Dynamics For Efficient Learning In Stochastic Teams (2022)
- Beyond Strict Competition: Approximate Convergence Of Multi Agent Q-learning Dynamics (2023)
- Independent And Decentralized Learning In Markov Potential Games (2022)
- Two-timescale Q-learning With Function Approximation In Zero-sum Stochastic Games (2023)
- Fictitious Play In Zero-sum Stochastic Games (2020)
- Balancing Two-player Stochastic Games With Soft Q-learning (2018)
- On The Heterogeneity Of Independent Learning Dynamics In Zero-sum Stochastic Games (2021)
- A Finite-sample Analysis Of Payoff-based Independent Learning In Zero-sum Stochastic Games (2023)