The Dynamics Of Q-learning In Population Games: A Physics-inspired Continuity Equation Model
2022 Β· Shuyue Hu, Chin-Wing Leung, Ho-Fung Leung, et al.
Abstract
Although learning has found wide application in multi-agent systems, its effects on the temporal evolution of a system are far from understood. This paper focuses on the dynamics of Q-learning in large-scale multi-agent systems modeled as population games. We revisit the replicator equation model for Q-learning dynamics and observe that this model is inappropriate for our concerned setting. Motivated by this, we develop a new formal model, which bears a formal connection with the continuity equation in physics. We show that our model always accurately describes the Q-learning dynamics in population games across different initial settings of MASs and game configurations. We also show that our model can be applied to different exploration mechanisms, describe the mean dynamics, and be extended to Q-learning in 2-player and n-player games. Last but not least, we show that our model can provide insights into algorithm parameters and facilitate parameter tuning.
Authors
(none)
Tags
Stats
Related papers
- The Evolutionary Dynamics Of Independent Learning Agents In Population Games (2020)0.00
- Deterministic Model Of Incremental Multi-agent Boltzmann Q-learning: Transient Cooperation, Metastability, And Oscillations (2024)0.00
- On The Stability Of Learning In Network Games With Many Players (2024)0.00
- Asymptotic Convergence And Performance Of Multi-agent Q-learning Dynamics (2023)0.00
- Convergence And Connectivity: Dynamics Of Multi-agent Q-learning In Random Networks (2025)0.00
- Beyond Strict Competition: Approximate Convergence Of Multi Agent Q-learning Dynamics (2023)0.00
- Learning In Multi-memory Games Triggers Complex Dynamics Diverging From Nash Equilibrium (2023)0.00
- Logit-q Dynamics For Efficient Learning In Stochastic Teams (2023)0.00