Polymatrix Competitive Gradient Descent
2021 · Jeffrey Ma, Alistair Letcher, Florian Schäfer, et al.
Abstract
Many economic games and machine learning approaches can be cast as competitive optimization problems where multiple agents are minimizing their respective objective function, which depends on all agents' actions. While gradient descent is a reliable basic workhorse for single-agent optimization, it often leads to oscillation in competitive optimization. In this work we propose polymatrix competitive gradient descent (PCGD) as a method for solving general sum competitive optimization involving arbitrary numbers of agents. The updates of our method are obtained as the Nash equilibria of a local polymatrix approximation with a quadratic regularization, and can be computed efficiently by solving a linear system of equations. We prove local convergence of PCGD to stable fixed points for \(n\)-player general-sum games, and show that it does not require adapting the step size to the strength of the player-interactions. We use PCGD to optimize policies in multi-agent reinforcement learning and
Authors
(none)
Tags
Stats
Related papers
- Competitive Policy Optimization (2020)0.00
- Asynchronous Gradient Play In Zero-sum Multi-agent Games (2022)0.00
- Generalized Individual Q-learning For Polymatrix Games With Partial Observations (2024)2.26
- Last-iterate Convergence Of Decentralized Optimistic Gradient Descent/ascent In Infinite-horizon Competitive Markov Games (2021)0.00
- Policy Optimization For Markov Games: Unified Framework And Faster Convergence (2022)0.00
- Independent Policy Gradient For Large-scale Markov Potential Games: Sharper Rates, Function Approximation, And Game-agnostic Convergence (2022)0.00
- Symmetric (optimistic) Natural Policy Gradient For Multi-agent Learning With Parameter Convergence (2022)0.00
- Policy Optimization For Continuous-time Linear-quadratic Graphon Mean Field Games (2025)0.00