Algorithmic Pricing With Independent Learners And Relative Experience Replay
2021 Β· Bingyan Han
Abstract
In an infinitely repeated general-sum pricing game, independent reinforcement learners may exhibit collusive behavior without any communication, raising concerns about algorithmic collusion. To better understand the learning dynamics, we incorporate agents' relative performance (RP) among competitors using experience replay (ER) techniques. Experimental results indicate that RP considerations play a critical role in long-run outcomes. Agents that are averse to underperformance converge to the Bertrand-Nash equilibrium, while those more tolerant of underperformance tend to charge supra-competitive prices. This finding also helps mitigate the overfitting issue in independent Q-learning. Additionally, the impact of relative ER varies with the number of agents and the choice of algorithms.
Authors
(none)
Tags
Stats
Related papers
- Multi-agent Reinforcement Learning For Market Making: Competition Without Collusion (2025)0.00
- Learning A Game By Paying The Agents (2025)0.00
- The Bounds Of Algorithmic Collusion; \(q\)-learning, Gradient Learning, And The Folk Theorem (2024)0.00
- Loss Aversion Fosters Coordination Among Independent Reinforcement Learners (2019)0.00
- Impact Of Decentralized Learning On Player Utilities In Stackelberg Games (2024)0.00
- Reciprocal Reward Influence Encourages Cooperation From Self-interested Agents (2024)1.91
- Cheap Talking Algorithms (2023)0.00
- Learning Equilibria From Data: Provably Efficient Multi-agent Imitation Learning (2025)0.00