Memory-two Strategies Forming Symmetric Mutual Reinforcement Learning Equilibrium In Repeated Prisoners' Dilemma Game
2021 Β· Masahiko Ueda
Abstract
We investigate symmetric equilibria of mutual reinforcement learning when both players alternately learn the optimal memory-two strategies against the opponent in the repeated prisoners' dilemma game. We provide a necessary condition for memory-two deterministic strategies to form symmetric equilibria. We then provide three examples of memory-two deterministic strategies which form symmetric mutual reinforcement learning equilibria. We also prove that mutual reinforcement learning equilibria formed by memory-two strategies are also mutual reinforcement learning equilibria when both players use reinforcement learning of memory-\(n\) strategies with \(n>2\).
Authors
(none)
Tags
Stats
Related papers
- Symmetric Equilibrium Of Multi-agent Reinforcement Learning In Repeated Prisoner's Dilemma (2021)8.60
- Memory Asymmetry Creates Heteroclinic Orbits To Nash Equilibrium In Learning In Zero-sum Games (2023)0.00
- Learning In Multi-memory Games Triggers Complex Dynamics Diverging From Nash Equilibrium (2023)0.00
- How Memory Architecture Affects Learning In A Simple POMDP: The Two-hypothesis Testing Problem (2021)0.00
- On The Impossibility Of Convergence Of Mixed Strategies With No Regret Learning (2020)0.00
- Evaluation And Learning In Two-player Symmetric Games Via Best And Better Responses (2022)0.00
- On The Emergence Of Cooperation In The Repeated Prisoner's Dilemma (2022)0.00
- A Dual Memory Structure For Efficient Use Of Replay Memory In Deep Reinforcement Learning (2019)0.00