Explore Reinforced: Equilibrium Approximation With Reinforcement Learning
2024 · Ryan Yu, Mateusz Nowak, Qintong Xie, et al.
Abstract
Current algorithms for approximating Coarse Correlated Equilibria (CCE) are theoretically guaranteed to converge to a strong solution concept, but they struggle in games with large stochastic environments. In contrast, modern Reinforcement Learning (RL) algorithms train faster yet yield weaker solutions. We introduce Exp3-IXrl, a blend of RL and game-theoretic approaches that separates the RL agent's action selection from the equilibrium computation while preserving the integrity of the learning process. We demonstrate that our algorithm extends equilibrium approximation algorithms to new environments. Specifically, we show improved performance in a complex, adversarial cybersecurity network environment, the Cyber Operations Research Gym, and in classical multi-armed bandit settings.
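The name Exp3-IXrl points to Exp3-IX, the standard adversarial-bandit algorithm that uses implicit exploration. As background for the multi-armed bandit setting mentioned in the abstract, the following is a minimal sketch of plain Exp3-IX only; it is not the paper's Exp3-IXrl and does not include any RL component. The function name `exp3_ix`, the default choices of `eta` and `gamma`, and the toy loss function are assumptions made for illustration.

```python
import math
import random


def exp3_ix(loss_fn, n_arms, horizon, eta=None, gamma=None, seed=0):
    """Plain Exp3-IX for losses in [0, 1] (illustrative sketch).

    Returns the sampling distribution used in the last round and the
    number of times each arm was pulled.
    """
    rng = random.Random(seed)
    # Assumed defaults: a common textbook tuning, not taken from the paper.
    if eta is None:
        eta = math.sqrt(2.0 * math.log(n_arms) / (n_arms * horizon))
    if gamma is None:
        gamma = eta / 2.0

    cum_est = [0.0] * n_arms  # cumulative estimated losses per arm
    pulls = [0] * n_arms
    probs = [1.0 / n_arms] * n_arms

    for t in range(horizon):
        # Exponential weights; subtract the minimum for numerical stability.
        base = min(cum_est)
        weights = [math.exp(-eta * (c - base)) for c in cum_est]
        total = sum(weights)
        probs = [w / total for w in weights]

        arm = rng.choices(range(n_arms), weights=probs)[0]
        loss = loss_fn(t, arm)

        # Implicit exploration: divide by (p + gamma) instead of p, which
        # biases the estimate downward but keeps its variance bounded.
        cum_est[arm] += loss / (probs[arm] + gamma)
        pulls[arm] += 1

    return probs, pulls


# Toy problem (hypothetical): three arms with fixed losses; arm 2 is best.
fixed_losses = [0.5, 0.5, 0.1]
probs, pulls = exp3_ix(lambda t, arm: fixed_losses[arm], n_arms=3, horizon=5000)
```

Only the pulled arm's estimate is updated each round, so the learner needs no information about the losses of the arms it did not choose.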
Related papers
- Strategically Robust Multi-agent Reinforcement Learning With Linear Function Approximation (2026) 0.00
- Score-based Equilibrium Learning In Multi-player Finite Games With Imperfect Information (2023) 0.00
- Breaking The Curse Of Multiagents In A Large State Space: RL In Markov Games With Independent Linear Function Approximation (2023) 0.00
- Unified Algorithms For RL With Decision-estimation Coefficients: PAC, Reward-free, Preference-based Learning, And Beyond (2022) 5.24
- Near Optimal Convergence To Coarse Correlated Equilibrium In General-sum Markov Games (2025) 0.00
- Distributionally Robust Online Markov Game With Linear Function Approximation (2025) 0.00
- Reinforcement Learning With Non-ergodic Reward Increments: Robustness Via Ergodicity Transformations (2023) 0.00
- Minimax-optimal Multi-agent RL In Markov Games With A Generative Model (2022) 2.26