Balancing Two-player Stochastic Games With Soft Q-learning
2018 · Jordi Grau-Moya, Felix Leibfried, Haitham Bou-Ammar
Abstract
Within the context of video games, the notion of perfectly rational agents can be undesirable, as it leads to uninteresting situations in which humans face tough adversarial decision makers. Current frameworks for stochastic games and reinforcement learning prohibit tuneable strategies because they seek optimal performance. In this paper, we enable such tuneable behaviour by generalising soft Q-learning to stochastic games, where more than one agent interacts strategically. We contribute both theoretically and empirically. On the theory side, we show that games with soft Q-learning exhibit a unique value and generalise team games and zero-sum games far beyond these two extremes to cover a continuous spectrum of gaming behaviour. Experimentally, we show how tuning agents' constraints affects performance and demonstrate, through a neural network architecture, how to reliably balance games with high-dimensional representations.
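As a rough illustration of how one tuneable parameter can span a spectrum between cooperative and adversarial play, the sketch below uses the standard log-sum-exp ("soft") aggregate over an opponent's actions. The abstract does not state the paper's formulas, so the function name `soft_value`, the parameter `beta`, and the `prior` distribution are assumptions made for illustration, not the authors' implementation.

```python
import math

def soft_value(q_values, prior, beta):
    """Soft aggregate (1/beta) * log(sum_a prior[a] * exp(beta * q[a])).

    Hypothetical helper: beta > 0 leans towards the best outcome
    (team-like play), beta < 0 towards the worst outcome
    (adversarial play), and beta == 0 gives the prior-weighted mean.
    """
    if beta == 0.0:
        # Limit case: plain expectation under the prior.
        return sum(p * q for p, q in zip(prior, q_values))
    # Shift by the largest exponent for numerical stability.
    scaled = [beta * q for q in q_values]
    m = max(scaled)
    total = sum(p * math.exp(s - m) for p, s in zip(prior, scaled))
    return (m + math.log(total)) / beta

q = [1.0, 2.0, 3.0]        # payoffs for three opponent actions
uniform = [1.0 / 3.0] * 3  # assumed uniform prior over those actions

cooperative = soft_value(q, uniform, 1000.0)   # approaches max(q)
neutral = soft_value(q, uniform, 0.0)          # prior-weighted mean of q
adversarial = soft_value(q, uniform, -1000.0)  # approaches min(q)
```

Sweeping `beta` from large negative to large positive values moves the aggregate continuously from the worst-case payoff to the best-case payoff, which is one way to read the "continuous spectrum of gaming behaviour" mentioned in the abstract.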
Related papers
- Exploration-exploitation In Multi-agent Competition: Convergence With Bounded Rationality (2021)
- Beyond Strict Competition: Approximate Convergence Of Multi Agent Q-learning Dynamics (2023)
- Feature-based Q-learning For Two-player Stochastic Games (2019)
- Partial-information Q-learning For General Two-player Stochastic Games (2023)
- An Information-theoretic Optimality Principle For Deep Reinforcement Learning (2017)
- On The Stability Of Learning In Network Games With Many Players (2024)
- Asymptotic Convergence And Performance Of Multi-agent Q-learning Dynamics (2023)
- Logit-q Dynamics For Efficient Learning In Stochastic Teams (2023)