Superhuman AI For Stratego Using Self-play Reinforcement Learning And Test-time Search
2025 · Samuel Sokota, Eugene Vinitsky, Hengyuan Hu, et al.
Abstract
Few classical games have been regarded as such significant benchmarks of artificial intelligence as to have justified training costs in the millions of dollars. Among these, Stratego -- a board wargame exemplifying the challenge of strategic decision making under massive amounts of hidden information -- stands apart as a case where such efforts failed to produce performance at the level of top humans. This work establishes a step change in both performance and cost for Stratego, showing that it is now possible not only to reach the level of top humans, but to achieve vastly superhuman level -- and that doing so requires not an industrial budget, but merely a few thousand dollars. We achieved this result by developing general approaches for self-play reinforcement learning and test-time search under imperfect information.
Related papers
- Artificial Generals Intelligence: Mastering Generals.io With Reinforcement Learning (2025)
- Reinforcement Learning In Strategy-based And Atari Games: A Review Of Google DeepMind's Innovations (2025)
- Reinforcing Competitive Multi-agents For Playing 'So Long Sucker' (2024)
- Impartial Games: A Challenge For Reinforcement Learning (2022)
- SCC: An Efficient Deep Reinforcement Learning Agent Mastering The Game Of StarCraft II (2020)
- A Human Mixed Strategy Approach To Deep Reinforcement Learning (2018)
- Reinforcement Learning On Human Decision Models For Uniquely Collaborative AI Teammates (2021)
- Evaluation Of Human-AI Teams For Learned And Rule-based Agents In Hanabi (2021)