Abstract

Regret has been established as a foundational concept in online learning, and likewise has important applications in the analysis of learning dynamics in games. Regret quantifies the difference between a learner's performance against a baseline in hindsight. It is well-known that regret-minimizing algorithms converge to certain classes of equilibria in games; however, traditional forms of regret used in game theory predominantly consider baselines that permit deviations to deterministic actions or strategies. In this paper, we revisit our understanding of regret from the perspective of deviations over partitions of the full *mixed* strategy space (i.e., probability distributions over pure strategies), under the lens of the previously-established \(\Phi\)-regret framework, which provides a continuum of stronger regret measures. Importantly, \(\Phi\)-regret enables learning agents to consider deviations from and to mixed strategies, generalizing several existing notions of regret such as

Authors

(none)

Tags

  • Game AI

Stats

Related papers