The Signaler-responder Game: Learning To Communicate Using Thompson Sampling
2024 Β· Radhika Bhuckory, Bhaskar Krishnamachari
Abstract
We are interested in studying how heterogeneous agents can learn to communicate and cooperate with each other without being explicitly pre-programmed to do so. Motivated by this goal, we present and analyze a distributed solution to a two-player signaler-responder game which is defined as follows. The signaler agent has a random, exogenous need and can choose from four different strategies: never signal, always signal, signal when need, and signal when no need. The responder agent can choose to either ignore or respond to the signal. We define a reward to both agents when they cooperate to satisfy the signaler's need, and costs associated with communication, response and unmet needs. We identify pure Nash equilibria of the game and the conditions under which they occur. As a solution for this game, we propose two new distributed Bayesian learning algorithms, one for each agent, based on the classic Thompson Sampling policy for multi-armed bandits. These algorithms allow both agents to
Authors
(none)
Tags
Stats
Related papers
- Multi-agent Coordination In Adversarial Environments Through Signal Mediated Strategies (2021)2.26
- Decentralized Optimal Equilibrium Learning In Stochastic Games Via Single-bit Feedback (2026)0.00
- Langevin Thompson Sampling With Logarithmic Communication: Bandits And Reinforcement Learning (2023)0.00
- Learning Practical Communication Strategies In Cooperative Multi-agent Reinforcement Learning (2022)0.00
- Seq2seq Mimic Games: A Signaling Perspective (2018)0.00
- Efficient Exploration Of Zero-sum Stochastic Games (2020)0.00
- Towards Cooperation In Sequential Prisoner's Dilemmas: A Deep Multiagent Reinforcement Learning Approach (2018)0.00
- Learning Multiagent Coordination In The Absence Of Communication Channels (2018)0.00