Decision Market Based Learning For Multi-agent Contextual Bandit Problems
2022 Β· Wenlong Wang, Thomas Pfeiffer
Abstract
Information is often stored in a distributed and proprietary form, and agents who own information are often self-interested and require incentives to reveal their information. Suitable mechanisms are required to elicit and aggregate such distributed information for decision making. In this paper, we use simulations to investigate the use of decision markets as mechanisms in a multi-agent learning system to aggregate distributed information for decision-making in a contextual bandit problem. The system utilises strictly proper decision scoring rules to assess the accuracy of probabilistic reports from agents, which allows agents to learn to solve the contextual bandit problem jointly. Our simulations show that our multi-agent system with distributed information can be trained as efficiently as a centralised counterpart with a single agent that receives all information. Moreover, we use our system to investigate scenarios with deterministic decision scoring rules which are not incentive
Authors
(none)
Tags
Stats
Related papers
- A New Bandit Setting Balancing Information From State Evolution And Corrupted Context (2020)0.00
- Online Learning For Cooperative Multi-player Multi-armed Bandits (2021)5.24
- Online Learning With Costly Features In Non-stationary Environments (2023)0.00
- Bayesian Decision Making Around Experts (2025)0.00
- Unified Models Of Human Behavioral Agents In Bandits, Contextual Bandits And RL (2020)8.35
- Multi-agent Bandit Learning Through Heterogeneous Action Erasure Channels (2023)0.00
- Learning To Coordinate Under Threshold Rewards: A Cooperative Multi-agent Bandit Framework (2025)0.00
- Human-ai Learning Performance In Multi-armed Bandits (2018)7.50