Bandit Approach To Conflict-free Multi-agent Q-learning In View Of Photonic Implementation
2022 · Hiroaki Shinkawa, Nicolas Chauvet, André Röhm, et al.
Abstract
Recently, photonic reinforcement learning, which accelerates computation by exploiting the physical nature of light, has been studied extensively. Previous studies utilized quantum interference of photons to achieve collective decision-making without choice conflicts when solving the competitive multi-armed bandit problem, a fundamental example of reinforcement learning. However, the bandit problem deals with a static environment where the agent's action does not influence the reward probabilities. This study aims to extend the conventional approach to a more general multi-agent reinforcement learning setting, targeting the grid world problem. Unlike the conventional approach, the proposed scheme deals with a dynamic environment where the reward changes because of agents' actions. A successful photonic reinforcement learning scheme requires both a photonic system that contributes to the quality of learning and a suitable algorithm. This study proposes a novel learning algor
Related papers
- Parallel Bandit Architecture Based On Laser Chaos For Reinforcement Learning (2022) · 0.00
- Decentralized Multi-agent Reinforcement Learning Algorithm Using A Cluster-synchronized Laser Network (2024) · 0.00
- Scalable Photonic Reinforcement Learning By Time-division Multiplexing Of Laser Chaos (2018) · 13.05
- MADQRL: Distributed Quantum Reinforcement Learning Framework For Multi-agent Environments (2026) · 0.00
- Quantum Natural Policy Gradients: Towards Sample-efficient Reinforcement Learning (2023) · 7.16
- Towards Multi-agent Reinforcement Learning Using Quantum Boltzmann Machines (2021) · 0.00
- An Optical Control Environment For Benchmarking Reinforcement Learning Algorithms (2022) · 0.00
- Multi-agent Quantum Reinforcement Learning Using Evolutionary Optimization (2023) · 5.24