Autocorrelation Effects In A Stochastic-process Model For Decision Making Via Time Series
2026 Β· Tomoki Yamagami, Mikio Hasegawa, Takatomo Mihana, et al.
Abstract
Decision makers exploiting photonic chaotic dynamics obtained by semiconductor lasers provide an ultrafast approach to solving multi-armed bandit problems by using a temporal optical signal as the driving source for sequential decisions. In such systems, the sampling interval of the chaotic waveform shapes the temporal correlation of the resulting time series, and experiments have reported that decision accuracy depends strongly on this autocorrelation property. However, it remains unclear whether the benefit of autocorrelation can be explained by a minimal mathematical model. Here, we analyze a stochastic-process model of the time-series-based decision making using the tug-of-war principle for solving the two-armed bandit problem, where the threshold and a two-valued Markov signal evolve jointly. Numerical results reveal an environment-dependent structure: negative (positive) autocorrelation is optimal in reward-rich (reward-poor) environments. These findings show that negative autoco
Authors
(none)
Tags
Stats
Related papers
- Scalable Photonic Reinforcement Learning By Time-division Multiplexing Of Laser Chaos (2018)13.05
- Restless Bandit Problem With Rewards Generated By A Linear Gaussian Dynamical System (2024)0.00
- Optimal Policies For Observing Time Series And Related Restless Bandit Problems (2017)0.00
- Parallel Bandit Architecture Based On Laser Chaos For Reinforcement Learning (2022)0.00
- Dynamic Decision-making Under Model Misspecification: A Stochastic Stability Approach (2026)0.00
- A Frequency-domain Analysis Of The Multi-armed Bandit Problem: A New Perspective On The Exploration-exploitation Trade-off (2025)0.00
- Decentralized Multi-agent Reinforcement Learning Algorithm Using A Cluster-synchronized Laser Network (2024)0.00
- Understanding Individual Decision-making In Multi-agent Reinforcement Learning: A Dynamical Systems Approach (2025)0.00