Analysis Of Thompson Sampling For Controlling Unknown Linear Diffusion Processes
2022 Β· Mohamad Kazem Shirani Faradonbeh, Sadegh Shirani, Mohsen Bayati
Abstract
Linear diffusion processes serve as canonical continuous-time models for dynamic decision-making under uncertainty. These systems evolve according to drift matrices that specify the instantaneous rates of change in the expected system state, while also experiencing continuous random disturbances modeled by Brownian noise. For instance, in medical applications such as artificial pancreas systems, the drift matrices represent the internal dynamics of glucose concentrations. Classical results in stochastic control provide optimal policies under perfect knowledge of the drift matrices. However, practical decision-making scenarios typically feature uncertainty about the drift; in medical contexts, such parameters are patient-specific and unknown, requiring adaptive policies for efficiently learning the drift matrices while ensuring system stability and optimal performance. We study the Thompson sampling (TS) algorithm for decision-making in linear diffusion processes with unknown drift ma
Authors
(none)
Tags
Stats
Related papers
- Diffusion Controller: Framework, Algorithms And Parameterization (2026)0.00
- Langevin Thompson Sampling With Logarithmic Communication: Bandits And Reinforcement Learning (2023)0.00
- Thompson Sampling For Infinite-horizon Discounted Decision Processes (2024)0.00
- Dynamic Decision-making Under Model Misspecification: A Stochastic Stability Approach (2026)0.00
- A Relaxed Technical Assumption For Posterior Sampling-based Reinforcement Learning For Control Of Unknown Linear Systems (2021)0.00
- Policy Gradient Optimization Of Thompson Sampling Policies (2020)0.00
- Understanding Sampler Stochasticity In Training Diffusion Models For RLHF (2025)0.00
- Learning To Sample From Diffusion Models Via Inverse Reinforcement Learning (2026)0.00