Reinforcement Learning For Intensity Control: An Application To Choice-based Network Revenue Management
2024 · Huiling Meng, Ningyuan Chen, Xuefeng Gao
Abstract
Intensity control is a class of continuous-time dynamic optimization problems with many important applications in Operations Research including queueing and revenue management. In this study, we propose a practical continuous-time reinforcement learning framework for intensity control using choice-based network revenue management as a case study, which is a classical problem in revenue management that features a large state space, a large action space, and a continuous time horizon. We show that by leveraging the event-driven structure of the problem and the inherent discretization of sample paths created by the state-jump times, a defining feature of intensity control, one does not need to discretize the time horizon in advance. We adapt discrete-time Monte Carlo and temporal difference learning algorithms for policy evaluation to continuous time and develop policy-gradient-based actor-critic algorithms for event-driven intensity control. Through a comprehensive numerical study, we ev
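The event-driven idea in the abstract can be illustrated with a minimal sketch. This is a toy setup, not the paper's algorithm. It assumes a single resource, Poisson customer arrivals, and a multinomial-logit choice model with a no-purchase weight of 1.0. The names `simulate_path`, `monte_carlo_value`, `prices`, `weights`, and `policy` are hypothetical. The simulation moves from one event time to the next by sampling exponential inter-arrival times, so no time grid is fixed in advance, and the value of a fixed policy is estimated by averaging total revenue over sample paths.

```python
import random

def simulate_path(capacity, horizon, arrival_rate, prices, weights, policy, rng):
    """Simulate one sample path; return (total revenue, list of purchase events)."""
    t, inventory, revenue = 0.0, capacity, 0.0
    events = []  # (purchase time, inventory after purchase, revenue collected)
    while inventory > 0:
        # Jump straight to the next arrival: exponential gap, no time discretization.
        t += rng.expovariate(arrival_rate)
        if t >= horizon:
            break
        offered = policy(t, inventory)  # offer set chosen at the event time
        if not offered:
            continue
        # Multinomial-logit choice; the no-purchase option has weight 1.0 (assumed).
        total = 1.0 + sum(weights[j] for j in offered)
        u = rng.random() * total
        acc = 0.0
        for j in offered:
            acc += weights[j]
            if u < acc:  # customer buys product j, so the state jumps
                inventory -= 1
                revenue += prices[j]
                events.append((t, inventory, prices[j]))
                break
    return revenue, events

def monte_carlo_value(n_paths, seed=0, **kwargs):
    """Monte Carlo policy evaluation: average total revenue over simulated paths."""
    rng = random.Random(seed)
    returns = [simulate_path(rng=rng, **kwargs)[0] for _ in range(n_paths)]
    return sum(returns) / n_paths

if __name__ == "__main__":
    prices = {0: 100.0, 1: 60.0}   # hypothetical product prices
    weights = {0: 0.5, 1: 1.0}     # hypothetical preference weights
    offer_all = lambda t, inventory: [0, 1]
    print(monte_carlo_value(1000, capacity=5, horizon=10.0, arrival_rate=2.0,
                            prices=prices, weights=weights, policy=offer_all))
```

Because the state is constant between purchases, the policy only needs to be queried at event times, which is what lets the sample path define its own discretization.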
Related papers
- Reinforcement Learning For Datacenter Congestion Control (2021)
- Intervention-assisted Policy Gradient Methods For Online Stochastic Queuing Network Optimization: Technical Report (2024)
- A General Markov Decision Process Framework For Directly Learning Optimal Control Policies (2019)
- Attraction-repulsion Actor-critic For Continuous Control Reinforcement Learning (2019)
- When To Sense And Control? A Time-adaptive Approach For Continuous-time RL (2024)
- A Tour Of Reinforcement Learning: The View From Continuous Control (2018)
- Actor Critic Learning Algorithms For Mean-field Control With Moment Neural Networks (2023)
- Policy Optimization In A Noisy Neighborhood: On Return Landscapes In Continuous Control (2023)