Effective Multi-user Delay-constrained Scheduling With Deep Recurrent Reinforcement Learning
2022 Β· Pihe Hu, Ling Pan, Yu Chen, et al.
Abstract
Multi-user delay constrained scheduling is important in many real-world applications including wireless communication, live streaming, and cloud computing. Yet, it poses a critical challenge since the scheduler needs to make real-time decisions to guarantee the delay and resource constraints simultaneously without prior information of system dynamics, which can be time-varying and hard to estimate. Moreover, many practical scenarios suffer from partial observability issues, e.g., due to sensing noise or hidden correlation. To tackle these challenges, we propose a deep reinforcement learning (DRL) algorithm, named Recurrent Softmax Delayed Deep Double Deterministic Policy Gradient (\(\mathtt\{RSD4\}\)), which is a data-driven method based on a Partially Observed Markov Decision Process (POMDP) formulation. \(\mathtt\{RSD4\}\) guarantees resource and delay constraints by Lagrangian dual and delay-sensitive queues, respectively. It also efficiently tackles partial observability with a mem
Authors
(none)
Tags
Stats
Related papers
- Offline Critic-guided Diffusion Policy For Multi-user Delay-constrained Scheduling (2025)0.00
- Decentralized Task Scheduling In Distributed Systems: A Deep Reinforcement Learning Approach (2026)0.00
- A Multi-task Approach To Robust Deep Reinforcement Learning For Resource Allocation (2023)0.00
- Revisiting State Augmentation Methods For Reinforcement Learning With Stochastic Delays (2021)10.35
- Structure-enhanced Deep Reinforcement Learning For Optimal Transmission Scheduling (2022)3.58
- Learning To Schedule Communication In Multi-agent Reinforcement Learning (2019)0.00
- Model-based Reinforcement Learning Under Random Observation Delays (2025)0.00
- Prioritizing Latency With Profit: A Drl-based Admission Control For 5G Network Slices (2025)0.00