Offline Critic-guided Diffusion Policy For Multi-user Delay-constrained Scheduling
2025 Β· Zhuoran Li, Ruishuo Chen, Hai Zhong, et al.
Abstract
Effective multi-user delay-constrained scheduling is crucial in various real-world applications, such as instant messaging, live streaming, and data center management. In these scenarios, schedulers must make real-time decisions to satisfy both delay and resource constraints without prior knowledge of system dynamics, which are often time-varying and challenging to estimate. Current learning-based methods typically require interactions with actual systems during the training stage, which can be difficult or impractical, as it is capable of significantly degrading system performance and incurring substantial service costs. To address these challenges, we propose a novel offline reinforcement learning-based algorithm, named \underline\{S\}cheduling By \underline\{O\}ffline Learning with \underline\{C\}ritic Guidance and \underline\{D\}iffusion Generation (SOCD), to learn efficient scheduling policies purely from pre-collected *offline data*. SOCD innovatively employs a diffusion-based po
Authors
(none)
Tags
Stats
Related papers
- Effective Multi-user Delay-constrained Scheduling With Deep Recurrent Reinforcement Learning (2022)7.16
- Diffusion Model-based Reinforcement Learning For Version Age Of Information Scheduling: Average And Tail-risk-sensitive Control (2026)0.00
- Diffusion Actor-critic: Formulating Constrained Policy Iteration As Diffusion Noise Regression For Offline Reinforcement Learning (2024)2.92
- Diffusion Policies With Value-conditional Optimization For Offline Reinforcement Learning (2025)0.00
- Madiff: Offline Multi-agent Learning With Diffusion Models (2023)2.26
- Preferred-action-optimized Diffusion Policies For Offline Reinforcement Learning (2024)0.00
- Contractive Diffusion Policies: Robust Action Diffusion Via Contractive Score-based Sampling With Differential Equations (2026)0.00
- Diffcps: Diffusion Model Based Constrained Policy Search For Offline Reinforcement Learning (2023)1.91