Practical Policy Distillation For Reinforcement Learning In Radio Access Networks
2025 · Sara Khosravi, Burak Demirel, Linghui Zhou, et al.
Abstract
Adopting artificial intelligence (AI) in radio access networks (RANs) presents several challenges, including limited availability of link-level measurements (e.g., CQI reports), stringent real-time processing constraints (e.g., sub-1 ms per TTI), and network heterogeneity (different spectrum bands, cell types, and vendor equipment). A critical yet often overlooked barrier lies in the computational and memory limitations of RAN baseband hardware, particularly in legacy 4th Generation (4G) systems, which typically lack on-chip neural accelerators. As a result, only lightweight AI models (under 1 Mb and sub-100 µs inference time) can be effectively deployed, limiting both their performance and applicability. However, achieving strong generalization across diverse network conditions often requires large-scale models with substantial resource demands. To address this trade-off, this paper investigates policy distillation in the context of a reinforcement learning-based link adaptation ta