Safe And Accelerated Deep Reinforcement Learning-based O-RAN Slicing: A Hybrid Transfer Learning Approach
2023 · Ahmad M. Nagib, Hatem Abou-Zeid, Hossam S. Hassanein
Abstract
The open radio access network (O-RAN) architecture supports intelligent network control algorithms as one of its core capabilities. Data-driven applications incorporate such algorithms to optimize radio access network (RAN) functions via RAN intelligent controllers (RICs). Deep reinforcement learning (DRL) algorithms are among the main approaches adopted in the O-RAN literature to solve dynamic radio resource management problems. However, despite the benefits introduced by the O-RAN RICs, the practical adoption of DRL algorithms in real network deployments falls behind. This is primarily due to the slow convergence and unstable performance exhibited by DRL agents upon deployment and when encountering previously unseen network conditions. In this paper, we address these challenges by proposing transfer learning (TL) as a core component of the training and deployment workflows for the DRL-based closed-loop control of O-RAN functionalities. To this end, we propose and design a hybrid TL-a