Dynamics Of Resource Allocation In O-rans: An In-depth Exploration Of On-policy And Off-policy Deep Reinforcement Learning For Real-time Applications
2024 Β· Manal Mehdaoui, Amine Abouaomar
Abstract
Deep Reinforcement Learning (DRL) is a powerful tool used for addressing complex challenges in mobile networks. This paper investigates the application of two DRL models, on-policy and off-policy, in the field of resource allocation for Open Radio Access Networks (O-RAN). The on-policy model is the Proximal Policy Optimization (PPO), and the off-policy model is the Sample Efficient Actor-Critic with Experience Replay (ACER), which focuses on resolving the challenges of resource allocation associated with a Quality of Service (QoS) application that has strict requirements. Motivated by the original work of Nessrine Hammami and Kim Khoa Nguyen, this study is a replication to validate and prove the findings. Both PPO and ACER are used within the same experimental setup to assess their performance in a scenario of latency-sensitive and latency-tolerant users and compare them. The aim is to verify the efficacy of on-policy and off-policy DRL models in the context of O-RAN resource allocatio
Authors
(none)
Tags
Stats
Related papers
- Average Reward Reinforcement Learning For Wireless Radio Resource Management (2025)2.26
- Meta-reinforcement Learning For Fast And Data-efficient Spectrum Allocation In Dynamic Wireless Networks (2025)0.00
- Deep Reinforcement Learning For Distributed Uncoordinated Cognitive Radios Resource Allocation (2019)0.00
- Deep Reinforcement Learning For Distributed And Uncoordinated Cognitive Radios Resource Allocation (2022)0.00
- Offline And Distributional Reinforcement Learning For Radio Resource Management (2024)0.00
- FORLORN: A Framework For Comparing Offline Methods And Reinforcement Learning For Optimization Of RAN Parameters (2022)0.00
- Safe And Accelerated Deep Reinforcement Learning-based O-RAN Slicing: A Hybrid Transfer Learning Approach (2023)11.29
- Resource Management In Wireless Networks Via Multi-agent Deep Reinforcement Learning (2020)16.43