Meta-reinforcement Learning For Fast And Data-efficient Spectrum Allocation In Dynamic Wireless Networks
2025 Β· Oluwaseyi Giwa, Tobi Awodunmila, Muhammad Ahmed Mohsin, et al.
Abstract
The dynamic allocation of spectrum in 5G / 6G networks is critical to efficient resource utilization. However, applying traditional deep reinforcement learning (DRL) is often infeasible due to its immense sample complexity and the safety risks associated with unguided exploration, which can cause severe network interference. To address these challenges, we propose a meta-learning framework that enables agents to learn a robust initial policy and rapidly adapt to new wireless scenarios with minimal data. We implement three meta-learning architectures, model-agnostic meta-learning (MAML), recurrent neural network (RNN), and an attention-enhanced RNN, and evaluate them against a non-meta-learning DRL algorithm, proximal policy optimization (PPO) baseline, in a simulated dynamic integrated access/backhaul (IAB) environment. Our results show a clear performance gap. The attention-based meta-learning agent reaches a peak mean network throughput of 48 Mbps, while the PPO baseline decreased dr
Authors
(none)
Tags
Stats
Related papers
- Dynamic Channel Access Via Meta-reinforcement Learning (2021)5.84
- Deep Reinforcement Learning For Joint Spectrum And Power Allocation In Cellular Networks (2020)0.00
- The Cost Of Learning: Efficiency Vs. Efficacy Of Learning-based RRM For 6G (2022)0.00
- Dynamics Of Resource Allocation In O-rans: An In-depth Exploration Of On-policy And Off-policy Deep Reinforcement Learning For Real-time Applications (2024)2.26
- Dynamic Spectrum Access For Ambient Backscatter Communication-assisted D2D Systems With Quantum Reinforcement Learning (2024)0.00
- Resource Management In Wireless Networks Via Multi-agent Deep Reinforcement Learning (2020)16.43
- Enhancing Spectrum Efficiency In 6G Satellite Networks: A Gail-powered Policy Learning Via Asynchronous Federated Inverse Reinforcement Learning (2024)0.00
- Deep Reinforcement Learning For Distributed Uncoordinated Cognitive Radios Resource Allocation (2019)0.00