Decentralized Federated Reinforcement Learning For User-centric Dynamic TFDD Control
2022 Β· Ziyan Yin, Zhe Wang, Jun Li, et al.
Abstract
The explosive growth of dynamic and heterogeneous data traffic brings great challenges for 5G and beyond mobile networks. To enhance the network capacity and reliability, we propose a learning-based dynamic time-frequency division duplexing (D-TFDD) scheme that adaptively allocates the uplink and downlink time-frequency resources of base stations (BSs) to meet the asymmetric and heterogeneous traffic demands while alleviating the inter-cell interference. We formulate the problem as a decentralized partially observable Markov decision process (Dec-POMDP) that maximizes the long-term expected sum rate under the users' packet dropping ratio constraints. In order to jointly optimize the global resources in a decentralized manner, we propose a federated reinforcement learning (RL) algorithm named federated Wolpertinger deep deterministic policy gradient (FWDDPG) algorithm. The BSs decide their local time-frequency configurations through RL algorithms and achieve global training via exchangi
Authors
(none)
Tags
Stats
Related papers
- Meta-reinforcement Learning For Fast And Data-efficient Spectrum Allocation In Dynamic Wireless Networks (2025)0.00
- Multi-agent Reinforcement Learning For Adaptive User Association In Dynamic Mmwave Networks (2020)0.00
- The Cost Of Learning: Efficiency Vs. Efficacy Of Learning-based RRM For 6G (2022)0.00
- Resource Management In Wireless Networks Via Multi-agent Deep Reinforcement Learning (2020)16.43
- Small-scale-fading-aware Resource Allocation In Wireless Federated Learning (2025)0.00
- A Policy-driven DRL Framework For System-level Tradeoff Control In Nr-u/wi-fi Coexistence (2026)0.00
- Effective Multi-user Delay-constrained Scheduling With Deep Recurrent Reinforcement Learning (2022)7.16
- Dynamic Channel Access Via Meta-reinforcement Learning (2021)5.84