Deep Reinforcement Learning Approach To MIMO Precoding Problem: Optimality And Robustness
2020 Β· Heunchul Lee, Maksym Girnyk, Jaeseong Jeong
Abstract
In this paper, we propose a deep reinforcement learning (RL)-based precoding framework that can be used to learn an optimal precoding policy for complex multiple-input multiple-output (MIMO) precoding problems. We model the precoding problem for a single-user MIMO system as an RL problem in which a learning agent sequentially selects the precoders to serve the environment of MIMO system based on contextual information about the environmental conditions, while simultaneously adapting the precoder selection policy based on the reward feedback from the environment to maximize a numerical reward signal. We develop the RL agent with two canonical deep RL (DRL) algorithms, namely deep Q-network (DQN) and deep deterministic policy gradient (DDPG). To demonstrate the optimality of the proposed DRL-based precoding framework, we explicitly consider a simple MIMO environment for which the optimal solution can be obtained analytically and show that DQN- and DDPG-based agents can learn the near-opt
Authors
(none)
Tags
Stats
Related papers
- Multi-agent Deep Reinforcement Learning (MADRL) Meets Multi-user MIMO Systems (2021)7.50
- Channel Estimation Via Successive Denoising In MIMO OFDM Systems: A Reinforcement Learning Approach (2021)9.23
- Offline Reinforcement Learning For Wireless Network Optimization With Mixture Datasets (2023)9.59
- A General Markov Decision Process Framework For Directly Learning Optimal Control Policies (2019)0.00
- Meta-reinforcement Learning For Fast And Data-efficient Spectrum Allocation In Dynamic Wireless Networks (2025)0.00
- Dynamic Channel Access Via Meta-reinforcement Learning (2021)5.84
- Policy Search Using Dynamic Mirror Descent MPC For Model Free Off Policy RL (2021)0.00
- Dual RL: Unification And New Methods For Reinforcement And Imitation Learning (2023)0.00