A Novel Switch-type Policy Network For Resource Allocation Problems: Technical Report
2025 Β· Jerrod Wigmore, Brooke Shrader, Eytan Modiano
Abstract
Deep Reinforcement Learning (DRL) has become a powerful tool for developing control policies in queueing networks, but the common use of Multi-layer Perceptron (MLP) neural networks in these applications has significant drawbacks. MLP architectures, while versatile, often suffer from poor sample efficiency and a tendency to overfit training environments, leading to suboptimal performance on new, unseen networks. In response to these issues, we introduce a switch-type neural network (STN) architecture designed to improve the efficiency and generalization of DRL policies in queueing networks. The STN leverages structural patterns from traditional non-learning policies, ensuring consistent action choices across similar states. This design not only streamlines the learning process but also fosters better generalization by reducing the tendency to overfit. Our works presents three key contributions: first, the development of the STN as a more effective alternative to MLPs; second, empirical
Authors
(none)
Tags
Stats
Related papers
- Intervention-assisted Policy Gradient Methods For Online Stochastic Queuing Network Optimization: Technical Report (2024)0.00
- Scalable Multi-task Learning Through Spiking Neural Networks With Adaptive Task-switching Policy For Intelligent Autonomous Agents (2025)0.00
- Sequential Dynamic Decision Making With Deep Neural Nets On A Test-time Budget (2017)0.00
- Dynamics Of Resource Allocation In O-rans: An In-depth Exploration Of On-policy And Off-policy Deep Reinforcement Learning For Real-time Applications (2024)2.26
- Multi-timescale Ensemble Q-learning For Markov Decision Process Policy Optimization (2024)6.34
- Weighted Double Deep Multiagent Reinforcement Learning In Stochastic Cooperative Environments (2018)0.00
- Network Slicing Via Transfer Learning Aided Distributed Deep Reinforcement Learning (2023)7.50
- Deep Reinforcement Learning Based Networked Control With Network Delays For Signal Temporal Logic Specifications (2021)0.00