Optimizing Resource-constrained Non-pharmaceutical Interventions For Multi-cluster Outbreak Control Using Hierarchical Reinforcement Learning
2026 Β· Xueqiao Peng, Andrew Perrault
Abstract
arXiv:2603.19397v2 Announce Type: replace Abstract: Non-pharmaceutical interventions (NPIs), such as diagnostic testing and quarantine, are crucial for controlling infectious disease outbreaks but are often constrained by limited resources, particularly in early outbreak stages. In real-world public health settings, resources must be allocated across multiple outbreak clusters that emerge asynchronously, vary in size and risk, and compete for a shared resource budget. Here, a cluster corresponds to a group of close contacts generated by a single infected index case. Thus, decisions must be made under uncertainty and heterogeneous demands, while respecting operational constraints. We formulate this problem as a constrained restless multi-armed bandit and propose a hierarchical reinforcement learning framework. A global controller learns a continuous action cost multiplier that adjusts global resource demand, while a generalized local policy estimates the marginal value of allocating re
Authors
(none)
Tags
Stats
Related papers
- Reward Engineering For Spatial Epidemic Simulations: A Reinforcement Learning Platform For Individual Behavioral Learning (2025)0.00
- Stochastic Optimization For Vaccine And Testing Kit Allocation For The COVID-19 Pandemic (2021)0.00
- Multi-agent Reinforcement Learning With Hierarchical Coordination For Emergency Responder Stationing (2024)0.00
- A Microscopic Pandemic Simulator For Pandemic Prediction Using Scalable Million-agent Reinforcement Learning (2021)0.00
- Scalable Policy Maximization Under Network Interference (2025)0.00
- IRL For Restless Multi-armed Bandits With Applications In Maternal And Child Health (2024)0.00
- Harnessing Data From Clustered LQR Systems: Personalized And Collaborative Policy Optimization (2025)0.00
- Hierarachical Multiagent Reinforcement Learning For Multi-group Tax Game (2026)0.00