Reward Engineering For Spatial Epidemic Simulations: A Reinforcement Learning Platform For Individual Behavioral Learning
2025 Β· Radman Rakhshandehroo, Daniel Coombs
Abstract
We present ContagionRL, a Gymnasium-compatible reinforcement learning platform specifically designed for systematic reward engineering in spatial epidemic simulations. Unlike traditional agent-based models that rely on fixed behavioral rules, our platform enables rigorous evaluation of how reward function design affects learned survival strategies across diverse epidemic scenarios. ContagionRL integrates a spatial SIRS+D epidemiological model with configurable environmental parameters, allowing researchers to stress-test reward functions under varying conditions including limited observability, different movement patterns, and heterogeneous population dynamics. We evaluate five distinct reward designs, ranging from sparse survival bonuses to a novel potential field approach, across multiple RL algorithms (PPO, SAC, A2C). Through systematic ablation studies, we identify that directional guidance and explicit adherence incentives are critical components for robust policy learning. Our co
Authors
(none)
Tags
Stats
Related papers
- A Microscopic Pandemic Simulator For Pandemic Prediction Using Scalable Million-agent Reinforcement Learning (2021)0.00
- Dynamic Noises Of Multi-agent Environments Can Improve Generalization: Agent-based Models Meets Reinforcement Learning (2022)0.00
- Disturbing Reinforcement Learning Agents With Corrupted Rewards (2021)0.00
- From Pheromones To Policies: Reinforcement Learning For Engineered Biological Swarms (2025)0.00
- Optimizing Resource-constrained Non-pharmaceutical Interventions For Multi-cluster Outbreak Control Using Hierarchical Reinforcement Learning (2026)0.00
- Learning Reward Functions For Cooperative Resilience In Multi-agent Systems (2026)0.00
- Survival Of The Fittest: Evolutionary Adaptation Of Policies For Environmental Shifts (2024)2.26
- Modeling Human Reputation-seeking Behavior In A Spatio-temporally Complex Public Good Provision Game (2025)0.00