Causality Meets Locality: Provably Generalizable And Scalable Policy Learning For Networked Systems
2025 Β· Hao Liang, Shuqing Shi, Yudi Zhang, et al.
Abstract
Large-scale networked systems, such as traffic, power, and wireless grids, challenge reinforcement-learning agents with both scale and environment shifts. To address these challenges, we propose GSAC (Generalizable and Scalable Actor-Critic), a framework that couples causal representation learning with meta actor-critic learning to achieve both scalability and domain generalization. Each agent first learns a sparse local causal mask that provably identifies the minimal neighborhood variables influencing its dynamics, yielding exponentially tight approximately compact representations (ACRs) of state and domain factors. These ACRs bound the error of truncating value functions to \(\kappa\)-hop neighborhoods, enabling efficient learning on graphs. A meta actor-critic then trains a shared policy across multiple source domains while conditioning on the compact domain factors; at test time, a few trajectories suffice to estimate the new domain factor and deploy the adapted policy. We establi
Authors
(none)
Tags
Stats
Related papers
- Scalable Reinforcement Learning For Multi-agent Networked Systems (2019)10.35
- Scalable Multi-agent Reinforcement Learning For Networked Systems With Average Reward (2020)0.00
- Transformer-based Scalable Multi-agent Reinforcement Learning For Networked Systems With Long-range Interactions (2025)0.00
- Multi-agent Reinforcement Learning In Stochastic Networked Systems (2020)0.00
- Local Advantage Actor-critic For Robust Multi-agent Deep Reinforcement Learning (2021)7.81
- Learning To Coordinate In Multi-agent Systems: A Coordinated Actor-critic Algorithm And Finite-time Guarantees (2021)0.00
- ACE : Off-policy Actor-critic With Causality-aware Entropy Regularization (2024)0.00
- Evolving Pareto-optimal Actor-critic Algorithms For Generalizability And Stability (2022)0.00