The Blessing Of Heterogeneity In Federated Q-learning: Linear Speedup And Beyond
2023 Β· Jiin Woo, Gauri Joshi, Yuejie Chi
Abstract
When the data used for reinforcement learning (RL) are collected by multiple agents in a distributed manner, federated versions of RL algorithms allow collaborative learning without the need for agents to share their local data. In this paper, we consider federated Q-learning, which aims to learn an optimal Q-function by periodically aggregating local Q-estimates trained on local data alone. Focusing on infinite-horizon tabular Markov decision processes, we provide sample complexity guarantees for both the synchronous and asynchronous variants of federated Q-learning. In both cases, our bounds exhibit a linear speedup with respect to the number of agents and near-optimal dependencies on other salient problem parameters. In the asynchronous setting, existing analyses of federated Q-learning, which adopt an equally weighted averaging of local Q-estimates, require that every agent covers the entire state-action space. In contrast, our improved sample complexity scales inverse proportion
Authors
(none)
Tags
Stats
Related papers
- Federated Q-learning: Linear Regret Speedup With Low Communication Cost (2023)0.00
- Federated Stochastic Approximation Under Markov Noise And Heterogeneity: Applications In Reinforcement Learning (2022)0.00
- The Sample-communication Complexity Trade-off In Federated Q-learning (2024)0.00
- Federated Offline Reinforcement Learning: Collaborative Single-policy Coverage Suffices (2024)0.00
- A Finite Time Analysis Of Distributed Q-learning (2024)0.00
- Federated Q-learning With Reference-advantage Decomposition: Almost Optimal Regret And Logarithmic Communication Cost (2024)0.00
- Momentum For The Win: Collaborative Federated Reinforcement Learning Across Heterogeneous Environments (2024)0.00
- Achieving Tighter Finite-time Rates For Heterogeneous Federated Stochastic Approximation Under Markovian Sampling (2025)0.00