Locally Private Distributed Reinforcement Learning
2020 Β· Hajime Ono, Tsubasa Takahashi
Abstract
We study locally differentially private algorithms for reinforcement learning to obtain a robust policy that performs well across distributed private environments. Our algorithm protects the information of local agents' models from being exploited by adversarial reverse engineering. Since a local policy is strongly being affected by the individual environment, the output of the agent may release the private information unconsciously. In our proposed algorithm, local agents update the model in their environments and report noisy gradients designed to satisfy local differential privacy (LDP) that gives a rigorous local privacy guarantee. By utilizing a set of reported noisy gradients, a central aggregator updates its model and delivers it to different local agents. In our empirical evaluation, we demonstrate how our method performs well under LDP. To the best of our knowledge, this is the first work that actualizes distributed reinforcement learning under LDP. This work enables us to obt
Authors
(none)
Tags
Stats
Related papers
- Efficient Differentially Private Fine-tuning Of Llms Via Reinforcement Learning (2025)0.00
- Differentially Private Policy Evaluation (2016)0.00
- Offline Reinforcement Learning With Differential Privacy (2022)0.00
- Local Differential Privacy For Regret Minimization In Reinforcement Learning (2020)0.00
- Near-optimal Differentially Private Reinforcement Learning (2022)0.00
- Privacy-preserving Reinforcement Learning From Human Feedback Via Decoupled Reward Modeling (2026)0.00
- Privacy Preserving Reinforcement Learning For Population Processes (2024)0.00
- Preserving Expert-level Privacy In Offline Reinforcement Learning (2024)0.00