Breaking The Barrier: Enhanced Utility And Robustness In Smoothed DRL Agents
2024 · Chung-En Sun, Sicun Gao, Tsui-Wei Weng
Abstract
Robustness remains a paramount concern in deep reinforcement learning (DRL), with randomized smoothing emerging as a key technique for enhancing this attribute. However, a notable gap exists in the performance of current smoothed DRL agents, often characterized by significantly low clean rewards and weak robustness. In response to this challenge, our study introduces innovative algorithms aimed at training effective smoothed robust DRL agents. We propose S-DQN and S-PPO, novel approaches that demonstrate remarkable improvements in clean rewards, empirical robustness, and robustness guarantee across standard RL benchmarks. Notably, our S-DQN and S-PPO agents not only significantly outperform existing smoothed agents by an average factor of \(2.16\times\) under the strongest attack, but also surpass previous robustly-trained agents by an average factor of \(2.13\times\). This represents a significant leap forward in the field. Furthermore, we introduce Smoothed Attack, which is \(1.89\ti
Related papers
- Policy Smoothing For Provably Robust Reinforcement Learning (2021) 0.00
- Robust Deep Reinforcement Learning Against Adversarial Perturbations On State Observations (2020) 0.00
- Robust Deep Reinforcement Learning With Adaptive Adversarial Perturbations In Action Space (2024) 6.20
- Dreamsmooth: Improving Model-based Reinforcement Learning Via Reward Smoothing (2023) 0.00
- Towards Robust Offline-to-online Reinforcement Learning Via Uncertainty And Smoothness (2023) 5.24
- Robust Deep Reinforcement Learning Through Adversarial Attacks And Training: A Survey (2024) 0.00
- Robust Deep Reinforcement Learning Through Adversarial Loss (2020) 0.00
- Robust Reinforcement Learning On State Observations With Learned Optimal Adversary (2021) 0.00