Specialized Deep Residual Policy Safe Reinforcement Learning-based Controller For Complex And Continuous State-action Spaces
2023 Β· Ammar N. Abbas, Georgios C. Chasparis, John D. Kelleher
Abstract
Traditional controllers have limitations as they rely on prior knowledge about the physics of the problem, require modeling of dynamics, and struggle to adapt to abnormal situations. Deep reinforcement learning has the potential to address these problems by learning optimal control policies through exploration in an environment. For safety-critical environments, it is impractical to explore randomly, and replacing conventional controllers with black-box models is also undesirable. Also, it is expensive in continuous state and action spaces, unless the search space is constrained. To address these challenges we propose a specialized deep residual policy safe reinforcement learning with a cycle of learning approach adapted for complex and continuous state-action spaces. Residual policy learning allows learning a hybrid control architecture where the reinforcement learning agent acts in synchronous collaboration with the conventional controller. The cycle of learning initiates the policy
Authors
(none)
Tags
Stats
Related papers
- Safe Continual Reinforcement Learning In Non-stationary Environments (2026)12.89
- Learning Robust And Adaptive Real-world Continuous Control Using Simulation And Transfer Learning (2018)0.00
- Concurrent Learning Of Policy And Unknown Safety Constraints In Reinforcement Learning (2024)0.00
- Hierarchical Framework For Interpretable And Probabilistic Model-based Safe Reinforcement Learning (2023)0.00
- Safety Correction From Baseline: Towards The Risk-aware Policy In Robotics Via Dual-agent Reinforcement Learning (2022)3.58
- Context-aware Safe Reinforcement Learning For Non-stationary Environments (2021)9.76
- Deepsafempc: Deep Learning-based Model Predictive Control For Safe Multi-agent Reinforcement Learning (2024)0.00
- Ensemble Reinforcement Learning In Continuous Spaces -- A Hierarchical Multi-step Approach For Policy Training (2022)2.26