Safe Continual Reinforcement Learning In Non-stationary Environments
2026 · Austin Coursey, Abel Diaz-Gonzalez, Marcos Quinones-Grueiro, et al.
Abstract
Reinforcement learning (RL) offers a compelling data-driven paradigm for synthesizing controllers for complex systems when accurate physical models are unavailable; however, most existing control-oriented RL methods assume stationarity and, therefore, struggle in real-world non-stationary deployments where system dynamics and operating conditions can change unexpectedly. Moreover, RL controllers acting in physical environments must satisfy safety constraints throughout their learning and execution phases, rendering transient violations during adaptation unacceptable. Although continual RL and safe RL have each addressed non-stationarity and safety, respectively, their intersection remains comparatively unexplored, motivating the study of safe continual RL algorithms that can adapt over the system's lifetime while preserving safety. In this work, we systematically investigate safe continual reinforcement learning by introducing three benchmark environments that capture safety-critical c
Related papers
- Safe Continual Reinforcement Learning Methods For Nonstationary Environments. Towards A Survey Of The State Of The Art (2026)
- Context-aware Safe Reinforcement Learning For Non-stationary Environments (2021)
- Specialized Deep Residual Policy Safe Reinforcement Learning-based Controller For Complex And Continuous State-action Spaces (2023)
- Concurrent Learning Of Policy And Unknown Safety Constraints In Reinforcement Learning (2024)
- Demystifying Reinforcement Learning In Time-varying Systems (2022)
- Actsafe: Active Exploration With Safety Constraints For Reinforcement Learning (2024)
- Reinforcement Learning With Adaptive Regularization For Safe Control Of Critical Systems (2024)
- Implicit Safe Set Algorithm For Provably Safe Reinforcement Learning (2024)