Physics-informed RL For Maximal Safety Probability Estimation
2024 Β· Hikaru Hoshino, Yorie Nakahira
Abstract
Accurate risk quantification and reachability analysis are crucial for safe control and learning, but sampling from rare events, risky states, or long-term trajectories can be prohibitively costly. Motivated by this, we study how to estimate the long-term safety probability of maximally safe actions without sufficient coverage of samples from risky states and long-term trajectories. The use of maximal safety probability in control and learning is expected to avoid conservative behaviors due to over-approximation of risk. Here, we first show that long-term safety probability, which is multiplicative in time, can be converted into additive costs and be solved using standard reinforcement learning methods. We then derive this probability as solutions of partial differential equations (PDEs) and propose Physics-Informed Reinforcement Learning (PIRL) algorithm. The proposed method can learn using sparse rewards because the physics constraints help propagate risk information through neighbor
Authors
(none)
Tags
Stats
Related papers
- Concurrent Learning Of Policy And Unknown Safety Constraints In Reinforcement Learning (2024)0.00
- Provably Optimal Reinforcement Learning Under Safety Filtering (2025)0.00
- DOPE: Doubly Optimistic And Pessimistic Exploration For Safe Reinforcement Learning (2021)0.00
- Safe Reinforcement Learning In Tensor Reproducing Kernel Hilbert Space (2023)0.00
- Actsafe: Active Exploration With Safety Constraints For Reinforcement Learning (2024)0.00
- Feasible Policy Iteration For Safe Reinforcement Learning (2023)0.00
- Predictable Reinforcement Learning Dynamics Through Entropy Rate Minimization (2023)0.00
- Conservative And Adaptive Penalty For Model-based Safe Reinforcement Learning (2021)0.00