DEFENDER: Dtw-based Episode Filtering Using Demonstrations For Enhancing RL Safety
2023 · André Correia, Luís Alexandre
Abstract
Deploying reinforcement learning agents in the real world can be challenging due to the risks associated with learning through trial and error. We propose a task-agnostic method that leverages small sets of safe and unsafe demonstrations to improve the safety of RL agents during learning. The method compares the current trajectory of the agent with both sets of demonstrations at every step, and filters the trajectory if it resembles the unsafe demonstrations. We perform ablation studies on different filtering strategies and investigate the impact of the number of demonstrations on performance. Our method is compatible with any stand-alone RL algorithm and can be applied to any task. We evaluate our method on three tasks from OpenAI Gym's Mujoco benchmark and two state-of-the-art RL algorithms. The results demonstrate that our method significantly reduces the crash rate of the agent while converging to, and in most cases even improving, the performance of the stand-alone agent.
Authors
(none)
Tags
Stats
Related papers
- Provably Optimal Reinforcement Learning Under Safety Filtering (2025)0.00
- Guided Online Distillation: Promoting Safe Reinforcement Learning By Offline Demonstration (2023)4.52
- Safe Reinforcement Learning In Black-box Environments Via Adaptive Shielding (2024)2.26
- Dyna-style Safety Augmented Reinforcement Learning: Staying Safe In The Face Of Uncertainty (2026)0.00
- Learning Safe Policies With Expert Guidance (2018)0.00
- Implicit Safe Set Algorithm For Provably Safe Reinforcement Learning (2024)0.00
- On Assessing The Safety Of Reinforcement Learning Algorithms Using Formal Methods (2021)0.00
- Regret-based Defense In Adversarial Reinforcement Learning (2023)0.00