Funnel-based Reward Shaping For Signal Temporal Logic Tasks In Reinforcement Learning
2022 Β· Naman Saxena, Gorantla Sandeep, Pushpak Jagtap
Abstract
Signal Temporal Logic (STL) is a powerful framework for describing the complex temporal and logical behaviour of the dynamical system. Numerous studies have attempted to employ reinforcement learning to learn a controller that enforces STL specifications; however, they have been unable to effectively tackle the challenges of ensuring robust satisfaction in continuous state space and maintaining tractability. In this paper, leveraging the concept of funnel functions, we propose a tractable reinforcement learning algorithm to learn a time-dependent policy for robust satisfaction of STL specification in continuous state space. We demonstrate the utility of our approach on several STL tasks using different environments.
Authors
(none)
Tags
Stats
Related papers
- A Hierarchical Reinforcement Learning Method For Persistent Time-sensitive Tasks (2016)0.00
- A Policy Search Method For Temporal Logic Specified Reinforcement Learning Tasks (2017)11.58
- TGPO: Temporal Grounded Policy Optimization For Signal Temporal Logic Tasks (2025)0.00
- Temporal-logic-based Reward Shaping For Continuing Reinforcement Learning Tasks (2020)9.76
- Directed Exploration In Reinforcement Learning From Linear Temporal Logic (2024)0.00
- Deep Reinforcement Learning Based Networked Control With Network Delays For Signal Temporal Logic Specifications (2021)0.00
- Stratifying Reinforcement Learning With Signal Temporal Logic (2026)0.00
- Stlgame: Signal Temporal Logic Games In Adversarial Multi-agent Systems (2024)0.00