Reinforcement Learning With Convex Constraints
2019 · Sobhan Miryoosefi, Kianté Brantley, Hal Daumé, et al.
Abstract
In standard reinforcement learning (RL), a learning agent seeks to optimize the overall reward. However, many key aspects of a desired behavior are more naturally expressed as constraints. For instance, the designer may want to limit the use of unsafe actions, increase the diversity of trajectories to enable exploration, or approximate expert trajectories when rewards are sparse. In this paper, we propose an algorithmic scheme that can handle a wide class of constraints in RL tasks: specifically, any constraints that require expected values of some vector measurements (such as the use of an action) to lie in a convex set. This captures previously studied constraints (such as safety and proximity to an expert), but also enables new classes of constraints (such as diversity). Our approach comes with rigorous theoretical guarantees and only relies on the ability to approximately solve standard RL tasks. As a result, it can be easily adapted to work with any model-free or model-based RL. I
Authors
(none)
Tags
Stats
Related papers
- Challenging Common Assumptions In Convex Reinforcement Learning (2022)0.00
- Global Reinforcement Learning: Beyond Linear And Convex Rewards Via Submodular Semi-gradient Methods (2024)0.00
- Provably Efficient Exploration In Inverse Constrained Reinforcement Learning (2024)0.00
- Solving Richly Constrained Reinforcement Learning Through State Augmentation And Reward Penalties (2023)0.00
- Constrained Model-based Reinforcement Learning With Robust Cross-entropy Method (2020)0.00
- Imitate The Good And Avoid The Bad: An Incremental Approach To Safe Reinforcement Learning (2023)0.00
- CRPO: A New Approach For Safe Reinforcement Learning With Convergence Guarantee (2020)0.00
- Constraint-conditioned Policy Optimization For Versatile Safe Reinforcement Learning (2023)0.00