A Single-loop Deep Actor-critic Algorithm For Constrained Reinforcement Learning With Provable Convergence
2023 Β· Kexuan Wang, An Liu, Baishuo Lin
Abstract
Deep Actor-Critic algorithms, which combine Actor-Critic with deep neural network (DNN), have been among the most prevalent reinforcement learning algorithms for decision-making problems in simulated environments. However, the existing deep Actor-Critic algorithms are still not mature to solve realistic problems with non-convex stochastic constraints and high cost to interact with the environment. In this paper, we propose a single-loop deep Actor-Critic (SLDAC) algorithmic framework for general constrained reinforcement learning (CRL) problems. In the actor step, the constrained stochastic successive convex approximation (CSSCA) method is applied to handle the non-convex stochastic objective and constraints. In the critic step, the critic DNNs are only updated once or a few finite times for each iteration, which simplifies the algorithm to a single-loop framework (the existing works require a sufficient number of updates for the critic step to ensure a good enough convergence of the i
Authors
(none)
Tags
Stats
Related papers
- Broad Critic Deep Actor Reinforcement Learning For Continuous Control (2024)0.00
- Recursive Least Squares Advantage Actor-critic Algorithms (2022)0.00
- Efficient Exploration In Deep Reinforcement Learning: A Novel Bayesian Actor-critic Algorithm (2024)0.00
- Global Convergence Of Two-timescale Actor-critic For Solving Linear Quadratic Regulator (2022)4.52
- DR-SAC: Distributionally Robust Soft Actor-critic For Reinforcement Learning Under Uncertainty (2025)0.00
- Actor-critic Algorithms For Constrained Multi-agent Reinforcement Learning (2019)0.00
- Soft Actor-critic: Off-policy Maximum Entropy Deep Reinforcement Learning With A Stochastic Actor (2018)0.00
- Langevin Soft Actor-critic: Efficient Exploration Through Uncertainty-driven Critic Learning (2025)0.00