Wasserstein Flow Meets Replicator Dynamics: A Mean-field Analysis Of Representation Learning In Actor-critic
2021 · Yufeng Zhang, Siyu Chen, Zhuoran Yang, et al.
Abstract
Actor-critic (AC) algorithms, empowered by neural networks, have had significant empirical success in recent years. However, most of the existing theoretical support for AC algorithms focuses on the case of linear function approximations, or linearized neural networks, where the feature representation is fixed throughout training. Such a limitation fails to capture the key aspect of representation learning in neural AC, which is pivotal in practical problems. In this work, we take a mean-field perspective on the evolution and convergence of feature-based neural AC. Specifically, we consider a version of AC where the actor and critic are represented by overparameterized two-layer neural networks and are updated with two-timescale learning rates. The critic is updated by temporal-difference (TD) learning with a larger stepsize while the actor is updated via proximal policy optimization (PPO) with a smaller stepsize. In the continuous-time and infinite-width limiting regime, when the time
Related papers
- Studying The Interplay Between The Actor And Critic Representations In Reinforcement Learning (2025)
- Functional Critics Are Essential For Actor-critic: From Off-policy Stability To Efficient Exploration (2025)
- Analysis Of A Target-based Actor-critic Algorithm With Linear Function Approximation (2021)
- Learning Mean-field Games Through Mean-field Actor-critic Flow (2025)
- A Finite Time Analysis Of Two Time-scale Actor Critic Methods (2020)
- Actor Critic Learning Algorithms For Mean-field Control With Moment Neural Networks (2023)
- Decision-aware Actor-critic With Function Approximation And Theoretical Guarantees (2023)
- Non-asymptotic Analysis For Single-loop (natural) Actor-critic With Compatible Function Approximation (2024)