← all papers Β· overview

Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines

Abstract

We show how an action-dependent baseline can be used by the policy gradient theorem using function approximation, originally presented with action-independent baselines by (Sutton et al. 2000).

Related papers

Ranked by semantic similarity β€” how closely each paper's abstract matches this one (100% = near-identical topic).