Policy Gradient Methods for Reinforcement Learning with Function Approximation

GPTKB entity

Statements (17)
Predicate Object
gptkbp:instanceOf gptkb:academic_journal
gptkbp:author gptkb:Richard_S._Sutton
gptkb:David_McAllester
gptkb:Yishay_Mansour
gptkb:Satinder_Singh
gptkbp:citation many subsequent works in deep reinforcement learning
gptkbp:doi 10.5555/3009657.3009806
gptkbp:field gptkb:machine_learning
gptkb:reinforcement_learning
gptkbp:focusesOn function approximation in policy gradient methods
https://www.w3.org/2000/01/rdf-schema#label Policy Gradient Methods for Reinforcement Learning with Function Approximation
gptkbp:influenced actor-critic algorithms
gptkbp:proposedBy policy gradient methods for reinforcement learning
gptkbp:publicationYear 1999
gptkbp:publishedIn gptkb:Advances_in_Neural_Information_Processing_Systems
gptkbp:bfsParent gptkb:Policy_Gradient
gptkbp:bfsLayer 8