Policy Gradient Methods for Reinforcement Learning with Function Approximation
GPTKB entity
Statements (17)
Predicate | Object |
---|---|
gptkbp:instanceOf | gptkb:academic_journal |
gptkbp:author | gptkb:Richard_S._Sutton, gptkb:David_McAllester, gptkb:Yishay_Mansour, gptkb:Satinder_Singh |
gptkbp:citation | many subsequent works in deep reinforcement learning |
gptkbp:doi | 10.5555/3009657.3009806 |
gptkbp:field | gptkb:machine_learning, gptkb:reinforcement_learning |
gptkbp:focusesOn | function approximation in policy gradient methods |
https://www.w3.org/2000/01/rdf-schema#label | Policy Gradient Methods for Reinforcement Learning with Function Approximation |
gptkbp:influenced | actor-critic algorithms |
gptkbp:proposedBy | policy gradient methods for reinforcement learning |
gptkbp:publicationYear | 1999 |
gptkbp:publishedIn | gptkb:Advances_in_Neural_Information_Processing_Systems |
gptkbp:bfsParent | gptkb:Policy_Gradient |
gptkbp:bfsLayer | 8 |