Policy Gradient Methods for Reinforcement Learning with Function Approximation
GPTKB entity
Statements (17)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf | gptkb:academic_journal |
| gptkbp:author | gptkb:Richard_S._Sutton, gptkb:David_McAllester, gptkb:Yishay_Mansour, gptkb:Satinder_Singh |
| gptkbp:citation | many subsequent works in deep reinforcement learning |
| gptkbp:doi | 10.5555/3009657.3009806 |
| gptkbp:field | gptkb:machine_learning, gptkb:reinforcement_learning |
| gptkbp:focusesOn | function approximation in policy gradient methods |
| gptkbp:influenced | actor-critic algorithms |
| gptkbp:proposedBy | policy gradient methods for reinforcement learning |
| gptkbp:publicationYear | 1999 |
| gptkbp:publishedIn | gptkb:Advances_in_Neural_Information_Processing_Systems |
| gptkbp:bfsParent | gptkb:Policy_Gradient |
| gptkbp:bfsLayer | 8 |
| https://www.w3.org/2000/01/rdf-schema#label | Policy Gradient Methods for Reinforcement Learning with Function Approximation |