Simple statistical gradient-following algorithms for connectionist reinforcement learning
GPTKB entity
Statements (13)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:academic_journal
|
| gptkbp:author |
gptkb:Ronald_J._Williams
|
| gptkbp:citation |
many subsequent works in reinforcement learning
|
| gptkbp:contribution |
gptkb:REINFORCE_algorithm
|
| gptkbp:doi |
10.1007/BF00992696
|
| gptkbp:field |
gptkb:machine_learning
gptkb:reinforcement_learning |
| gptkbp:language |
English
|
| gptkbp:publicationYear |
1992
|
| gptkbp:publishedIn |
gptkb:Machine_Learning
|
| gptkbp:bfsParent |
gptkb:Ronald_J._Williams
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
Simple statistical gradient-following algorithms for connectionist reinforcement learning
|