Simple statistical gradient-following algorithms for connectionist reinforcement learning
GPTKB entity
Statements (13)
Predicate | Object |
---|---|
gptkbp:instanceOf |
gptkb:academic_journal
|
gptkbp:author |
gptkb:Ronald_J._Williams
|
gptkbp:citation |
many subsequent works in reinforcement learning
|
gptkbp:contribution |
gptkb:REINFORCE_algorithm
|
gptkbp:doi |
10.1007/BF00992696
|
gptkbp:field |
gptkb:machine_learning
gptkb:reinforcement_learning |
https://www.w3.org/2000/01/rdf-schema#label |
Simple statistical gradient-following algorithms for connectionist reinforcement learning
|
gptkbp:language |
English
|
gptkbp:publicationYear |
1992
|
gptkbp:publishedIn |
gptkb:Machine_Learning
|
gptkbp:bfsParent |
gptkb:Ronald_J._Williams
|
gptkbp:bfsLayer |
6
|