Statements (16)
Predicate | Object |
---|---|
gptkbp:instanceOf | gptkb:person |
gptkbp:affiliation | gptkb:Northeastern_University |
gptkbp:author | gptkb:Simple_statistical_gradient-following_algorithms_for_connectionist_reinforcement_learning |
gptkbp:author | A learning algorithm for continually running fully recurrent neural networks |
gptkbp:doctoralAdvisor | gptkb:Stuart_E._Dreyfus |
gptkbp:field | gptkb:artificial_intelligence |
gptkbp:field | gptkb:machine_learning |
gptkbp:field | neural networks |
https://www.w3.org/2000/01/rdf-schema#label | Ronald J. Williams |
gptkbp:knownFor | gptkb:reinforcement_learning |
gptkbp:knownFor | gptkb:backpropagation_through_time |
gptkbp:knownFor | gptkb:REINFORCE_algorithm |
gptkbp:nationality | gptkb:American |
gptkbp:occupation | gptkb:computer_scientist |
gptkbp:bfsParent | gptkb:Learning_representations_by_back-propagating_errors |
gptkbp:bfsLayer | 5 |