gptkbp:instanceOf
|
person
|
gptkbp:academicAdvisor
|
gptkb:Andrew_Yao
|
gptkbp:affiliation
|
gptkb:University_of_Alberta
|
gptkbp:author
|
gptkb:Hado_van_Hasselt
gptkb:John_Schulman
gptkb:Geoffrey_Hinton
gptkb:David_Poole
gptkb:Leslie_Kaelbling
gptkb:Yoshua_Bengio
gptkb:Michael_G._McTear
gptkb:David_Silver
gptkb:Emma_Brunskill
gptkb:Ali_Farhadi
gptkb:Andrew_G._Barto
gptkb:Michael_Littman
gptkb:Andrew_Ng
gptkb:Jürgen_Schmidhuber
gptkb:Daphne_Koller
gptkb:Sergey_Levine
gptkb:Andrej_Karpathy
gptkb:Richard_Zemel
gptkb:Ronald_Parr
gptkb:Yann_LeCun
gptkb:David_Ha
gptkb:Pieter_Abbeel
gptkb:Ilya_Sutskever
gptkb:Michael_Wellman
gptkb:Marc_G._Bellemare
Judea Pearl
Doina Precup
Fei-Fei Li
Satinder Singh
Volodymyr Mnih
Barbara Grosz
Kevin G. Jamieson
Sham Kakade
Dimitri Bertsekas
Trevor_Darrell
|
gptkbp:awards
|
Canada_Research_Chair
|
gptkbp:birthPlace
|
gptkb:United_States
|
gptkbp:birthYear
|
1958
|
gptkbp:contribution
|
Q-learning
actor-critic methods
policy gradient methods
|
gptkbp:education
|
gptkb:Stanford_University
gptkb:University_of_Massachusetts_Amherst
|
gptkbp:field
|
artificial intelligence
machine learning
|
https://www.w3.org/2000/01/rdf-schema#label
|
Richard S. Sutton
|
gptkbp:influencedBy
|
gptkb:Herbert_A._Simon
gptkb:John_McCarthy
gptkb:Marvin_Minsky
|
gptkbp:knownFor
|
reinforcement learning
|
gptkbp:notableWork
|
Reinforcement Learning: An Introduction
Temporal-Difference_Learning
|
gptkbp:researchFocus
|
decision making
neuroscience
learning algorithms
|