Watkins, C.J.C.H. (1989). Learning from Delayed Rewards. PhD thesis, University of Cambridge.

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:doctoral_degree
gptkbp:author	gptkb:Christopher_J.C.H._Watkins
gptkbp:field	Reinforcement learning
gptkbp:memberSchool	gptkb:University_of_Cambridge
gptkbp:notableFor	introduction of the Q-learning algorithm
gptkbp:title	gptkb:Learning_from_Delayed_Rewards
gptkbp:year	1989
gptkbp:bfsParent	gptkb:Q-learning
gptkbp:bfsLayer	5
http://www.w3.org/2000/01/rdf-schema#label	Watkins, C.J.C.H. (1989). Learning from Delayed Rewards. PhD thesis, University of Cambridge.