Learning from Delayed Rewards

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:book
gptkbp:author	gptkb:Christopher_Watkins
gptkbp:countryOfOrigin	gptkb:United_Kingdom
gptkbp:field	gptkb:reinforcement_learning
gptkbp:language	English
gptkbp:notableFor	introduction of Q-learning
gptkbp:publicationYear	1989
gptkbp:publisher	gptkb:King's_College,_Cambridge
gptkbp:type	gptkb:doctoral_degree
gptkbp:bfsParent	gptkb:Watkins,_C.J.C.H._(1989)._Learning_from_Delayed_Rewards._PhD_thesis,_University_of_Cambridge.
gptkbp:bfsLayer	6
http://www.w3.org/2000/01/rdf-schema#label	Learning from Delayed Rewards