Watkins, C.J.C.H. (1989). Learning from Delayed Rewards. PhD thesis, University of Cambridge.
GPTKB entity
Statements (10)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf | gptkb:doctoral_degree |
| gptkbp:author | gptkb:Christopher_J.C.H._Watkins |
| gptkbp:field | Reinforcement learning |
| https://www.w3.org/2000/01/rdf-schema#label | Watkins, C.J.C.H. (1989). Learning from Delayed Rewards. PhD thesis, University of Cambridge. |
| gptkbp:memberSchool | gptkb:University_of_Cambridge |
| gptkbp:notableFor | introduction of Q-learning algorithm |
| gptkbp:title | gptkb:Learning_from_Delayed_Rewards |
| gptkbp:year | 1989 |
| gptkbp:bfsParent | gptkb:Q-learning |
| gptkbp:bfsLayer | 5 |