Watkins, C.J.C.H. (1989). Learning from Delayed Rewards. PhD thesis, University of Cambridge.
GPTKB entity
Statements (10)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:doctoral_degree
|
| gptkbp:author |
gptkb:Christopher_J.C.H._Watkins
|
| gptkbp:field |
Reinforcement learning
|
| gptkbp:memberSchool |
gptkb:University_of_Cambridge
|
| gptkbp:notableFor |
introduction of Q-learning algorithm
|
| gptkbp:title |
gptkb:Learning_from_Delayed_Rewards
|
| gptkbp:year |
1989
|
| gptkbp:bfsParent |
gptkb:Q-learning
|
| gptkbp:bfsLayer |
5
|
| https://www.w3.org/2000/01/rdf-schema#label |
Watkins, C.J.C.H. (1989). Learning from Delayed Rewards. PhD thesis, University of Cambridge.
|