Statements (23)
Predicate | Object |
---|---|
gptkbp:instanceOf |
Reinforcement Learning Algorithm
|
gptkbp:application |
gptkb:robot
Finance Control Systems Game Playing |
gptkbp:combines |
gptkb:Dynamic_Programming
Monte Carlo Methods |
gptkbp:compatibleWith |
Model of Environment
|
gptkbp:example |
gptkb:SARSA
gptkb:Q-learning gptkb:TD(0) |
gptkbp:fullName |
gptkb:Temporal_Difference_Learning
|
gptkbp:hasConcept |
Bootstrapping
Temporal Difference Error |
https://www.w3.org/2000/01/rdf-schema#label |
TD Learning
|
gptkbp:introduced |
gptkb:Richard_S._Sutton
|
gptkbp:introducedIn |
1988
|
gptkbp:learnsFrom |
Raw Experience
|
gptkbp:updated |
Value Estimates
|
gptkbp:usedIn |
gptkb:Reinforcement_Learning
|
gptkbp:uses |
Bootstrapping
|
gptkbp:bfsParent |
gptkb:Temporal_Difference_Learning
|
gptkbp:bfsLayer |
7
|