Temporal Difference learning
GPTKB entity
Statements (23)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:Reinforcement_learning_algorithm
|
| gptkbp:abbreviation |
gptkb:TD_learning
|
| gptkbp:application |
gptkb:robot
Control systems Game playing |
| gptkbp:category |
gptkb:artificial_intelligence
Machine learning |
| gptkbp:combines |
gptkb:Monte_Carlo_methods
Dynamic programming |
| gptkbp:example |
gptkb:SARSA
gptkb:Q-learning gptkb:TD(0) |
| gptkbp:introduced |
gptkb:Richard_S._Sutton
|
| gptkbp:introducedIn |
1988
|
| gptkbp:learnsFrom |
Raw experience
|
| gptkbp:relatedTo |
Policy evaluation
Value function approximation |
| gptkbp:updated |
Value estimates
|
| gptkbp:usedIn |
Reinforcement learning
|
| gptkbp:uses |
Bootstrapping
|
| gptkbp:bfsParent |
gptkb:Reinforcement_Learning
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
Temporal Difference learning
|