Statements (17)
Predicate | Object |
---|---|
gptkbp:instance_of |
gptkb:Artificial_Intelligence
|
gptkbp:applies_to |
Markov decision processes
game playing |
gptkbp:developed_by |
gptkb:Richard_Sutton
|
gptkbp:focuses_on |
on-policy learning
|
gptkbp:has_variants |
Sarsa(λ)
|
https://www.w3.org/2000/01/rdf-schema#label |
Sarsa
|
gptkbp:improves |
policy evaluation
|
gptkbp:is_compared_to |
Monte Carlo methods
|
gptkbp:is_similar_to |
Temporal Difference Learning
|
gptkbp:is_used_in |
gptkb:robot
|
gptkbp:related_to |
Q-learning
|
gptkbp:type |
reinforcement learning
|
gptkbp:used_in |
gptkb:software_framework
|
gptkbp:bfsParent |
gptkb:Daria_Zawiałow
gptkb:Sony_Music_Poland |
gptkbp:bfsLayer |
5
|