Statements (13)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:reinforcement_learning_algorithm
|
| gptkbp:aimsTo |
reduce overestimation bias in Q-learning
|
| gptkbp:appliesTo |
deep reinforcement learning
|
| gptkbp:introducedIn |
2010
|
| gptkbp:proposedBy |
gptkb:Hado_van_Hasselt
|
| gptkbp:publishedIn |
gptkb:AAAI_Conference_on_Artificial_Intelligence
|
| gptkbp:relatedTo |
gptkb:Q-learning
gptkb:Double_DQN |
| gptkbp:uses |
two value functions
|
| gptkbp:variant |
gptkb:Q-learning
|
| gptkbp:bfsParent |
gptkb:Double_DQN
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
Double Q-learning
|