Statements (13)
| Predicate | Object | 
|---|---|
| gptkbp:instanceOf | gptkb:reinforcement_learning_algorithm | 
| gptkbp:aimsTo | reduce overestimation bias in Q-learning | 
| gptkbp:appliesTo | deep reinforcement learning | 
| gptkbp:introducedIn | 2010 | 
| gptkbp:proposedBy | gptkb:Hado_van_Hasselt | 
| gptkbp:publishedIn | gptkb:AAAI_Conference_on_Artificial_Intelligence | 
| gptkbp:relatedTo | gptkb:Q-learning gptkb:Double_DQN | 
| gptkbp:uses | two value functions | 
| gptkbp:variant | gptkb:Q-learning | 
| gptkbp:bfsParent | gptkb:Double_DQN | 
| gptkbp:bfsLayer | 7 | 
| https://www.w3.org/2000/01/rdf-schema#label | Double Q-learning |