Statements (26)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:neural_network_technique
|
| gptkbp:appliesTo |
actor-critic methods
Deep Q-Networks |
| gptkbp:hasConcept |
add parameterized noise to neural network weights
|
| gptkbp:organization |
gptkb:DeepMind
|
| gptkbp:proposedBy |
gptkb:Demis_Hassabis
gptkb:Alex_Graves gptkb:Bilal_Piot Charles Blundell Ian Osband Jacob Menick Meire Fortunato Mohammad Gheshlaghi Azar Olivier Pietquin Remi Munos Vlad Mnih |
| gptkbp:publicationYear |
2017
|
| gptkbp:publishedIn |
arXiv:1706.10295
|
| gptkbp:purpose |
improve exploration in reinforcement learning
|
| gptkbp:relatedTo |
exploration-exploitation tradeoff
parameter noise randomized value functions |
| gptkbp:usedIn |
gptkb:reinforcement_learning
|
| gptkbp:bfsParent |
gptkb:Rainbow_DQN
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
Noisy Nets
|