Deterministic Policy Gradient Algorithms
GPTKB entity
Statements (22)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:reinforcement_learning_algorithm
|
| gptkbp:author |
David Silver, Guy Lever, Nicolas Heess, Thomas Degris, Daan Wierstra, Martin Riedmiller
|
| gptkbp:citation |
gptkb:International_Conference_on_Machine_Learning
gptkb:Deterministic_Policy_Gradient_Algorithms 2014 |
| gptkbp:field |
gptkb:artificial_intelligence
gptkb:machine_learning |
| gptkbp:introduced |
gptkb:Martin_Riedmiller
gptkb:David_Silver gptkb:Nicolas_Heess gptkb:Daan_Wierstra Guy Lever Thomas Degris |
| gptkbp:introducedIn |
2014
|
| gptkbp:publishedIn |
gptkb:ICML_2014
|
| gptkbp:relatedTo |
gptkb:Deep_Deterministic_Policy_Gradient
Actor-Critic Methods Policy Gradient Methods |
| gptkbp:usedIn |
continuous action spaces
|
| gptkbp:bfsParent |
gptkb:Policy_Gradient
|
| gptkbp:bfsLayer |
8
|
| https://www.w3.org/2000/01/rdf-schema#label |
Deterministic Policy Gradient Algorithms
|