gptkbp:instanceOf
|
gptkb:algorithm
|
gptkbp:appliesTo
|
continuous action spaces
|
gptkbp:basedOn
|
actor-critic
deterministic policy gradient
|
gptkbp:citation
|
Continuous control with deep reinforcement learning
https://arxiv.org/abs/1509.02971
|
gptkbp:field
|
gptkb:reinforcement_learning
|
gptkbp:fullName
|
gptkb:Deep_Deterministic_Policy_Gradient
|
https://www.w3.org/2000/01/rdf-schema#label
|
DDPG
|
gptkbp:introduced
|
gptkb:David_Silver
gptkb:Alexander_Pritzel
gptkb:Jonathan_J._Hunt
gptkb:Nicolas_Heess
gptkb:Tom_Erez
gptkb:Yuval_Tassa
gptkb:Timothy_P._Lillicrap
gptkb:Daan_Wierstra
|
gptkbp:openSource
|
gptkb:OpenAI_Baselines
gptkb:Stable_Baselines
gptkb:TensorFlow_Agents
PyTorch RL libraries
|
gptkbp:publicationYear
|
2015
|
gptkbp:publishedIn
|
gptkb:arXiv
|
gptkbp:relatedTo
|
gptkb:Deep_Q-Network
gptkb:Twin_Delayed_DDPG
gptkb:Soft_Actor-Critic
|
gptkbp:uses
|
deep neural networks
experience replay
target networks
|
gptkbp:bfsParent
|
gptkb:Actor-Critic
gptkb:OpenAI_Baselines
gptkb:Stable_Baselines
|
gptkbp:bfsLayer
|
6
|