DDPG

GPTKB entity

Predicate	Object
gptkbp:instance_of	gptkb:Artificial_Intelligence
gptkbp:applies_to	continuous action spaces
gptkbp:based_on	actor-critic architecture
gptkbp:can_be_combined_with	other techniques
gptkbp:developed_by	gptkb:Google_Deep_Mind
gptkbp:first_introduced	gptkb:2015
gptkbp:has_expansion	deterministic policy gradient methods
https://www.w3.org/2000/01/rdf-schema#label	DDPG
gptkbp:improves	gptkb:DQN
gptkbp:is_applied_in	healthcare finance autonomous driving
gptkbp:is_used_in	gptkb:robotics game playing control tasks
gptkbp:requires	hyperparameter tuning
gptkbp:sensitivity	initialization
gptkbp:suffered_from	overestimation bias
gptkbp:type	gptkb:machine_learning
gptkbp:uses	deep neural networks
gptkbp:utilizes	experience replay target networks
gptkbp:bfsParent	gptkb:Stable_Baselines gptkb:Keras-RL gptkb:Lunar_Lander-v2 gptkb:Open_AI_Baselines
gptkbp:bfsLayer	5