Statements (25)
Predicate | Object |
---|---|
gptkbp:instanceOf |
gptkb:person
|
gptkbp:author |
Continuous control with deep reinforcement learning
|
gptkbp:awardReceived |
gptkb:Canada_Graduate_Scholarship
|
gptkbp:coauthor |
gptkb:David_Silver
gptkb:Alexander_Pritzel gptkb:Jonathan_J._Hunt gptkb:Nicolas_Heess gptkb:Tom_Erez gptkb:Yuval_Tassa gptkb:Daan_Wierstra |
gptkbp:doctoralAdvisor |
gptkb:Geoffrey_Hinton
|
gptkbp:education |
gptkb:University_of_Toronto
gptkb:Queen's_University_at_Kingston |
gptkbp:employer |
gptkb:DeepMind
|
gptkbp:field |
gptkb:artificial_intelligence
gptkb:machine_learning computational neuroscience |
https://www.w3.org/2000/01/rdf-schema#label |
Timothy Lillicrap
|
gptkbp:knownFor |
deep reinforcement learning
deterministic policy gradient algorithms |
gptkbp:nationality |
gptkb:Canadian
|
gptkbp:occupation |
gptkb:computer_scientist
gptkb:neuroscientist |
gptkbp:bfsParent |
gptkb:Oriol_Vinyals
|
gptkbp:bfsLayer |
6
|