Statements (26)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf | gptkb:person |
| gptkbp:author | Continuous control with deep reinforcement learning |
| gptkbp:awardReceived | gptkb:Canada_Graduate_Scholarship |
| gptkbp:coauthor | gptkb:David_Silver, gptkb:Alexander_Pritzel, gptkb:Jonathan_J._Hunt, gptkb:Nicolas_Heess, gptkb:Tom_Erez, gptkb:Yuval_Tassa, gptkb:Daan_Wierstra |
| gptkbp:doctoralAdvisor | gptkb:Geoffrey_Hinton |
| gptkbp:education | gptkb:University_of_Toronto, gptkb:Queen's_University_at_Kingston |
| gptkbp:employer | gptkb:DeepMind |
| gptkbp:field | gptkb:artificial_intelligence, gptkb:machine_learning, computational neuroscience |
| gptkbp:knownFor | deep reinforcement learning, deterministic policy gradient algorithms |
| gptkbp:nationality | gptkb:Canadian |
| gptkbp:occupation | gptkb:computer_scientist, gptkb:neuroscientist |
| gptkbp:bfsParent | gptkb:Oriol_Vinyals, gptkb:ACKTR |
| gptkbp:bfsLayer | 7 |
| https://www.w3.org/2000/01/rdf-schema#label | Timothy Lillicrap |