Statements (33)
Predicate | Object |
---|---|
gptkbp:instanceOf | gptkb:person |
gptkbp:author | Hindsight Experience Replay |
gptkbp:coauthor | gptkb:Bob_McGrew, gptkb:Ilya_Sutskever, gptkb:Pieter_Abbeel, gptkb:Wojciech_Zaremba, gptkb:Lukasz_Kaiser, gptkb:Alex_Ray, gptkb:Jakub_Pachocki, gptkb:Matteo_Hessel, Jonas Schneider, Peter Welinder, Rachel Fong, Wojciech Marian Czarnecki |
gptkbp:education | gptkb:University_of_Warsaw |
gptkbp:employer | gptkb:OpenAI |
gptkbp:field | gptkb:artificial_intelligence, gptkb:machine_learning, gptkb:reinforcement_learning |
https://www.w3.org/2000/01/rdf-schema#label | Marcin Andrychowicz |
gptkbp:knownFor | multi-agent systems, deep reinforcement learning |
gptkbp:nationality | Polish |
gptkbp:notableWork | gptkb:OpenAI_Five |
gptkbp:occupation | gptkb:computer_scientist |
gptkbp:publishedIn | gptkb:arXiv, gptkb:ICLR, gptkb:NeurIPS |
gptkbp:researchInterest | deep learning, robotics, AI safety |
gptkbp:bfsParent | gptkb:Learning_to_Learn_with_Gradients |
gptkbp:bfsLayer | 7 |