Statements (20)
Predicate | Object |
---|---|
gptkbp:instanceOf |
gptkb:person
|
gptkbp:coauthor |
gptkb:Richard_Socher
gptkb:Bryan_McCann gptkb:Caiming_Xiong gptkb:James_Bradbury |
gptkbp:doctoralAdvisor |
gptkb:Christopher_Ré
|
gptkbp:education |
gptkb:Stanford_University
|
gptkbp:employer |
gptkb:Salesforce_Research
|
gptkbp:field |
gptkb:artificial_intelligence
gptkb:machine_learning |
https://www.w3.org/2000/01/rdf-schema#label |
Nitish Shirish Keskar
|
gptkbp:knownFor |
deep learning
natural language processing large language models |
gptkbp:nationality |
gptkb:Indian
|
gptkbp:notableWork |
gptkb:CTRL:_A_Conditional_Transformer_Language_Model_for_Controllable_Generation
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima |
gptkbp:occupation |
gptkb:computer_scientist
|
gptkbp:bfsParent |
gptkb:CTRL
|
gptkbp:bfsLayer |
7
|