Statements (16)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:person
|
| gptkbp:coauthor |
RLHF (Reinforcement Learning from Human Feedback) research
Learning to summarize with human feedback |
| gptkbp:education |
gptkb:Stanford_University
|
| gptkbp:employer |
gptkb:OpenAI
gptkb:Anthropic |
| gptkbp:field |
gptkb:machine_learning
AI alignment |
| gptkbp:knownFor |
work at OpenAI
work at Anthropic |
| gptkbp:nationality |
gptkb:United_States
|
| gptkbp:occupation |
gptkb:entrepreneur
gptkb:researchers |
| gptkbp:bfsParent |
gptkb:ORCA:_Progressive_Learning_from_Complex_Explanation_Traces_of_GPT-4
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
Nisan Stiennon
|