Statements (17)
Predicate | Object |
---|---|
gptkbp:instanceOf |
gptkb:software
|
gptkbp:developedBy |
gptkb:CarperAI
|
gptkbp:documentation |
https://trlx.readthedocs.io/
|
gptkbp:focusesOn |
reinforcement learning from human feedback
|
https://www.w3.org/2000/01/rdf-schema#label |
TRLX
|
gptkbp:license |
Apache-2.0
|
gptkbp:openSource |
true
|
gptkbp:programmingLanguage |
gptkb:Python
|
gptkbp:relatedTo |
gptkb:Natural_Language_Processing
gptkb:Reinforcement_Learning transformers |
gptkbp:repository |
https://github.com/CarperAI/trlx
|
gptkbp:supports |
gptkb:OpenAI_Gym_environments
transformer models |
gptkbp:usedFor |
fine-tuning large language models
|
gptkbp:bfsParent |
gptkb:CarperAI
|
gptkbp:bfsLayer |
7
|