Statements (18)
Predicate | Object |
---|---|
gptkbp:instanceOf | gptkb:software |
gptkbp:category | AI benchmarking tool; machine learning evaluation framework |
gptkbp:developedBy | gptkb:OpenAI |
gptkbp:documentation | https://github.com/openai/evals/blob/main/README.md |
https://www.w3.org/2000/01/rdf-schema#label | OpenAI Evals |
gptkbp:license | gptkb:MIT_License |
gptkbp:programmingLanguage | gptkb:Python |
gptkbp:purpose | benchmark language models; evaluate LLMs |
gptkbp:releaseDate | 2023-03-14 |
gptkbp:repository | https://github.com/openai/evals |
gptkbp:supports | community-contributed evals; custom evaluation tasks |
gptkbp:usedFor | measuring model performance; testing GPT models |
gptkbp:bfsParent | gptkb:OpenAI,_Inc. |
gptkbp:bfsLayer | 7 |