Statements (18)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf | gptkb:software |
| gptkbp:category | AI benchmarking tool; machine learning evaluation framework |
| gptkbp:developedBy | gptkb:OpenAI |
| gptkbp:documentation | https://github.com/openai/evals/blob/main/README.md |
| gptkbp:license | gptkb:MIT_License |
| gptkbp:programmingLanguage | gptkb:Python |
| gptkbp:purpose | benchmark language models; evaluate LLMs |
| gptkbp:releaseDate | 2023-03-14 |
| gptkbp:repository | https://github.com/openai/evals |
| gptkbp:supports | community-contributed evals; custom evaluation tasks |
| gptkbp:usedFor | measuring model performance; testing GPT models |
| gptkbp:bfsParent | gptkb:OpenAI,_Inc. |
| gptkbp:bfsLayer | 7 |
| https://www.w3.org/2000/01/rdf-schema#label | OpenAI Evals |