Statements (18)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf | gptkb:software |
| gptkbp:category | AI benchmarking tool; machine learning evaluation framework |
| gptkbp:developedBy | gptkb:OpenAI |
| gptkbp:documentation | https://github.com/openai/evals/blob/main/README.md |
| gptkbp:license | gptkb:MIT_License |
| gptkbp:programmingLanguage | gptkb:Python |
| gptkbp:purpose | benchmark language models; evaluate LLMs |
| gptkbp:releaseDate | 2023-03-14 |
| gptkbp:repository | https://github.com/openai/evals |
| gptkbp:supports | community-contributed evals; custom evaluation tasks |
| gptkbp:usedFor | measuring model performance; testing GPT models |
| gptkbp:bfsParent | gptkb:OpenAI,_Inc. |
| gptkbp:bfsLayer | 7 |
| https://www.w3.org/2000/01/rdf-schema#label | OpenAI Evals |