GLUE benchmark

GPTKB entity

Statements (58)
Predicate Object
gptkbp:instanceOf gptkb:performance
gptkbp:assesses gptkb:Matthews_correlation_coefficient
F1 score
accuracy
gptkbp:author Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel Bowman
gptkbp:citation gptkb:GLUE:_A_Multi-Task_Benchmark_and_Analysis_Platform_for_Natural_Language_Understanding
gptkb:ICLR_2019
2018
gptkbp:dataSource public datasets
gptkbp:field gptkb:machine_learning
natural language processing
gptkbp:focus sentiment analysis
textual entailment
semantic similarity
paraphrase detection
linguistic acceptability
sentence-level tasks
gptkbp:fullName gptkb:General_Language_Understanding_Evaluation
https://www.w3.org/2000/01/rdf-schema#label GLUE benchmark
gptkbp:introduced gptkb:Samuel_Bowman
gptkb:Alex_Wang
gptkb:Amanpreet_Singh
gptkb:Felix_Hill
gptkb:Julian_Michael
gptkb:Omer_Levy
gptkbp:introducedIn 2018
gptkbp:language English
gptkbp:memiliki_tugas gptkb:CoLA
gptkb:MNLI
gptkb:MRPC
gptkb:QNLI
gptkb:STS-B
gptkb:RTE
gptkb:SST-2
gptkb:WNLI
gptkb:QQP
gptkbp:notableModel gptkb:GPT-2
gptkb:BERT
gptkb:ALBERT
gptkb:RoBERTa
gptkb:XLNet
gptkbp:openAccess yes
gptkbp:purpose evaluate performance of language models
gptkbp:relatedBenchmark gptkb:SuperGLUE
gptkbp:status widely used
retired leaderboard
gptkbp:successor gptkb:SuperGLUE
gptkbp:type multi-task benchmark
gptkbp:usedFor benchmarking general language understanding
gptkbp:website https://gluebenchmark.com/
gptkbp:bfsParent gptkb:CoLA
gptkb:MNLI
gptkb:MRPC
gptkb:QNLI
gptkb:STS-B
gptkb:QQP
gptkb:Sam_Bowman
gptkbp:bfsLayer 6
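
Example (illustrative, not part of the GPTKB record): the three metrics listed under gptkbp:assesses can be sketched for binary labels as below. This is a minimal sketch that assumes labels are encoded as 0/1; the function names are hypothetical and it is not the official GLUE scoring code.

```python
import math


def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn) for binary 0/1 labels (assumed encoding)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the gold label."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    total = tp + tn + fp + fn
    return (tp + tn) / total if total else 0.0


def f1_score(y_true, y_pred):
    """Harmonic mean of precision and recall for the positive class."""
    tp, _, fp, fn = confusion_counts(y_true, y_pred)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def matthews_corrcoef(y_true, y_pred):
    """Matthews correlation coefficient; returns 0.0 when undefined."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Toy labels, invented for illustration only.
y_true = [1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 0, 1, 1]
print(accuracy(y_true, y_pred), f1_score(y_true, y_pred), matthews_corrcoef(y_true, y_pred))
```

In practice a maintained library implementation would normally be preferred over hand-written metric code; the sketch only shows what each listed metric measures.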