gptkbp:instanceOf
|
gptkb:performance
|
gptkbp:assesses
|
gptkb:Matthews_correlation_coefficient
F1 score
accuracy
|
gptkbp:author
|
Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel Bowman
|
gptkbp:citation
|
gptkb:GLUE:_A_Multi-Task_Benchmark_and_Analysis_Platform_for_Natural_Language_Understanding
gptkb:ICLR_2019
2018
|
gptkbp:dataSource
|
public datasets
|
gptkbp:field
|
gptkb:machine_learning
natural language processing
|
gptkbp:focus
|
sentiment analysis
textual entailment
semantic similarity
paraphrase detection
linguistic acceptability
sentence-level tasks
|
gptkbp:fullName
|
gptkb:General_Language_Understanding_Evaluation
|
https://www.w3.org/2000/01/rdf-schema#label
|
GLUE benchmark
|
gptkbp:introduced
|
gptkb:Samuel_Bowman
gptkb:Alex_Wang
gptkb:Amanpreet_Singh
gptkb:Felix_Hill
gptkb:Julian_Michael
gptkb:Omer_Levy
|
gptkbp:introducedIn
|
2018
|
gptkbp:language
|
English
|
gptkbp:memiliki_tugas
|
gptkb:CoLA
gptkb:MNLI
gptkb:MRPC
gptkb:QNLI
gptkb:STS-B
gptkb:RTE
gptkb:SST-2
gptkb:WNLI
gptkb:QQP
|
gptkbp:notableModel
|
gptkb:GPT-2
gptkb:BERT
gptkb:ALBERT
gptkb:RoBERTa
gptkb:XLNet
|
gptkbp:openAccess
|
yes
|
gptkbp:purpose
|
evaluate performance of language models
|
gptkbp:relatedBenchmark
|
gptkb:SuperGLUE
|
gptkbp:status
|
widely used
retired leaderboard
|
gptkbp:successor
|
gptkb:SuperGLUE
|
gptkbp:type
|
multi-task benchmark
|
gptkbp:usedFor
|
benchmarking general language understanding
|
gptkbp:website
|
https://gluebenchmark.com/
|
gptkbp:bfsParent
|
gptkb:CoLA
gptkb:MNLI
gptkb:MRPC
gptkb:QNLI
gptkb:STS-B
gptkb:QQP
gptkb:Sam_Bowman
|
gptkbp:bfsLayer
|
6
|