GLUE benchmark

URI: https://gptkb.org/entity/GLUE_benchmark

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:performance
gptkbp:assesses	gptkb:Matthews_correlation_coefficient F1 score accuracy
gptkbp:author	Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel Bowman
gptkbp:citation	gptkb:GLUE:_A_Multi-Task_Benchmark_and_Analysis_Platform_for_Natural_Language_Understanding gptkb:ICLR_2019 2018
gptkbp:dataSource	public datasets
gptkbp:field	gptkb:machine_learning natural language processing
gptkbp:focus	sentiment analysis textual entailment semantic similarity paraphrase detection linguistic acceptability sentence-level tasks
gptkbp:fullName	gptkb:General_Language_Understanding_Evaluation
gptkbp:introduced	gptkb:Samuel_Bowman gptkb:Alex_Wang gptkb:Amanpreet_Singh gptkb:Felix_Hill gptkb:Julian_Michael gptkb:Omer_Levy
gptkbp:introducedIn	2018
gptkbp:language	English
gptkbp:memiliki_tugas	gptkb:CoLA gptkb:MNLI gptkb:MRPC gptkb:QNLI gptkb:STS-B gptkb:RTE gptkb:SST-2 gptkb:WNLI gptkb:QQP
gptkbp:notableModel	gptkb:GPT-2 gptkb:BERT gptkb:ALBERT gptkb:RoBERTa gptkb:XLNet
gptkbp:openAccess	yes
gptkbp:purpose	evaluate performance of language models
gptkbp:relatedBenchmark	gptkb:SuperGLUE
gptkbp:status	widely used retired leaderboard
gptkbp:successor	gptkb:SuperGLUE
gptkbp:type	multi-task benchmark
gptkbp:usedFor	benchmarking general language understanding
gptkbp:website	https://gluebenchmark.com/
gptkbp:bfsParent	gptkb:QQP
gptkbp:bfsLayer	6
http://www.w3.org/2000/01/rdf-schema#label	GLUE benchmark