MS MARCO: A Human Generated MAchine Reading COmprehension Dataset

GPTKB entity

Statements (32)
Predicate Object
gptkbp:instanceOf gptkb:dataset
gptkbp:abbreviation gptkb:MS_MARCO
gptkbp:author gptkb:Saurabh_Tiwary
gptkb:Mir_Rosenberg
gptkb:Rangan_Majumder
gptkb:Tri_Nguyen
gptkb:Xia_Song
gptkb:Jianfeng_Gao
gptkb:Li_Deng
gptkbp:citation gptkb:MS_MARCO:_A_Human_Generated_MAchine_Reading_COmprehension_Dataset
2016
gptkbp:contains passages from web documents retrieved by the Bing search engine
real anonymized user queries
human generated answers
gptkbp:creator gptkb:Microsoft
gptkbp:domain question answering
machine reading comprehension
gptkbp:format gptkb:JSON
https://www.w3.org/2000/01/rdf-schema#label MS MARCO: A Human Generated MAchine Reading COmprehension Dataset
gptkbp:language English
gptkbp:license custom MS MARCO license
gptkbp:numberOfPassages 8,841,823
gptkbp:numberOfQueries 1,010,916
gptkbp:relatedTo gptkb:SQuAD
gptkb:Natural_Questions
gptkb:TREC
gptkbp:releaseYear 2016
gptkbp:usedFor benchmarking question answering systems
training machine reading comprehension models
gptkbp:website https://microsoft.github.io/msmarco/
gptkbp:bfsParent gptkb:MS_MARCO
gptkbp:bfsLayer 7