ClueWeb 09

GPTKB entity

Statements (17)
Predicate Object
gptkbp:instanceOf web crawl dataset
gptkbp:access by request
gptkbp:contains web pages
gptkbp:createdBy gptkb:Carnegie_Mellon_University
gptkbp:format gptkb:WARC
gptkbp:homeTo https://lemurproject.org/clueweb09.php
https://www.w3.org/2000/01/rdf-schema#label ClueWeb 09
gptkbp:language multiple languages
gptkbp:releaseYear 2009
gptkbp:size 1 billion web pages
gptkbp:successor ClueWeb 12
gptkbp:usedFor natural language processing research
information retrieval research
gptkbp:usedIn gptkb:TREC_Web_Track
INEX Ad Hoc Track
gptkbp:bfsParent gptkb:XLNet
gptkbp:bfsLayer 6