ClueWeb09

GPTKB entity

Statements (18)
Predicate Object
gptkbp:instanceOf web crawl dataset
gptkbp:access licensed
gptkbp:containsDocuments 1 billion
gptkbp:createdBy gptkb:Carnegie_Mellon_University
gptkbp:format gptkb:WARC
gptkbp:hasSubgroup ClueWeb09 Category A
ClueWeb09 Category B
gptkbp:homeTo http://lemurproject.org/clueweb09/
https://www.w3.org/2000/01/rdf-schema#label ClueWeb09
gptkbp:language gptkb:Chinese
English
gptkbp:partOf gptkb:TREC_Web_Track
gptkbp:releaseYear 2009
gptkbp:usedFor natural language processing
web mining
information retrieval research
gptkbp:bfsParent gptkb:TREC_Web_Track
gptkbp:bfsLayer 7