ClueWeb12

GPTKB entity

Statements (16)
Predicate Object
gptkbp:instanceOf web crawl dataset
gptkbp:access licensed
gptkbp:containsDocuments 733 million
gptkbp:createdBy gptkb:Carnegie_Mellon_University
gptkbp:homeTo https://lemurproject.org/clueweb12/
https://www.w3.org/2000/01/rdf-schema#label ClueWeb12
gptkbp:language gptkb:Chinese
English
gptkbp:releaseYear 2012
gptkbp:size 27 terabytes
gptkbp:successor gptkb:ClueWeb09
gptkbp:usedFor natural language processing
web mining
information retrieval research
gptkbp:bfsParent gptkb:TREC_Web_Track
gptkbp:bfsLayer 7