Statements (17)
Predicate | Object |
---|---|
gptkbp:instanceOf |
web crawl dataset
|
gptkbp:access |
by request
|
gptkbp:contains |
web pages
|
gptkbp:createdBy |
gptkb:Carnegie_Mellon_University
|
gptkbp:format |
gptkb:WARC
|
gptkbp:homeTo |
https://lemurproject.org/clueweb09.php
|
https://www.w3.org/2000/01/rdf-schema#label |
ClueWeb 09
|
gptkbp:language |
multiple languages
|
gptkbp:releaseYear |
2009
|
gptkbp:size |
1 billion web pages
|
gptkbp:successor |
ClueWeb 12
|
gptkbp:usedFor |
natural language processing research
information retrieval research |
gptkbp:usedIn |
gptkb:TREC_Web_Track
INEX Ad Hoc Track |
gptkbp:bfsParent |
gptkb:XLNet
|
gptkbp:bfsLayer |
6
|