Statements (21)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:web_crawl_dataset
|
| gptkbp:access |
requires license
|
| gptkbp:citation |
Callan, Jamie, et al. 'The ClueWeb09 Dataset.' (2009)
|
| gptkbp:contains |
web pages
|
| gptkbp:createdBy |
gptkb:Carnegie_Mellon_University
|
| gptkbp:format |
gptkb:WARC
|
| gptkbp:homeTo |
https://lemurproject.org/clueweb09.php
|
| gptkbp:language |
gptkb:Chinese
English |
| gptkbp:notableCollection |
ClueWeb09 Category A
ClueWeb09 Category B |
| gptkbp:partOf |
gptkb:Lemur_Project
|
| gptkbp:releaseYear |
2009
|
| gptkbp:size |
1 billion web pages
|
| gptkbp:successor |
gptkb:ClueWeb12_dataset
|
| gptkbp:usedFor |
natural language processing
web mining information retrieval research |
| gptkbp:bfsParent |
gptkb:Lemur_Project
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
ClueWeb09 dataset
|