Statements (19)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf | gptkb:dataset |
| gptkbp:creator | gptkb:Facebook_AI_Research |
| gptkbp:domain | gptkb:news |
| gptkbp:fullName | gptkb:Common_Crawl_News_Dataset |
| gptkbp:language | English |
| gptkbp:purpose | language model pretraining |
| gptkbp:releaseYear | 2019 |
| gptkbp:size | 4 billion documents |
| gptkbp:source | gptkb:Common_Crawl |
| gptkbp:url | https://github.com/facebookresearch/cc_net |
| gptkbp:usedBy | gptkb:BART, gptkb:XLM-R, gptkb:mBART |
| gptkbp:usedFor | machine translation, text classification, language modeling |
| gptkbp:bfsParent | gptkb:OSCAR |
| gptkbp:bfsLayer | 7 |
| https://www.w3.org/2000/01/rdf-schema#label | CCNet |