Statements (16)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:dataset
|
| gptkbp:contains |
web pages
|
| gptkbp:createdBy |
gptkb:EleutherAI
|
| gptkbp:domain |
web crawl
|
| gptkbp:language |
English
|
| gptkbp:license |
CC BY 4.0
|
| gptkbp:partOf |
gptkb:The_Pile
|
| gptkbp:releaseYear |
2020
|
| gptkbp:size |
41.2GB
|
| gptkbp:source |
gptkb:Common_Crawl
|
| gptkbp:url |
https://pile.eleuther.ai/
|
| gptkbp:usedFor |
language model training
|
| gptkbp:bfsParent |
gptkb:The_Pile
gptkb:The_Pile:_An_800GB_Dataset_of_Diverse_Text_for_Language_Modeling |
| gptkbp:bfsLayer |
8
|
| https://www.w3.org/2000/01/rdf-schema#label |
Pile-CC
|