Open Super-large Crawled Aggregated coRpus
GPTKB entity
Statements (18)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:text
|
| gptkbp:abbreviation |
gptkb:OSCAR
|
| gptkbp:availableOn |
gptkb:Hugging_Face_Hub
|
| gptkbp:contains |
web pages
text data |
| gptkbp:firstReleased |
2019
|
| gptkbp:language |
over 100
multilingual |
| gptkbp:license |
CC BY 4.0
|
| gptkbp:maintainedBy |
gptkb:Hugging_Face
|
| gptkbp:size |
trillions of words
|
| gptkbp:source |
gptkb:Common_Crawl
|
| gptkbp:usedFor |
gptkb:machine_learning
natural language processing |
| gptkbp:website |
https://oscar-corpus.com/
|
| gptkbp:bfsParent |
gptkb:OSCAR
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
Open Super-large Crawled Aggregated coRpus
|