Statements (28)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:image-text_dataset
|
| gptkbp:availableOn |
https://laion.ai/blog/laion-5b/
|
| gptkbp:citation |
gptkb:Schuhmann_et_al.,_2022
|
| gptkbp:contains |
over 5 billion image-text pairs
|
| gptkbp:createdBy |
gptkb:LAION
|
| gptkbp:dataSource |
gptkb:Common_Crawl
|
| gptkbp:filtering |
NSFW filtering
image quality filtering language filtering |
| gptkbp:format |
Parquet
|
| gptkbp:includes |
CLIP embeddings
image URLs text captions |
| gptkbp:language |
multilingual
|
| gptkbp:license |
gptkb:Creative_Common
|
| gptkbp:notableFor |
training OpenCLIP
training Stable Diffusion |
| gptkbp:openSource |
true
|
| gptkbp:relatedTo |
gptkb:LAION-2B-en
gptkb:LAION-400M gptkb:LAION-Aesthetics |
| gptkbp:releaseYear |
2022
|
| gptkbp:size |
240TB uncompressed
|
| gptkbp:usedFor |
gptkb:machine_learning
training large vision-language models |
| gptkbp:bfsParent |
gptkb:Stable_Diffusion
|
| gptkbp:bfsLayer |
6
|
| https://www.w3.org/2000/01/rdf-schema#label |
LAION-5B
|