LAION-5B

GPTKB entity

Statements (28)
Predicate Object
gptkbp:instanceOf image-text dataset
gptkbp:availableOn https://laion.ai/blog/laion-5b/
gptkbp:citation gptkb:Schuhmann_et_al.,_2022
gptkbp:contains over 5 billion image-text pairs
gptkbp:createdBy gptkb:LAION
gptkbp:dataSource gptkb:Common_Crawl
gptkbp:filtering NSFW filtering
image quality filtering
language filtering
gptkbp:format Parquet
https://www.w3.org/2000/01/rdf-schema#label LAION-5B
gptkbp:includes CLIP embeddings
image URLs
text captions
gptkbp:language multilingual
gptkbp:license gptkb:Creative_Common
gptkbp:notableFor training OpenCLIP
training Stable Diffusion
gptkbp:openSource true
gptkbp:relatedTo gptkb:LAION-2B-en
gptkb:LAION-400M
gptkb:LAION-Aesthetics
gptkbp:releaseYear 2022
gptkbp:size 240TB uncompressed
gptkbp:usedFor gptkb:machine_learning
training large vision-language models
gptkbp:bfsParent gptkb:Stable_Diffusion
gptkbp:bfsLayer 5