Vision-Language Pretraining research community

GPTKB entity

Statements (28)
Predicate Object
gptkbp:instanceOf research
gptkbp:collaboratesWith academic researchers
industry researchers
gptkbp:focusesOn vision-language pretraining
https://www.w3.org/2000/01/rdf-schema#label Vision-Language Pretraining research community
gptkbp:notableWork gptkb:ALIGN
gptkb:BLIP
gptkb:CLIP
gptkb:ViLT
ALBEF
gptkbp:publishes gptkb:EMNLP
gptkb:CVPR
gptkb:ICCV
gptkb:NeurIPS
gptkb:ACL
gptkbp:relatedTo computer vision
natural language processing
multimodal learning
gptkbp:studies image captioning
image-text retrieval
visual question answering
multimodal representation learning
gptkbp:uses self-supervised learning
contrastive learning
large-scale datasets
transformer architectures
gptkbp:bfsParent gptkb:MiniGPT
gptkbp:bfsLayer 6