Vision-Language Pretraining research community

GPTKB entity

Predicate	Object
gptkbp:instanceOf	research
gptkbp:collaboratesWith	academic researchers industry researchers
gptkbp:focusesOn	vision-language pretraining
https://www.w3.org/2000/01/rdf-schema#label	Vision-Language Pretraining research community
gptkbp:notableWork	gptkb:ALIGN gptkb:BLIP gptkb:CLIP gptkb:ViLT ALBEF
gptkbp:publishes	gptkb:EMNLP gptkb:CVPR gptkb:ICCV gptkb:NeurIPS gptkb:ACL
gptkbp:relatedTo	computer vision natural language processing multimodal learning
gptkbp:studies	image captioning image-text retrieval visual question answering multimodal representation learning
gptkbp:uses	self-supervised learning contrastive learning large-scale datasets transformer architectures
gptkbp:bfsParent	gptkb:MiniGPT
gptkbp:bfsLayer	6