Vision-Language Pretraining research community
GPTKB entity
Statements (28)
Predicate | Object |
---|---|
gptkbp:instanceOf | research |
gptkbp:collaboratesWith | academic researchers, industry researchers |
gptkbp:focusesOn | vision-language pretraining |
https://www.w3.org/2000/01/rdf-schema#label | Vision-Language Pretraining research community |
gptkbp:notableWork | gptkb:ALIGN, gptkb:BLIP, gptkb:CLIP, gptkb:ViLT, ALBEF |
gptkbp:publishes | gptkb:EMNLP, gptkb:CVPR, gptkb:ICCV, gptkb:NeurIPS, gptkb:ACL |
gptkbp:relatedTo | computer vision, natural language processing, multimodal learning |
gptkbp:studies | image captioning, image-text retrieval, visual question answering, multimodal representation learning |
gptkbp:uses | self-supervised learning, contrastive learning, large-scale datasets, transformer architectures |
gptkbp:bfsParent | gptkb:MiniGPT |
gptkbp:bfsLayer | 6 |