InternVL

GPTKB entity

Statements (20)
Predicate Object
gptkbp:instanceOf multimodal AI model
gptkbp:architecture transformer-based
gptkbp:developedBy gptkb:Shanghai_AI_Laboratory
gptkbp:hasDemo https://opencompass.org.cn/internvl
https://www.w3.org/2000/01/rdf-schema#label InternVL
gptkbp:language gptkb:Chinese
gptkbp:language English
gptkbp:openSource yes
gptkbp:relatedTo gptkb:BLIP
gptkbp:relatedTo gptkb:CLIP
gptkbp:releaseYear 2023
gptkbp:supports image captioning
gptkbp:supports image-text retrieval
gptkbp:supports visual question answering
gptkbp:supports vision-language tasks
gptkbp:trainer large-scale image-text datasets
gptkbp:usedFor multimodal understanding
gptkbp:usedFor multimodal generation
gptkbp:bfsParent gptkb:Hugging_Face_models
gptkbp:bfsLayer 7