ViLBERT

GPTKB entity

Statements (28)
Predicate Object
gptkbp:instanceOf multimodal AI model
gptkbp:architecture two-stream model
gptkbp:author gptkb:Devi_Parikh
gptkb:Dhruv_Batra
Jiasen Lu
Stefan Lee
gptkbp:basedOn gptkb:BERT
gptkbp:developedBy gptkb:Facebook_AI_Research
gptkbp:enables image captioning
visual question answering
visual commonsense reasoning
gptkbp:handles vision and language tasks
https://www.w3.org/2000/01/rdf-schema#label ViLBERT
gptkbp:input gptkb:text
images
gptkbp:introducedIn 2019
gptkbp:notablePublication ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
gptkbp:publishedIn gptkb:NeurIPS_2019
gptkbp:relatedTo gptkb:UNITER
gptkb:LXMERT
VisualBERT
gptkbp:repository https://github.com/facebookresearch/vilbert-multi-task
gptkbp:trainer gptkb:COCO
gptkb:Visual_Genome
gptkb:Conceptual_Captions
gptkbp:uses gptkb:transformation
gptkbp:bfsParent gptkb:Visual_Question_Answering
gptkbp:bfsLayer 7