ImageBind: One Embedding Space To Bind Them All

GPTKB entity

Statements (30)
Predicate Object
gptkbp:instanceOf gptkb:academic_journal
gptkbp:arXivID 2304.08611
gptkbp:author gptkb:Han_Zhang
gptkb:Jitendra_Malik
gptkb:Xinyang_Geng
Chengyao Chen
Dhruv Mahajan
Hao Zhang
Jing Yu Koh
Rohit Girdhar
Yixuan Li
gptkbp:citation high (hundreds, as of 2024)
gptkbp:contribution proposes a model that learns a joint embedding space for six modalities
gptkbp:enables zero-shot learning
cross-modal retrieval
multimodal generation
gptkbp:field representation learning
multimodal machine learning
https://www.w3.org/2000/01/rdf-schema#label ImageBind: One Embedding Space To Bind Them All
gptkbp:mode gptkb:illustrator
gptkb:text
audio
depth
thermal
IMU (inertial measurement unit)
gptkbp:publicationDate 2023
gptkbp:publishedBy gptkb:Meta_AI
gptkbp:repository https://github.com/facebookresearch/ImageBind
gptkbp:bfsParent gptkb:ImageBind
gptkbp:bfsLayer 6