Language Is Not All You Need: Aligning Perception with Language Models

GPTKB entity

Statements (27)
Predicate Object
gptkbp:instanceOf gptkb:academic_journal
gptkbp:arXivID 2205.01697
gptkbp:author gptkb:Wei_Li
Yifan Xu
Qi Tian
Yunhe Wang
Jianmin Wang
Qinghao Hu
Tong Zhang
Yuhui Yuan
Zhenda Xie
Zhenguo Li
Zihang Jiang
gptkbp:citation high (hundreds, as of 2024)
gptkbp:contribution introduces Perceiver-LLM, a model aligning perception with language models
gptkbp:field gptkb:artificial_intelligence
gptkb:machine_learning
gptkbp:focusesOn large language models
multimodal learning
vision-language models
https://www.w3.org/2000/01/rdf-schema#label Language Is Not All You Need: Aligning Perception with Language Models
gptkbp:language English
gptkbp:proposedBy Perceiver-LLM architecture
gptkbp:publicationYear 2022
gptkbp:publishedIn gptkb:arXiv
gptkbp:bfsParent gptkb:KOSMOS
gptkbp:bfsLayer 8