Statements (31)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:vision_transformer
gptkb:model |
| gptkbp:abbreviation |
gptkb:PVT
|
| gptkbp:application |
image classification
object detection semantic segmentation |
| gptkbp:architecture |
hierarchical transformer
|
| gptkbp:arXivID |
2102.12122
|
| gptkbp:author |
gptkb:Xiaogang_Wang
gptkb:Yun_Liu gptkb:Tianheng_Cheng gptkb:Jifeng_Dai gptkb:Ziwei_Liu Huiyu Wang Jiaming Sun Yanghao Li Yukun Zhu |
| gptkbp:feature |
pyramid structure
multi-scale feature representation spatial reduction attention |
| gptkbp:field |
gptkb:machine_learning
computer vision |
| gptkbp:improves |
gptkb:ResNet
gptkb:Vision_Transformer |
| gptkbp:inspiredBy |
gptkb:Vision_Transformer
|
| gptkbp:introducedIn |
2021
|
| gptkbp:openSource |
yes
|
| gptkbp:publishedIn |
gptkb:arXiv
|
| gptkbp:bfsParent |
gptkb:Vision_Transformer
|
| gptkbp:bfsLayer |
8
|
| https://www.w3.org/2000/01/rdf-schema#label |
Pyramid Vision Transformer
|