Statements (28)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:Vision_Transformer_model
|
| gptkbp:activatedBy |
gptkb:GELU
|
| gptkbp:application |
Image classification
|
| gptkbp:architecture |
gptkb:transformation
|
| gptkbp:attentionMechanism |
Self-attention
|
| gptkbp:developedBy |
gptkb:Google_Research
|
| gptkbp:hiddenSize |
768
|
| gptkbp:input |
Image patches
|
| gptkbp:introducedIn |
2020
|
| gptkbp:level |
12
|
| gptkbp:notableFauna |
12
|
| gptkbp:notablePublication |
gptkb:An_Image_is_Worth_16x16_Words:_Transformers_for_Image_Recognition_at_Scale
|
| gptkbp:openSource |
gptkb:Hugging_Face_Transformers
timm |
| gptkbp:parameter |
86 million
|
| gptkbp:platform |
gptkb:TensorFlow
gptkb:PyTorch |
| gptkbp:relatedTo |
gptkb:ViT-H
gptkb:ViT-L |
| gptkbp:resolution |
224x224
|
| gptkbp:size |
16x16
|
| gptkbp:trainer |
gptkb:JFT-300M
ImageNet-21k |
| gptkbp:usedFor |
Fine-tuning
Transfer learning |
| gptkbp:bfsParent |
gptkb:Vision_Transformer
|
| gptkbp:bfsLayer |
8
|
| https://www.w3.org/2000/01/rdf-schema#label |
ViT-B
|