ViT

URI: https://gptkb.org/entity/ViT

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:convolutional_neural_network
gptkbp:appliesTo	image classification
gptkbp:author	gptkb:Alexey_Dosovitskiy gptkb:Jakob_Uszkoreit gptkb:Alexander_Kolesnikov gptkb:Dirk_Weissenborn gptkb:Georg_Heigold gptkb:Lucas_Beyer gptkb:Matthias_Minderer gptkb:Mostafa_Dehghani gptkb:Neil_Houlsby gptkb:Sylvain_Gelly gptkb:Thomas_Unterthiner gptkb:Xiaohua_Zhai
gptkbp:basedOn	transformer architecture
gptkbp:citation	high
gptkbp:developedBy	gptkb:Google_Research
gptkbp:fullName	gptkb:Vision_Transformer
gptkbp:improves	convolutional neural networks (on large datasets)
gptkbp:input	image patches
gptkbp:inspiredBy	subsequent vision transformer models
gptkbp:introducedIn	2020
gptkbp:openSource	gptkb:TensorFlow gptkb:PyTorch
gptkbp:publishedIn	gptkb:An_Image_is_Worth_16x16_Words:_Transformers_for_Image_Recognition_at_Scale
gptkbp:uses	self-attention mechanism
gptkbp:bfsParent	gptkb:Hugging_Face_models gptkb:SAM_model
gptkbp:bfsLayer	7
http://www.w3.org/2000/01/rdf-schema#label	ViT