Dei T-Small

GPTKB entity

Statements (70)
Predicate Object
gptkbp:instance_of gptkb:Transformers
gptkbp:application Image classification
gptkbp:architecture gptkb:Transformers
gptkbp:available_in gptkb:Hugging_Face_Model_Hub
gptkbp:batch_size gptkb:32
gptkbp:can_be_fine_tuned gptkb:Yes
gptkbp:coat_of_arms gptkb:12
gptkbp:developed_by gptkb:Facebook_AI_Research
gptkbp:drops 0.1
gptkbp:evaluates Accuracy
Image Net validation set
gptkbp:focus_area Self-attention
gptkbp:framework_version 1.8.0
gptkbp:headcount gptkb:6
gptkbp:hidden_size 384
https://www.w3.org/2000/01/rdf-schema#label Dei T-Small
gptkbp:image_input_size 224x224
gptkbp:initialization_method Xavier initialization
gptkbp:input_normalization Mean subtraction
gptkbp:input_output Class probabilities
RGB images
gptkbp:intermediate_size 1536
gptkbp:is_a_framework_for gptkb:Py_Torch
gptkbp:is_optimized_for gptkb:Adam_W
gptkbp:is_taught_in 0.001
gptkbp:losses Cross-entropy loss
gptkbp:max_sequence_length gptkb:197
gptkbp:model Small
gptkbp:model_checkpointing gptkb:Yes
gptkbp:orbital_period 22 million
gptkbp:output_activation_function Softmax
gptkbp:performance Top-1 accuracy
gptkbp:predecessor gptkb:Vi_T
gptkbp:provides_information_on gptkb:Image_Net
gptkb:Yes
gptkbp:related_to gptkb:Vision_Transformers
gptkbp:release_year gptkb:2021
gptkbp:successor gptkb:Dei_T-Base
gptkbp:supports_distributed_training gptkb:Yes
gptkbp:supports_mixed_precision_training gptkb:Yes
gptkbp:supports_multi_gputraining gptkb:Yes
gptkbp:supports_quantization gptkb:Yes
gptkbp:supports_transfer_learning gptkb:Yes
gptkbp:top-1_accuracy 79.9%
gptkbp:top-5_accuracy 95.2%
gptkbp:training Supervised learning
Less than 1 day
gptkbp:uses distillation
gptkbp:uses_auto_augment gptkb:Yes
gptkbp:uses_batch_normalization No
gptkbp:uses_cut_mix gptkb:Yes
gptkbp:uses_data_parallelism gptkb:Yes
gptkbp:uses_early_stopping gptkb:Yes
gptkbp:uses_ensemble_learning No
gptkbp:uses_gradient_clipping gptkb:Yes
gptkbp:uses_label_smoothing gptkb:Yes
gptkbp:uses_layer_normalization gptkb:Yes
gptkbp:uses_learning_rate_scheduler gptkb:Yes
gptkbp:uses_mix_up gptkb:Yes
gptkbp:uses_positional_encoding gptkb:Yes
gptkbp:uses_random_erasing gptkb:Yes
gptkbp:uses_residual_connections gptkb:Yes
gptkbp:uses_self_supervised_learning No
gptkbp:uses_semi_supervised_learning No
gptkbp:uses_test_time_augmentation gptkb:Yes
gptkbp:uses_transfer_learning gptkb:Yes
gptkbp:uses_unsupervised_learning No
gptkbp:uses_weight_decay gptkb:Yes
gptkbp:bfsParent gptkb:Dei_T
gptkbp:bfsLayer 5