first-generation Transformer Engine

GPTKB entity

Statements (17)
Predicate Object
gptkbp:instanceOf AI hardware component
gptkbp:announced 2022
gptkbp:developedBy gptkb:NVIDIA
gptkbp:enables higher throughput for AI workloads
reduced memory usage for AI models
gptkbp:feature automated precision scaling
dynamic mixed-precision computation
https://www.w3.org/2000/01/rdf-schema#label first-generation Transformer Engine
gptkbp:introducedIn gptkb:NVIDIA_Hopper_architecture
gptkbp:purpose accelerate transformer model inference
accelerate transformer model training
gptkbp:supports FP16 precision
FP8 precision
BF16 precision
gptkbp:usedIn gptkb:NVIDIA_H100_GPU
gptkbp:bfsParent gptkb:second-generation_Transformer_Engine
gptkbp:bfsLayer 7