first-generation Transformer Engine
GPTKB entity
Statements (17)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:AI_hardware_component
|
| gptkbp:announced |
2022
|
| gptkbp:developedBy |
gptkb:NVIDIA
|
| gptkbp:enables |
higher throughput for AI workloads
reduced memory usage for AI models |
| gptkbp:feature |
automated precision scaling
dynamic mixed-precision computation |
| gptkbp:introducedIn |
gptkb:NVIDIA_Hopper_architecture
|
| gptkbp:purpose |
accelerate transformer model inference
accelerate transformer model training |
| gptkbp:supports |
FP16 precision
FP8 precision BF16 precision |
| gptkbp:usedIn |
gptkb:NVIDIA_H100_GPU
|
| gptkbp:bfsParent |
gptkb:second-generation_Transformer_Engine
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
first-generation Transformer Engine
|