GShard

GPTKB entity

Statements (27)
Predicate Object
gptkbp:instanceOf machine learning framework
gptkbp:author Dmitry Lepikhin
HyoukJoong Lee
Yuanzhong Xu
Dehao Chen
Orhan Firat
Yanping Huang
Maxim Krikun
gptkb:Noam_Shazeer
Zhifeng Chen
gptkbp:developedBy gptkb:Google
gptkbp:enables conditional computation
automatic sharding of models
training of models with more than 600 billion parameters
https://www.w3.org/2000/01/rdf-schema#label GShard
gptkbp:notablePublication GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
gptkbp:openSource no
gptkbp:publishedIn gptkb:arXiv
gptkbp:relatedTo gptkb:TensorFlow
gptkb:Mixture_of_Experts
gptkb:TPU
gptkbp:releaseYear 2020
gptkbp:supports model parallelism
gptkbp:usedFor scaling deep learning models
gptkbp:usedIn gptkb:Google_Translate
gptkbp:bfsParent gptkb:MoE_Transformer
gptkbp:bfsLayer 6