Statements (30)
Predicate | Object |
---|---|
gptkbp:instanceOf |
machine learning framework
|
gptkbp:author |
gptkb:Ashish_Vaswani
gptkb:Niki_Parmar gptkb:Zhenzhong_Lan gptkb:Peter_Hawkins gptkb:Noam_Shazeer gptkb:Quoc_V._Le gptkb:Jeffrey_Dean Dustin Tran HyoukJoong Lee Jiquan Ngiam Penporn Koanantakool Youlong Cheng |
gptkbp:developedBy |
gptkb:Google
|
gptkbp:enables |
conditional computation
automatic sharding of models training of trillion-parameter models |
https://www.w3.org/2000/01/rdf-schema#label |
GShard
|
gptkbp:notablePublication |
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
|
gptkbp:openSource |
no
|
gptkbp:publishedIn |
gptkb:arXiv
|
gptkbp:relatedTo |
gptkb:TensorFlow
gptkb:Mixture_of_Experts gptkb:TPU |
gptkbp:releaseYear |
2020
|
gptkbp:supports |
model parallelism
|
gptkbp:usedFor |
scaling deep learning models
|
gptkbp:usedIn |
gptkb:Google_Translate
|
gptkbp:bfsParent |
gptkb:MoE_Transformer
|
gptkbp:bfsLayer |
6
|