Statements (31)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:machine_learning_framework
|
| gptkbp:author |
gptkb:Ashish_Vaswani
gptkb:Niki_Parmar gptkb:Zhenzhong_Lan gptkb:Peter_Hawkins gptkb:Noam_Shazeer gptkb:Quoc_V._Le gptkb:Jeffrey_Dean Dustin Tran HyoukJoong Lee Jiquan Ngiam Penporn Koanantakool Youlong Cheng |
| gptkbp:developedBy |
gptkb:Google
|
| gptkbp:enables |
conditional computation
automatic sharding of models training of trillion-parameter models |
| gptkbp:notablePublication |
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding
|
| gptkbp:openSource |
no
|
| gptkbp:publishedIn |
gptkb:arXiv
|
| gptkbp:relatedTo |
gptkb:TensorFlow
gptkb:Mixture_of_Experts gptkb:TPU |
| gptkbp:releaseYear |
2020
|
| gptkbp:supports |
model parallelism
|
| gptkbp:usedFor |
scaling deep learning models
|
| gptkbp:usedIn |
gptkb:Google_Translate
|
| gptkbp:bfsParent |
gptkb:Mixture_of_Experts_(MoE)
gptkb:Mixture_of_experts |
| gptkbp:bfsLayer |
8
|
| https://www.w3.org/2000/01/rdf-schema#label |
GShard
|