Statements (18)
Predicate | Object |
---|---|
gptkbp:instanceOf | gptkb:academic_journal |
gptkbp:affiliation | gptkb:Google_Research |
gptkbp:arXivID | 2101.03961 |
gptkbp:author | gptkb:Noam_Shazeer, gptkb:Barret_Zoph, gptkb:William_Fedus |
gptkbp:citation | high (hundreds to thousands) |
gptkbp:field | gptkb:artificial_intelligence, gptkb:machine_learning |
gptkbp:focusesOn | gptkb:Switch_Transformer |
https://www.w3.org/2000/01/rdf-schema#label | Fedus et al., 2021 |
gptkbp:impact | influential in large language model scaling |
gptkbp:proposedBy | Switch Transformer architecture |
gptkbp:publishedIn | gptkb:arXiv |
gptkbp:title | gptkb:Switch_Transformers:_Scaling_to_Trillion_Parameter_Models_with_Simple_and_Efficient_Sparsity |
gptkbp:year | 2021 |
gptkbp:bfsParent | gptkb:MoE_Transformer |
gptkbp:bfsLayer | 6 |