Statements (30)
Predicate | Object |
---|---|
gptkbp:instanceOf |
large language model
|
gptkbp:architecture |
gptkb:transformation
|
gptkbp:attentionMechanism |
disentangled attention
|
gptkbp:author |
gptkb:Pengcheng_He
gptkb:Weizhu_Chen gptkb:Xiaodong_Liu gptkb:Jianfeng_Gao |
gptkbp:availableOn |
gptkb:Hugging_Face_Model_Hub
|
gptkbp:basedOn |
gptkb:DeBERTa
|
gptkbp:developedBy |
gptkb:Microsoft
|
gptkbp:fineTunedWith |
true
|
https://www.w3.org/2000/01/rdf-schema#label |
DeBERTa-Large
|
gptkbp:improves |
gptkb:BERT
gptkb:RoBERTa |
gptkbp:language |
English
|
gptkbp:license |
gptkb:MIT
|
gptkbp:notablePublication |
gptkb:DeBERTa:_Decoding-enhanced_BERT_with_Disentangled_Attention
|
gptkbp:openSource |
true
|
gptkbp:parameter |
435 million
|
gptkbp:pdf |
https://arxiv.org/abs/2006.03654
|
gptkbp:releaseYear |
2021
|
gptkbp:size |
large
|
gptkbp:tokenizer |
gptkb:WordPiece
|
gptkbp:trainer |
large text corpora
|
gptkbp:usedFor |
question answering
natural language understanding text classification named entity recognition |
gptkbp:bfsParent |
gptkb:DeBERTa
|
gptkbp:bfsLayer |
6
|