Statements (23)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:large_language_model
|
| gptkbp:architecture |
gptkb:transformation
|
| gptkbp:attentionMechanism |
disentangled attention
|
| gptkbp:availableOn |
gptkb:Hugging_Face_Model_Hub
|
| gptkbp:basedOn |
gptkb:DeBERTa
|
| gptkbp:developedBy |
gptkb:Microsoft
|
| gptkbp:fineTunedWith |
true
|
| gptkbp:inputLengthLimit |
512 tokens
|
| gptkbp:language |
English
|
| gptkbp:notablePublication |
gptkb:DeBERTa:_Decoding-enhanced_BERT_with_Disentangled_Attention
https://arxiv.org/abs/2006.03654 |
| gptkbp:openSource |
true
|
| gptkbp:parameter |
139 million
|
| gptkbp:releaseYear |
2021
|
| gptkbp:tokenizerType |
gptkb:WordPiece
|
| gptkbp:trainer |
large text corpora
|
| gptkbp:usedFor |
question answering
natural language understanding text classification named entity recognition |
| gptkbp:bfsParent |
gptkb:DeBERTa
|
| gptkbp:bfsLayer |
8
|
| https://www.w3.org/2000/01/rdf-schema#label |
DeBERTa-Base
|