XLNet

GPTKB entity

Statements (54)
Predicate Object
gptkbp:instanceOf gptkb:model
large language model
gptkbp:application question answering
sentiment analysis
text classification
language understanding
natural language inference
gptkbp:architecture gptkb:transformation
gptkbp:author gptkb:Jaime_Carbonell
gptkb:Ruslan_Salakhutdinov
gptkb:Yiming_Yang
gptkb:Samy_Bengio
gptkb:Zhilin_Yang
gptkb:Zihang_Dai
gptkb:Quoc_V._Le
gptkbp:basedOn Transformer architecture
gptkbp:developedBy gptkb:Carnegie_Mellon_University
gptkb:Google_Brain
https://www.w3.org/2000/01/rdf-schema#label XLNet
gptkbp:improves gptkb:GPT-2
gptkb:ERNIE
gptkb:BERT
gptkb:RoBERTa
gptkbp:input gptkb:text
gptkbp:language English
gptkbp:license Apache 2.0
gptkbp:notableFor permutation-based training objective
state-of-the-art results in 2019
gptkbp:notablePublication gptkb:XLNet:_Generalized_Autoregressive_Pretraining_for_Language_Understanding
https://arxiv.org/abs/1906.08237
gptkbp:openSource true
gptkbp:output predicted tokens
text embeddings
gptkbp:parameter 110 million (XLNet-Base)
340 million (XLNet-Large)
gptkbp:predecessor gptkb:BERT
gptkbp:publicationYear 2019
gptkbp:repository https://github.com/zihangdai/xlnet
gptkbp:supports transfer learning
fine-tuning
gptkbp:trainer gptkb:Wikipedia
gptkb:ClueWeb_09
gptkb:Giga5
gptkb:Common_Crawl
gptkb:BooksCorpus
gptkbp:uses autoregressive modeling
permutation language modeling
relative positional encoding
segment recurrence mechanism
gptkbp:bfsParent gptkb:large_language_model
gptkb:transformation
gptkb:convolutional_neural_network
gptkb:Large_Language_Models
gptkbp:bfsLayer 5