XLNet

URI: https://gptkb.org/entity/XLNet

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:large_language_model gptkb:model
gptkbp:application	question answering; sentiment analysis; text classification; language understanding; natural language inference
gptkbp:architecture	Transformer-XL
gptkbp:author	gptkb:Jaime_Carbonell gptkb:Ruslan_Salakhutdinov gptkb:Yiming_Yang gptkb:Zhilin_Yang gptkb:Zihang_Dai gptkb:Quoc_V._Le
gptkbp:basedOn	Transformer architecture
gptkbp:developedBy	gptkb:Carnegie_Mellon_University gptkb:Google_Brain
gptkbp:improves	gptkb:GPT-2 gptkb:ERNIE gptkb:BERT gptkb:RoBERTa
gptkbp:input	gptkb:text
gptkbp:language	English
gptkbp:license	Apache 2.0
gptkbp:notableFor	permutation-based training objective; state-of-the-art results in 2019
gptkbp:notablePublication	gptkb:XLNet:_Generalized_Autoregressive_Pretraining_for_Language_Understanding https://arxiv.org/abs/1906.08237
gptkbp:openSource	true
gptkbp:output	predicted tokens; text embeddings
gptkbp:parameter	110 million (XLNet-Base); 340 million (XLNet-Large)
gptkbp:predecessor	gptkb:BERT
gptkbp:publicationYear	2019
gptkbp:repository	https://github.com/zihangdai/xlnet
gptkbp:supports	transfer learning; fine-tuning
gptkbp:trainer	gptkb:Wikipedia gptkb:ClueWeb_09 gptkb:Giga5 gptkb:Common_Crawl gptkb:BooksCorpus
gptkbp:uses	autoregressive modeling; permutation language modeling; relative positional encoding; segment recurrence mechanism
gptkbp:bfsParent	gptkb:Stanford_Question_Answering_Dataset_(SQuAD) gptkb:Question_Answering
gptkbp:bfsLayer	6
http://www.w3.org/2000/01/rdf-schema#label	XLNet
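
Illustrative sketch (not part of the GPTKB record): the permutation language modeling entry under gptkbp:uses can be pictured as sampling a factorization order over token positions and letting each position see only the positions that come earlier in that order. The function name permutation_mask and its parameters are hypothetical and chosen for illustration. This is a simplified, standard-library-only sketch of that visibility mask, not the implementation in the XLNet repository.

```python
import random


def permutation_mask(seq_len, seed=None):
    """Sample a factorization order and build a visibility mask.

    Hypothetical helper for illustration only.
    mask[i][j] is True when position i may attend to position j, i.e. when
    j comes strictly earlier than i in the sampled factorization order.
    """
    rng = random.Random(seed)

    # Sample a random factorization order over the positions 0..seq_len-1.
    order = list(range(seq_len))
    rng.shuffle(order)

    # rank[pos] = step at which position `pos` is predicted.
    rank = [0] * seq_len
    for step, pos in enumerate(order):
        rank[pos] = step

    # A position sees only positions predicted before it (never itself).
    mask = [[rank[j] < rank[i] for j in range(seq_len)] for i in range(seq_len)]
    return order, mask


if __name__ == "__main__":
    order, mask = permutation_mask(4, seed=0)
    print("factorization order:", order)
    for i, row in enumerate(mask):
        visible = [j for j, ok in enumerate(row) if ok]
        print(f"position {i} can see positions {visible}")
```

The first position in the sampled order sees nothing and the last sees every other position, so across many sampled orders each token is predicted from many different subsets of its context while the model remains autoregressive.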