Cross-lingual Language Model Pretraining

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:machine_learning_technique
gptkbp:application	multilingual NLP
gptkbp:approach	self-supervised learning supervised learning unsupervised learning
gptkbp:architecture	gptkb:transformation encoder-decoder decoder-only encoder-only
gptkbp:benefit	language transfer multilingual applications improved performance on low-resource languages
gptkbp:challenge	data imbalance low-resource languages alignment of representations tokenization across languages
gptkbp:enables	zero-shot learning cross-lingual transfer multilingual understanding
gptkbp:field	natural language processing
gptkbp:goal	learn representations across languages
gptkbp:improves	machine translation named entity recognition cross-lingual question answering cross-lingual text classification
gptkbp:key	gptkb:Unsupervised_Cross-lingual_Representation_Learning_at_Scale_(Conneau_et_al.,_2020) gptkb:XLM:_Cross-lingual_Language_Model_Pretraining_(Lample_and_Conneau,_2019)
gptkbp:notableModel	gptkb:mBERT gptkb:XLM gptkb:XLM-R gptkb:InfoXLM gptkb:LaBSE gptkb:Unicoder
gptkbp:proposedBy	2019
gptkbp:relatedTo	gptkb:mBERT gptkb:XLM gptkb:XLM-R gptkb:BERT
gptkbp:usedBy	gptkb:Google gptkb:Microsoft_Research gptkb:Hugging_Face gptkb:Facebook_AI
gptkbp:uses	masked language modeling monolingual corpora parallel corpora translation language modeling
gptkbp:bfsParent	gptkb:Guillaume_Lample
gptkbp:bfsLayer	7
http://www.w3.org/2000/01/rdf-schema#label	Cross-lingual Language Model Pretraining