arXiv:1810.09434

GPTKB entity

Statements (32)
Predicate Object
gptkbp:instanceOf gptkb:academic_journal
gptkbp:allows We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers. Unlike recent language representation models, BERT is designed to pre-train deep bidirectional representations by jointly conditioning on both left and right context in all layers. As a result, the pre-trained BERT model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks, such as question answering and language inference, without substantial task-specific architecture modifications.
gptkbp:author gptkb:Jacob_Devlin
gptkb:Kenton_Lee
gptkb:Ming-Wei_Chang
gptkb:Kristina_Toutanova
gptkbp:category cs.CL
stat.ML
gptkbp:citation thousands of scientific papers
gptkbp:contribution Demonstrated state-of-the-art results on multiple NLP tasks
Introduced BERT model for NLP
Popularized transformer-based pre-training for language understanding
https://www.w3.org/2000/01/rdf-schema#label arXiv:1810.09434
gptkbp:influencedBy gptkb:machine_learning
deep learning
natural language processing
question answering
sentiment analysis
text classification
named entity recognition
language inference
transformer models
gptkbp:language English
gptkbp:license arXiv.org perpetual, non-exclusive license
gptkbp:memberSchool gptkb:Google_AI_Language
gptkbp:openAccess true
gptkbp:pdf https://arxiv.org/pdf/1810.09434.pdf
gptkbp:publicationDate 2018-10-11
gptkbp:title gptkb:BERT:_Pre-training_of_Deep_Bidirectional_Transformers_for_Language_Understanding
gptkbp:url https://arxiv.org/abs/1810.09434
gptkbp:bfsParent gptkb:Unitary_Coupled_Cluster_Method
gptkbp:bfsLayer 7