WikiText-103

GPTKB entity

Statements (21)
Predicate Object
gptkbp:instanceOf gptkb:text
gptkbp:availableOn https://blog.einstein.ai/the-wikitext-long-term-dependency-language-modeling-dataset/
gptkbp:contains over 100 million tokens
over 28,000 Wikipedia articles
gptkbp:createdBy gptkb:Salesforce_Research
gptkbp:hasSubgroup gptkb:WikiText-2
gptkbp:language English
gptkbp:license gptkb:Creative_Commons_Attribution-ShareAlike_3.0_Unported_License
gptkbp:partOf WikiText dataset family
gptkbp:releaseYear 2016
gptkbp:usedBy gptkb:Transformer_models
gptkb:GPT-2
gptkb:BERT
gptkbp:usedFor language modeling
benchmarking language models
gptkbp:usedIn gptkb:machine_learning
deep learning
natural language processing
gptkbp:bfsParent gptkb:WikiText
gptkbp:bfsLayer 8
https://www.w3.org/2000/01/rdf-schema#label WikiText-103