WordPiece

GPTKB entity

Statements (59)
Predicate Object
gptkbp:instanceOf subword tokenization algorithm
gptkbp:appliesTo natural language processing
gptkbp:createdBy subword_units
gptkbp:developedBy gptkb:Google
gptkbp:disassembly words
https://www.w3.org/2000/01/rdf-schema#label WordPiece
gptkbp:improves vocabulary coverage
gptkbp:isAttendedBy industry applications
research community
gptkbp:isBasedOn character-level tokenization
gptkbp:isCompatibleWith transfer learning
fine-tuning
gptkbp:isDocumentedIn research papers
technical documentation
gptkbp:isEvaluatedBy accuracy
perplexity
benchmark datasets
gptkbp:isExaminedBy tutorials
online courses
gptkbp:isFacilitatedBy rare words
gptkbp:isInfluencedBy linguistic morphology
statistical language modeling
gptkbp:isLocatedIn gptkb:PyTorch
TensorFlow
gptkbp:isOptimizedFor large datasets
gptkbp:isPartOf preprocessing pipeline
NLP_frameworks
NLP_toolkits
gptkbp:isRelatedTo contextual embeddings
word embeddings
gptkbp:isSimilarTo Byte_Pair_Encoding
gptkbp:isSupportedBy community contributions
open-source projects
gptkbp:isTrainedIn text data
corpora
gptkbp:isUsedBy DistilBERT
ELECTRA
gptkbp:isUsedFor machine translation
question answering
sentiment analysis
text classification
named entity recognition
gptkbp:isUsedIn data preprocessing
dialog systems
MobileBERT
chatbots
semantic analysis
information retrieval
text generation
text mining
voice assistants
text summarization
LayoutLM
language understanding tasks
gptkbp:isVisitedBy language model training
gptkbp:reduces out-of-vocabulary words
gptkbp:supports subword_regularization
gptkbp:usedIn BERT
Transformer models
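
Example (illustrative sketch, not part of the GPTKB statements): the statements above describe WordPiece as splitting words into subword units and reducing out-of-vocabulary words. A minimal way to picture the tokenization step is the greedy longest-match-first lookup used by BERT-style tokenizers. The function name `wordpiece_tokenize`, the toy vocabulary, and the `##` continuation prefix and `[UNK]` token are assumptions made for this illustration; real vocabularies are learned from a corpus and production tokenizers also handle normalization and word-length limits.

```python
def wordpiece_tokenize(word, vocab, unk_token="[UNK]", prefix="##"):
    """Greedy longest-match-first split of a single word into subword pieces.

    Hypothetical helper for illustration; `vocab` is any set of known pieces.
    """
    pieces = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Shrink the candidate from the right until it appears in the vocabulary.
        while start < end:
            candidate = word[start:end]
            if start > 0:
                # Pieces that do not start the word carry the continuation prefix.
                candidate = prefix + candidate
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            # No piece fits at this position: the whole word maps to the unknown token.
            return [unk_token]
        pieces.append(match)
        start = end
    return pieces


# Toy vocabulary (assumed for the example, not a real model vocabulary).
toy_vocab = {"un", "##aff", "##able", "play", "##ing", "[UNK]"}

print(wordpiece_tokenize("unaffable", toy_vocab))  # ['un', '##aff', '##able']
print(wordpiece_tokenize("playing", toy_vocab))    # ['play', '##ing']
print(wordpiece_tokenize("xyz", toy_vocab))        # ['[UNK]']
```

In this sketch a rare word such as "unaffable" is still representable because it decomposes into known pieces, which is the sense in which a subword vocabulary reduces out-of-vocabulary words.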