N-gram models

GPTKB entity

Statements (33)
Predicate Object
gptkbp:instanceOf statistical language model
gptkbp:alternativeTo neural language models
gptkbp:application information retrieval
optical character recognition
part-of-speech tagging
text classification
spelling correction
text segmentation
word prediction
gptkbp:basedOn Markov assumption
gptkbp:canBe bigram model
higher-order n-gram model
trigram model
unigram model
gptkbp:example shallow language model
https://www.w3.org/2000/01/rdf-schema#label N-gram models
gptkbp:introducedIn 1940s
gptkbp:limitation data sparsity
limited context window
gptkbp:popularizedBy gptkb:Claude_Shannon
gptkbp:predicts next item in a sequence
gptkbp:requires large corpora for training
gptkbp:smoothingTechnique Laplace smoothing
Good-Turing smoothing
Kneser-Ney smoothing
gptkbp:usedIn machine translation
natural language processing
speech recognition
text prediction
gptkbp:uses smoothing techniques
probability of n-length sequences
gptkbp:bfsParent gptkb:Language_modeling
gptkbp:bfsLayer 7