Statistical language modeling

GPTKB entity

Statements (58)
Predicate Object
gptkbp:instanceOf natural language processing technique
computational linguistics concept
gptkbp:basedOn gptkb:probability_theory
statistics
gptkbp:developedBy 1950s
gptkbp:fieldOfStudy natural language processing
computational linguistics
gptkbp:goal estimate probability of word sequences
gptkbp:hasComponent gptkb:dictionary
gptkb:organization
feature extraction
tokenization
context window
model parameters
training corpus
gptkbp:hasEvaluationMetric cross-entropy
accuracy
perplexity
BLEU score
word error rate
gptkbp:hasMethod interpolation
neural network modeling
maximum entropy modeling
n-gram modeling
Bayesian language models
backoff models
cache models
smoothing techniques
topic models
https://www.w3.org/2000/01/rdf-schema#label Statistical language modeling
gptkbp:limitation computational cost
data sparsity
bias in training data
context window size
out-of-vocabulary words
gptkbp:notableContributor gptkb:Geoffrey_Hinton
gptkb:Yoshua_Bengio
gptkb:Frederick_Jelinek
gptkb:Christopher_Manning
gptkb:Andrei_Markov
gptkbp:notableFor gptkb:OpenAI_GPT
gptkb:IBM_Watson
gptkb:BERT
gptkb:Google_Ngram_Viewer
gptkbp:relatedTo gptkb:model
Markov chain
recurrent neural network
n-gram model
maximum entropy model
neural language model
gptkbp:usedFor information retrieval
machine translation
speech recognition
optical character recognition
text generation
spelling correction
gptkbp:bfsParent gptkb:Language_modeling
gptkbp:bfsLayer 7
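
Illustrative sketch (not part of the GPTKB statements): the goal listed above, estimating the probability of word sequences, can be made concrete with a small bigram model. The code below is a minimal example under stated assumptions: it uses n-gram modeling with n = 2, add-one (Laplace) smoothing as one simple instance of the smoothing techniques listed, and perplexity as the evaluation metric. The toy corpus, the `<s>`/`</s>` boundary markers, and all function names are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter

def train_bigram(sentences):
    """Collect context and bigram counts from tokenized sentences."""
    context_counts, bigram_counts = Counter(), Counter()
    vocab = set()
    for sent in sentences:
        # Boundary markers are an assumption of this sketch.
        tokens = ["<s>"] + sent + ["</s>"]
        vocab.update(tokens)
        for prev, cur in zip(tokens, tokens[1:]):
            context_counts[prev] += 1          # times prev appears as a context
            bigram_counts[(prev, cur)] += 1    # times cur follows prev
    return context_counts, bigram_counts, vocab

def bigram_prob(prev, cur, context_counts, bigram_counts, vocab_size):
    """P(cur | prev) with add-one smoothing, so unseen pairs get nonzero mass."""
    return (bigram_counts[(prev, cur)] + 1) / (context_counts[prev] + vocab_size)

def sentence_logprob(sent, context_counts, bigram_counts, vocab_size):
    """Log-probability of a sentence as a sum of log bigram probabilities."""
    tokens = ["<s>"] + sent + ["</s>"]
    return sum(
        math.log(bigram_prob(p, c, context_counts, bigram_counts, vocab_size))
        for p, c in zip(tokens, tokens[1:])
    )

def perplexity(sentences, context_counts, bigram_counts, vocab_size):
    """exp of the average negative log-probability per predicted token."""
    total_logprob, n_predictions = 0.0, 0
    for sent in sentences:
        total_logprob += sentence_logprob(
            sent, context_counts, bigram_counts, vocab_size
        )
        n_predictions += len(sent) + 1  # every word plus the end marker
    return math.exp(-total_logprob / n_predictions)

# Toy training corpus (hypothetical data).
corpus = [["the", "cat", "sat"], ["the", "dog", "sat"]]
context_counts, bigram_counts, vocab = train_bigram(corpus)
V = len(vocab)

seen = sentence_logprob(["the", "cat", "sat"], context_counts, bigram_counts, V)
unseen = sentence_logprob(["sat", "the", "cat"], context_counts, bigram_counts, V)
print("log P(seen order):  ", seen)
print("log P(unseen order):", unseen)
print("training perplexity:", perplexity(corpus, context_counts, bigram_counts, V))
```

Add-one smoothing is used here only because it fits in a few lines; it also shows how the listed limitations of data sparsity and out-of-vocabulary words surface in practice, since any word pair absent from the training corpus would otherwise receive probability zero.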