Statistical language modeling
GPTKB entity
Statements (58)
Predicate | Object |
---|---|
gptkbp:instanceOf |
natural language processing technique
computational linguistics concept |
gptkbp:basedOn |
gptkb:probability_theory
statistics |
gptkbp:developedBy |
1950s
|
gptkbp:fieldOfStudy |
natural language processing
computational linguistics |
gptkbp:goal |
estimate probability of word sequences
|
gptkbp:hasComponent |
gptkb:dictionary
gptkb:organization feature extraction tokenization context window model parameters training corpus |
gptkbp:hasEvaluationMetric |
cross-entropy
accuracy perplexity BLEU score word error rate |
gptkbp:hasMethod |
interpolation
neural network modeling maximum entropy modeling n-gram modeling Bayesian language models backoff models cache models smoothing techniques topic models |
https://www.w3.org/2000/01/rdf-schema#label |
Statistical language modeling
|
gptkbp:limitation |
computational cost
data sparsity bias in training data context window size out-of-vocabulary words |
gptkbp:notableContributor |
gptkb:Geoffrey_Hinton
gptkb:Yoshua_Bengio gptkb:Frederick_Jelinek gptkb:Christopher_Manning gptkb:Andrei_Markov |
gptkbp:notableFor |
gptkb:OpenAI_GPT
gptkb:IBM_Watson gptkb:BERT gptkb:Google_Ngram_Viewer |
gptkbp:relatedTo |
gptkb:model
Markov chain recurrent neural network n-gram model maximum entropy model neural language model |
gptkbp:usedFor |
information retrieval
machine translation speech recognition optical character recognition text generation spelling correction |
gptkbp:bfsParent |
gptkb:Language_modeling
|
gptkbp:bfsLayer |
7
|