Statistical language modeling
GPTKB entity
Statements (58)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:computational_linguistics_concept
gptkb:natural_language_processing_technique |
| gptkbp:basedOn |
gptkb:probability_theory
statistics |
| gptkbp:developedBy |
1950s
|
| gptkbp:fieldOfStudy |
natural language processing
computational linguistics |
| gptkbp:goal |
estimate probability of word sequences
|
| gptkbp:hasComponent |
gptkb:dictionary
gptkb:organization feature extraction tokenization context window model parameters training corpus |
| gptkbp:hasEvaluationMetric |
cross-entropy
accuracy perplexity BLEU score word error rate |
| gptkbp:hasMethod |
interpolation
neural network modeling maximum entropy modeling n-gram modeling Bayesian language models backoff models cache models smoothing techniques topic models |
| gptkbp:limitation |
computational cost
data sparsity bias in training data context window size out-of-vocabulary words |
| gptkbp:notableContributor |
gptkb:Geoffrey_Hinton
gptkb:Yoshua_Bengio gptkb:Frederick_Jelinek gptkb:Christopher_Manning gptkb:Andrei_Markov |
| gptkbp:notableFor |
gptkb:OpenAI_GPT
gptkb:IBM_Watson gptkb:BERT gptkb:Google_Ngram_Viewer |
| gptkbp:relatedTo |
gptkb:model
gptkb:Markov_chain gptkb:recurrent_neural_network n-gram model maximum entropy model neural language model |
| gptkbp:usedFor |
information retrieval
machine translation speech recognition optical character recognition text generation spelling correction |
| gptkbp:bfsParent |
gptkb:Language_modeling
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
Statistical language modeling
|