Statistical language modeling

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:computational_linguistics_concept gptkb:natural_language_processing_technique
gptkbp:basedOn	gptkb:probability_theory statistics
gptkbp:developedBy	1950s
gptkbp:fieldOfStudy	natural language processing computational linguistics
gptkbp:goal	estimate probability of word sequences
gptkbp:hasComponent	gptkb:dictionary gptkb:organization feature extraction tokenization context window model parameters training corpus
gptkbp:hasEvaluationMetric	cross-entropy accuracy perplexity BLEU score word error rate
gptkbp:hasMethod	interpolation neural network modeling maximum entropy modeling n-gram modeling Bayesian language models backoff models cache models smoothing techniques topic models
gptkbp:limitation	computational cost data sparsity bias in training data context window size out-of-vocabulary words
gptkbp:notableContributor	gptkb:Geoffrey_Hinton gptkb:Yoshua_Bengio gptkb:Frederick_Jelinek gptkb:Christopher_Manning gptkb:Andrei_Markov
gptkbp:notableFor	gptkb:OpenAI_GPT gptkb:IBM_Watson gptkb:BERT gptkb:Google_Ngram_Viewer
gptkbp:relatedTo	gptkb:model gptkb:Markov_chain gptkb:recurrent_neural_network n-gram model maximum entropy model neural language model
gptkbp:usedFor	information retrieval machine translation speech recognition optical character recognition text generation spelling correction
gptkbp:bfsParent	gptkb:Language_modeling
gptkbp:bfsLayer	7
http://www.w3.org/2000/01/rdf-schema#label	Statistical language modeling