Automatic Speech Recognition

GPTKB entity

Statements (60)
Predicate Object
gptkbp:instanceOf gptkb:technology
gptkbp:alsoKnownAs gptkb:ASR
gptkbp:application language learning
voice search
voice assistants
accessibility tools
dictation software
call center analytics
transcription services
gptkbp:assesses real-time factor
sentence error rate
word error rate
gptkbp:challenge domain adaptation
background noise
homophones
accents
speaker variability
gptkbp:developedBy 1950s
gptkbp:field natural language processing
speech processing
https://www.w3.org/2000/01/rdf-schema#label Automatic Speech Recognition
gptkbp:input audio signal
gptkbp:notableFor gptkb:Dragon_NaturallySpeaking
gptkb:Amazon_Alexa
gptkb:Apple_Siri
gptkb:Microsoft_Cortana
gptkb:Baidu_Deep_Speech
gptkb:Google_Speech-to-Text
gptkb:IBM_Shoebox
gptkb:OpenAI_Whisper
gptkbp:output gptkb:text
gptkbp:purpose convert spoken language to text
gptkbp:relatedTo speech synthesis
audio signal processing
language identification
speaker recognition
voice activity detection
gptkbp:standardDataset gptkb:Common_Voice
gptkb:LibriSpeech
gptkb:Switchboard
gptkb:TED-LIUM
gptkb:TIMIT
gptkb:WSJ_Corpus
gptkbp:supportsLanguage gptkb:French
gptkb:German
gptkb:Mandarin
gptkb:Spanish
English
Japanese
many other languages
gptkbp:uses gptkb:machine_learning
gptkb:hidden_Markov_models
deep learning
neural networks
language models
acoustic models
phonetic models
gptkbp:bfsParent gptkb:Speech_Recognition
gptkb:ASR
gptkbp:bfsLayer 6