FastSpeech2

URI: https://gptkb.org/entity/FastSpeech2

GPTKB entity

Predicate	Object
gptkbp:instanceOf	gptkb:speech_synthesis_model
gptkbp:application	text-to-speech
gptkbp:architecture	non-autoregressive
gptkbp:author	gptkb:Xu_Tan gptkb:Tie-Yan_Liu Sheng Zhao Tao Qin Yangjun Ruan Yi Ren Zhou Zhao
gptkbp:basedOn	FastSpeech
gptkbp:citation	high (over 1000)
gptkbp:developedBy	gptkb:Microsoft_Research_Asia
gptkbp:improves	FastSpeech
gptkbp:input	phoneme sequence
gptkbp:introducedIn	2020
gptkbp:language	English
gptkbp:notableFeature	fast inference speed explicit prosody modeling high-quality speech synthesis parallel generation
gptkbp:notablePublication	FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
gptkbp:openSource	yes
gptkbp:output	mel-spectrogram
gptkbp:predicts	gptkb:energy pitch duration
gptkbp:publicationDate	gptkb:AAAI_2021
gptkbp:url	https://arxiv.org/abs/2006.04558
gptkbp:usedFor	neural TTS
gptkbp:bfsParent	gptkb:Hugging_Face_models
gptkbp:bfsLayer	7
http://www.w3.org/2000/01/rdf-schema#label	FastSpeech2