Statements (33)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:speech_synthesis_model
|
| gptkbp:application |
text-to-speech
|
| gptkbp:architecture |
non-autoregressive
|
| gptkbp:author |
gptkb:Xu_Tan
gptkb:Tie-Yan_Liu Sheng Zhao Tao Qin Yangjun Ruan Yi Ren Zhou Zhao |
| gptkbp:basedOn |
FastSpeech
|
| gptkbp:citation |
high (over 1000)
|
| gptkbp:developedBy |
gptkb:Microsoft_Research_Asia
|
| gptkbp:improves |
FastSpeech
|
| gptkbp:input |
phoneme sequence
|
| gptkbp:introducedIn |
2020
|
| gptkbp:language |
English
|
| gptkbp:notableFeature |
fast inference speed
explicit prosody modeling high-quality speech synthesis parallel generation |
| gptkbp:notablePublication |
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
|
| gptkbp:openSource |
yes
|
| gptkbp:output |
mel-spectrogram
|
| gptkbp:predicts |
gptkb:energy
pitch duration |
| gptkbp:publicationDate |
gptkb:AAAI_2021
|
| gptkbp:url |
https://arxiv.org/abs/2006.04558
|
| gptkbp:usedFor |
neural TTS
|
| gptkbp:bfsParent |
gptkb:Hugging_Face_models
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
FastSpeech2
|