Statements (28)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:text-to-speech_model
|
| gptkbp:application |
speech synthesis
|
| gptkbp:author |
Junho Kim, Jaehyeon Kim, Sungwon Kim, Bong-Jin Lee, Sungroh Yoon, and others
|
| gptkbp:basedOn |
gptkb:Variational_Inference
Transformer architecture |
| gptkbp:citation |
2021
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech |
| gptkbp:developedBy |
JUNNYU
|
| gptkbp:firstReleased |
2021
|
| gptkbp:input |
gptkb:text
|
| gptkbp:notableFeature |
fast inference
voice cloning end-to-end TTS high-quality voice synthesis speaker adaptation |
| gptkbp:openSource |
true
|
| gptkbp:output |
audio
|
| gptkbp:relatedTo |
gptkb:Glow-TTS
gptkb:Tacotron gptkb:MelGAN FastSpeech |
| gptkbp:repository |
https://github.com/jaywalnut310/vits
|
| gptkbp:supportsLanguage |
multilingual
|
| gptkbp:usedFor |
voice conversion
singing voice synthesis |
| gptkbp:bfsParent |
gptkb:Hugging_Face_models
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
Vits
|