VITS

GPTKB entity

Statements (28)
Predicate Object
gptkbp:instanceOf text-to-speech model
gptkbp:application speech synthesis
gptkbp:author Jaehyeon Kim, Jungil Kong, Juhee Son
gptkbp:basedOn gptkb:Variational_Inference
Transformer architecture
gptkbp:citation 2021
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
gptkbp:developedBy Kakao Enterprise
gptkbp:firstReleased 2021
https://www.w3.org/2000/01/rdf-schema#label VITS
gptkbp:input gptkb:text
gptkbp:notableFeature fast inference
voice cloning
end-to-end TTS
high-quality voice synthesis
speaker adaptation
gptkbp:openSource true
gptkbp:output audio
gptkbp:relatedTo gptkb:Glow-TTS
gptkb:Tacotron
gptkb:MelGAN
FastSpeech
gptkbp:repository https://github.com/jaywalnut310/vits
gptkbp:supportsLanguage multilingual
gptkbp:usedFor voice conversion
singing voice synthesis
gptkbp:bfsParent gptkb:Hugging_Face_models
gptkbp:bfsLayer 7