Statements (27)
Predicate | Object |
---|---|
gptkbp:instanceOf |
text-to-audio generation model
|
gptkbp:application |
gptkb:music
audio generation text-to-audio synthesis |
gptkbp:arXivID |
2301.12503
|
gptkbp:basedOn |
gptkb:Latent_Diffusion_Model
|
gptkbp:developer |
gptkb:Yuan_Gong
gptkb:Chenglin_Xu gptkb:Mark_D._Plumbley gptkb:Qiuqiang_Kong gptkb:Wenwu_Wang Yuxuan Wang |
https://www.w3.org/2000/01/rdf-schema#label |
AudioLDM
|
gptkbp:input |
text prompt
|
gptkbp:language |
gptkb:Python
|
gptkbp:license |
gptkb:MIT_License
|
gptkbp:notablePublication |
gptkb:AudioLDM:_Text-to-Audio_Generation_with_Latent_Diffusion_Models
|
gptkbp:output |
audio waveform
|
gptkbp:relatedTo |
gptkb:Stable_Diffusion
text-to-image generation |
gptkbp:releaseYear |
2023
|
gptkbp:repository |
https://github.com/haoheliu/AudioLDM
|
gptkbp:uses |
gptkb:Diffusion_Model
gptkb:VAE_(Variational_Autoencoder) pre-trained CLAP model |
gptkbp:bfsParent |
gptkb:Diffusers
|
gptkbp:bfsLayer |
6
|