Statements (27)
| Predicate | Object |
|---|---|
| gptkbp:instanceOf |
gptkb:text-to-audio_generation_model
|
| gptkbp:application |
gptkb:music
audio generation text-to-audio synthesis |
| gptkbp:arXivID |
2301.12503
|
| gptkbp:basedOn |
gptkb:Latent_Diffusion_Model
|
| gptkbp:developer |
gptkb:Yuan_Gong
gptkb:Chenglin_Xu gptkb:Mark_D._Plumbley gptkb:Qiuqiang_Kong gptkb:Wenwu_Wang Yuxuan Wang |
| gptkbp:input |
text prompt
|
| gptkbp:language |
gptkb:Python
|
| gptkbp:license |
gptkb:MIT_License
|
| gptkbp:notablePublication |
gptkb:AudioLDM:_Text-to-Audio_Generation_with_Latent_Diffusion_Models
|
| gptkbp:output |
audio waveform
|
| gptkbp:relatedTo |
gptkb:Stable_Diffusion
text-to-image generation |
| gptkbp:releaseYear |
2023
|
| gptkbp:repository |
https://github.com/haoheliu/AudioLDM
|
| gptkbp:uses |
gptkb:Diffusion_Model
gptkb:VAE_(Variational_Autoencoder) pre-trained CLAP model |
| gptkbp:bfsParent |
gptkb:Diffusers
|
| gptkbp:bfsLayer |
7
|
| https://www.w3.org/2000/01/rdf-schema#label |
AudioLDM
|