Need a specific sound to use in your projects? Audio-LDM model can create any sound from a text prompt with customizable duration and quality levels. Great for filmmakers, video content creators and podcasters!
AudioLDM generates text-conditional sound effects, human speech, and music. It enables zero-shot text-guided audio style-transfer, inpainting, and super-resolution.
Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo Mandic, Wenwu Wang, Mark D. Plumley