Audio generation

Models to generate and modify audio.

audio-ldm

Text-to-audio generation with latent diffusion models.

21k runs

bark

Multi-language Text-To-Speech Audio Model

20.8k runs

musicgen

Generate music from a prompt or melody

18.2k runs

tango

Text to Audio using iNstruction-Guided diffusiOn

53.8k runs

whisper-by-openai

Transcribing audio files into text

43.4k runs

realistic-voice-cloning

Create song covers with any RVC v2 twRAIned AI voice from audio files.

40k runs