Stability AI, the developer of the Stable Diffusion text-to-image model, has released Stable Audio Open, an open-source model that generates short audio clips and sound effects from text prompts.
The model can produce drum beats, instrument riffs, ambient sounds, foley recordings, and other audio effects, with clips up to 47 seconds long. It also supports generating variations of audio samples and applying style transfer to them.