Stable Audio Open Demo

Generate stereo audio from text prompts

Tags: Common Product, Music, Audio Generation, Text-to-Audio
Stable Audio Open is a text-to-audio model that generates stereo audio up to 47 seconds long from text prompts. It has three main components: an autoencoder that compresses waveforms into a shorter latent sequence, a T5-based text encoder whose embeddings condition the generation, and a diffusion transformer (DiT) that operates in the autoencoder's latent space. Given a text prompt, it can produce a range of sounds, including percussion, electronic music, and natural soundscapes.
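To give a rough sense of why the autoencoder matters, the sketch below estimates how many latent frames the diffusion model would have to process for a 47-second clip. The 44.1 kHz sample rate and the 2048x downsampling ratio are assumptions made for illustration and are not stated in the description above; the function names are hypothetical and are not part of any Stable Audio library.

```python
import math

# Assumed values for illustration only; the description above does not state them.
SAMPLE_RATE_HZ = 44_100       # assumed audio sample rate
DOWNSAMPLING_RATIO = 2_048    # assumed autoencoder compression factor along time
MAX_SECONDS = 47              # maximum clip length, taken from the description


def waveform_samples(seconds: float, sample_rate: int = SAMPLE_RATE_HZ) -> int:
    """Number of audio samples per channel for a clip of the given length."""
    return int(round(seconds * sample_rate))


def latent_length(seconds: float,
                  sample_rate: int = SAMPLE_RATE_HZ,
                  ratio: int = DOWNSAMPLING_RATIO) -> int:
    """Number of latent frames after the autoencoder compresses the waveform."""
    return math.ceil(waveform_samples(seconds, sample_rate) / ratio)


if __name__ == "__main__":
    samples = waveform_samples(MAX_SECONDS)
    frames = latent_length(MAX_SECONDS)
    print(f"{MAX_SECONDS} s of audio: {samples} samples per channel")
    print(f"Latent sequence length under the assumed ratio: {frames} frames")
```

Under these assumptions the diffusion model works on a sequence about two thousand times shorter than the raw waveform, which is what makes running a transformer over the whole clip practical.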

Stable Audio Open Demo Visit Over Time

Monthly Visits: 894
Bounce Rate: 45.28%
Pages per Visit: 1.0
Visit Duration: 00:00:00

Stable Audio Open Demo Alternatives