Stable Audio Open Demo
Generate stereo audio from text prompts
Common Product, Music, Audio Generation, Text-to-Audio
Stable Audio Open generates stereo audio up to 47 seconds long from text prompts. It has three main components: an autoencoder that compresses waveforms into a shorter, more manageable latent sequence; a T5-based text encoder that produces embeddings for text conditioning; and a diffusion transformer (DiT) that operates in the autoencoder's latent space. Given a text prompt, it can produce a range of sounds, including percussion, electronic music, and natural soundscapes.
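To make the role of the autoencoder concrete, the following minimal Python sketch estimates how much shorter the sequence becomes once a 47-second stereo clip is compressed. Only the stereo layout and the 47-second limit come from the description above. The 44.1 kHz sample rate, the 2048x downsampling factor, and the 64 latent channels are assumptions used for illustration, and the names `AudioSpec`, `waveform_shape`, and `latent_shape` are hypothetical and not part of any library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AudioSpec:
    sample_rate: int = 44_100  # assumed; not stated in the description
    channels: int = 2          # stereo, from the description
    seconds: float = 47.0      # maximum clip length, from the description


def waveform_shape(spec: AudioSpec) -> tuple[int, int]:
    """Return (channels, samples) for the raw waveform."""
    return (spec.channels, int(spec.sample_rate * spec.seconds))


def latent_shape(spec: AudioSpec, downsample: int = 2048,
                 latent_channels: int = 64) -> tuple[int, int]:
    """Return (latent_channels, frames) after the autoencoder.

    Both downsample and latent_channels are assumed values.
    """
    _, samples = waveform_shape(spec)
    return (latent_channels, samples // downsample)


spec = AudioSpec()
wave = waveform_shape(spec)
latent = latent_shape(spec)

print("waveform (channels, samples):", wave)
print("latent   (channels, frames): ", latent)
print("sequence length reduction:    %dx" % (wave[1] // latent[1]))
```

Under these assumptions, the diffusion model works on a sequence of roughly a thousand latent frames instead of about two million audio samples per channel, which is what makes the latent space practical to operate in.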
Stable Audio Open Demo Visits Over Time
Monthly Visits: 1809
Bounce Rate: 46.33%
Pages per Visit: 1.0
Visit Duration: 00:00:00