AudioLM

High-quality audio generation framework

AudioLM is a framework developed by Google Research for high-quality audio generation with long-term consistency. It maps input audio to a sequence of discrete tokens and treats audio generation as a language modeling task in this representation space. Trained on a large corpus of raw audio waveforms, AudioLM learns to generate natural and coherent continuations of a short audio prompt. For speech, it produces syntactically and semantically plausible continuations without any text or annotations, while preserving the speaker's identity and prosody. AudioLM can also generate coherent piano music continuations, even though no symbolic representation of music was used during training.
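
The idea of "tokenize the audio, then model the tokens like text" can be illustrated with a deliberately tiny sketch. Everything below is an assumption made for illustration and is not AudioLM's code or API: uniform quantization stands in for a learned audio tokenizer, a count-based n-gram model stands in for a neural language model, and the names `tokenize`, `detokenize`, `train_lm`, and `continue_tokens` are hypothetical.

```python
import math
import random
from collections import Counter, defaultdict

NUM_TOKENS = 16  # size of the toy token vocabulary (assumed value)


def tokenize(waveform, num_tokens=NUM_TOKENS):
    """Map samples in [-1, 1] to integer token ids by uniform quantization."""
    tokens = []
    for x in waveform:
        x = max(-1.0, min(1.0, x))  # clip out-of-range samples
        tokens.append(min(num_tokens - 1, int((x + 1.0) / 2.0 * num_tokens)))
    return tokens


def detokenize(tokens, num_tokens=NUM_TOKENS):
    """Map each token id back to the centre of its quantization bin."""
    return [(t + 0.5) / num_tokens * 2.0 - 1.0 for t in tokens]


def train_lm(tokens, order=2):
    """Count which token follows each context of `order` previous tokens."""
    counts = defaultdict(Counter)
    for i in range(order, len(tokens)):
        counts[tuple(tokens[i - order:i])][tokens[i]] += 1
    return counts


def continue_tokens(counts, prompt, n_new, order=2, seed=0):
    """Extend `prompt` by sampling one token at a time from the counts."""
    rng = random.Random(seed)
    out = list(prompt)
    for _ in range(n_new):
        options = counts.get(tuple(out[-order:]))
        if not options:
            break  # context never seen in training: stop early
        ids, weights = zip(*options.items())
        out.append(rng.choices(ids, weights=weights)[0])
    return out


# "Training audio": a sine wave standing in for a corpus of raw waveforms.
train_wave = [math.sin(2 * math.pi * i / 40) for i in range(4000)]
train_tokens = tokenize(train_wave)
lm = train_lm(train_tokens)

# Continue a short prompt in token space, then map back to samples.
prompt = train_tokens[:10]
continued = continue_tokens(lm, prompt, n_new=80)
audio = detokenize(continued)
```

The sketch only shows the shape of the pipeline: audio in, tokens, next-token prediction, audio out. A toy model like this cannot capture the long-term structure, speaker identity, or prosody described above.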

AudioLM Visit Over Time

Monthly Visits: 42883
Bounce Rate: 48.04%
Pages per Visit: 1.1
Visit Duration: 00:00:03
