LLaMA-Omni

A low-latency, high-quality end-to-end speech interaction model

Common Product, Chatting, Speech Interaction, End-to-End Model
LLaMA-Omni is a low-latency, high-quality end-to-end speech interaction model built on Llama-3.1-8B-Instruct, aiming for speech capabilities comparable to GPT-4o. It generates text and speech responses simultaneously from spoken input. Training took less than 3 days on only 4 GPUs.

LLaMA-Omni Visit Over Time

Monthly Visits: 494,758,773
Bounce Rate: 37.69%
Pages per Visit: 5.7
Visit Duration: 00:06:29
