LSLM

An AI conversational system for real-time voice interaction.

Common Product, Chatting, Artificial Intelligence, Speech Recognition
The Listening-while-Speaking Language Model (LSLM) is a conversational AI model designed to make human-computer interaction more natural. It uses full duplex modeling (FDM) so that it can listen while it speaks, which improves real-time interactivity: when the generated content is unsatisfactory, the user can interrupt and the model can respond immediately. LSLM uses a token-based decoder for speech generation (TTS) and a streaming self-supervised learning (SSL) encoder for real-time audio input. It explores three strategies for fusing the listening and speaking channels (early fusion, middle fusion, and late fusion) to find the best balance between the two.
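The three fusion strategies differ in where the listening channel meets the speaking channel. The sketch below is a toy illustration only, not LSLM's implementation: it assumes early fusion merges the channels at the input, middle fusion merges them before every decoder block, and late fusion merges them at the output. The names `toy_block` and `decode`, the use of plain addition as the merge, and the two-element vectors are all hypothetical stand-ins chosen to keep the example small.

```python
from typing import List

Vector = List[float]


def add(a: Vector, b: Vector) -> Vector:
    """Elementwise sum, used here as a stand-in for merging two channels."""
    return [x + y for x, y in zip(a, b)]


def toy_block(h: Vector) -> Vector:
    """Hypothetical stand-in for one decoder block (a fixed elementwise map)."""
    return [0.5 * x + 1.0 for x in h]


def decode(speak: Vector, listen: Vector, strategy: str, n_blocks: int = 2) -> Vector:
    """Run the toy decoder, merging the listening channel at the chosen point."""
    if strategy == "early":
        # Assumed: merge once at the input, then run all blocks.
        h = add(speak, listen)
        for _ in range(n_blocks):
            h = toy_block(h)
        return h
    if strategy == "middle":
        # Assumed: merge the listening channel before every block.
        h = speak
        for _ in range(n_blocks):
            h = toy_block(add(h, listen))
        return h
    if strategy == "late":
        # Assumed: run all blocks on the speaking channel, merge at the output.
        h = speak
        for _ in range(n_blocks):
            h = toy_block(h)
        return add(h, listen)
    raise ValueError(f"unknown fusion strategy: {strategy!r}")


speak = [2.0, 4.0]   # made-up speaking-channel features
listen = [1.0, 0.0]  # made-up listening-channel features
for name in ("early", "middle", "late"):
    print(name, decode(speak, listen, name))
```

With the same inputs, the three strategies give different outputs because the listening signal passes through a different number of blocks in each case. In a real model the merge and the blocks would be learned, so this only shows the structural difference.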

LSLM Visit Over Time

Monthly Visits: 1445
Bounce Rate: 21.26%
Pages per Visit: 1.7
Visit Duration: 00:00:01
