SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

CommonProductProductivitySpeechAudio
Developed by the Department of Electronic Engineering, Tsinghua University, and ByteDance, SALMONN is a large language model (LLM) that supports voice, audio events, and music input. Unlike models that only support voice or audio event input, SALMONN can perceive and understand various audio inputs, thereby achieving new capabilities such as multilingual speech recognition and translation, as well as audio-speech co-inference. This can be seen as giving the LLM 'auditory' and cognitive auditory abilities, making SALMONN a step towards artificial general intelligence with auditory capabilities.
Visit

SALMONN Visit Over Time

Monthly Visits

494758773

Bounce Rate

37.69%

Page per Visit

5.7

Visit Duration

00:06:29

SALMONN Visit Trend

SALMONN Visit Geography

SALMONN Traffic Sources

SALMONN Alternatives