SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
Common Product, Productivity, Speech, Audio
Developed by the Department of Electronic Engineering at Tsinghua University and ByteDance, SALMONN is a large language model (LLM) that accepts speech, audio event, and music inputs. Unlike models limited to speech-only or audio-event-only input, SALMONN can perceive and understand many kinds of audio, which gives it emergent capabilities such as multilingual speech recognition and translation, as well as audio-speech co-reasoning. This can be seen as giving the LLM "ears" and cognitive hearing abilities, making SALMONN a step towards hearing-enabled artificial general intelligence.
SALMONN Visits Over Time
Monthly Visits: 488,643,166
Bounce Rate: 37.28%
Pages per Visit: 5.7
Visit Duration: 00:06:37