SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

CommonProductProductivitySpeechAudio

Developed by the Department of Electronic Engineering, Tsinghua University, and ByteDance, SALMONN is a large language model (LLM) that supports voice, audio events, and music input. Unlike models that only support voice or audio event input, SALMONN can perceive and understand various audio inputs, thereby achieving new capabilities such as multilingual speech recognition and translation, as well as audio-speech co-inference. This can be seen as giving the LLM 'auditory' and cognitive auditory abilities, making SALMONN a step towards artificial general intelligence with auditory capabilities.

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

SALMONN

SALMONN Visit Over Time

SALMONN Visit Trend

SALMONN Visit Geography

SALMONN Traffic Sources

SALMONN Alternatives

SALMONN — SALMONN: Speech Audio Language Music Open Neural Network

Acoust — Instantly create natural-sounding audio.

Maidio — Maidio is an intelligent application that transforms RSS news content into conversational podcasts using AI.

MaiYou Radio — MaiYou Radio transforms news into a conversational format using AI technology, creating a personalized radio experience.

Hailuo — Your ultimate smart solution AI assistant.

PodRedit — A podcast sharing platform for discovering popular podcasts.

Gardener Teleprompter — Smart AI teleprompter supporting voice read-back and invisible prompting to enhance live streaming experience.

PodSnap.AI — AI-generated podcast summaries ensure you never miss any exciting content.

Read Fast — An intelligent reading tool that enhances the reading experience

GG Rewriter — Leverages artificial intelligence to assist users in writing better and faster.

Journi — Platform to showcase your travels to a global audience

LangAI — Learn multiple languages through AI-powered chat

Butter Reader — Transform blog text into engaging audio.

Best Man Pro — A personalized Best Man speech assistant

ToastwithAI — Wedding Speech AI | Generate Your Wedding Speech with AI

Chat gpt RTL — Enables ChatGPT to handle right-to-left text.

Felo Translator — Voice translation, supports 15 languages

Ad Auris — Read articles anytime, anywhere.

SpeechGPT — Multimodal Language Model

Konch — Fast and accurate automatic transcription service

Youtube AI Subtitle and Web Translator — Trancy provides AI bilingual subtitles for YouTube and Netflix, as well as ChatGPT AI web translation.

FreGrad — Lightweight and fast frequency-aware diffusion audio codec

Tooltips AI — Reading, understanding, super-fast

Heartstring AI — AI-powered platform to help you write heartfelt and engaging speeches.

Summify - Summarize Speech — Easily record and summarize speech content

Unified-IO 2 — A unified multi-modal generation model

Jellypod — Turn your inbox into a personalized daily podcast.

Huddles — From casual conversations to in-depth collaborative meetings, Huddles provides a new lightweight audio or video connection method to connect anytime, anywhere.

Tutur — Enhancing Language Proficiency with AI

Read — Read generates personalized daily news audio briefs for users.