wenet

Public

Production First and Production Ready End-to-End Speech Recognition Toolkit

asr automatic-speech-recognition conformer e2e-models production-ready pytorch speech-recognition transformer whisper

Created: 2020-11-17T11:57:23

Updated: 2025-03-26T11:50:11

https://wenet-e2e.github.io/wenet/

4.5K

Stars


Related projects

WhisperX

asr

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

15152

4 weeks ago

+31 today

NeMo

asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

13704

2 years ago

+11 today

FunASR

audio-visual-speech-recognition

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

9970

4 weeks ago

+29 today

Wukong Robot

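wukong-robot is a simple, flexible, and elegant Chinese voice-dialogue robot and smart speaker project. It supports multi-turn ChatGPT conversations and may be the first open-source smart speaker project to support brain-computer interaction.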

6800

4 weeks ago

+4 today

Sherpa Onnx

aarch64

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, support 11 programming languages

5722

4 weeks ago

+11 today

SenseVoice

Multilingual Voice Understanding Model

5438

4 weeks ago

+18 today

Nexa Sdk

asr

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.

4507

4 weeks ago

Whisper Diarization

asr

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

4424

4 weeks ago

+10 today

Streamer Sales

asr

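Streamer-Sales (销冠, "Sales Champion"): a sales-streamer LLM that, given a product's features, presents the product in a way designed to stimulate the user's desire to buy. It includes a detailed data generation workflow and also integrates LMDeploy accelerated inference, RAG (retrieval-augmented generation), TTS (text-to-speech), digital human generation, an Agent that queries the web for real-time information, ASR (speech-to-text), a Vue-based front end, a FastAPI back end, and Docker-compose packaging and deployment.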

3179

4 weeks ago

+1 today

Lingvo

asr

Lingvo

2837

1 month ago
