videodubber

Public

The program for automatic dubbing any video file for a lot of languages.

asr dubbing stt translation video video-processing

Creat：2023-06-04T12:23:01

Update：2025-03-02T21:38:34

Stars

Stars Increase

Related projects

Khoj

agent

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

29233

2个月前

+16today

WhisperX

asr

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

15165

1个月前

NeMo

asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

13716

2年前

Wukong Robot

? wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目，支持ChatGPT多轮对话能力，还可能是首个支持脑机交互的开源智能音箱项目。

6802

1个月前

KrillinAI

Hot

dubbing

A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube，TikTok, and Shorts. 基于AI大模型的视频翻译和配音工具，专业级翻译，一键部署全流程，可以生成适配抖音，小红书，哔哩哔哩，视频号，TikTok，Youtube Shorts等形态的内容

6154

1个月前

+76today

Sherpa Onnx

aarch64

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, support 11 programming languages

5740

1个月前

TEN Agent

Hot

agent

TEN Agent is a conversational voice AI agent powered by TEN, integrating Deepseek, Gemini, OpenAI, RTC, and hardware like ESP32. It enables realtime AI capabilities like seeing, hearing, and speaking, and is fully compatible with platforms like Dify and Coze.

5705

1个月前

+5705today

SenseVoice

Multilingual Voice Understanding Model

5452

1个月前

+1today

Nexa Sdk

asr

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (ASR), and text-to-speech (TTS) capabilities.

4511

1个月前

Wenet

asr

Production First and Production Ready End-to-End Speech Recognition Toolkit

4465

1个月前

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

videodubber

Related projects

Khoj

WhisperX

NeMo

Wukong Robot

KrillinAI

Sherpa Onnx

TEN Agent

SenseVoice

Nexa Sdk

Wenet