Meta has open-sourced SeamlessM4T, which it describes as the first all-in-one multimodal, multilingual AI translation model; it supports nearly 100 languages and can recognize local dialects. The model handles translation tasks across modalities, including speech-to-text, speech-to-speech, text-to-speech, and text-to-text, as well as automatic speech recognition. SeamlessM4T builds on Meta's earlier translation models, such as NLLB (No Language Left Behind) and MMS (Massively Multilingual Speech), and was trained on a large corpus of aligned speech and text. It achieves state-of-the-art results across these translation tasks and shows strong robustness to background noise and speaker variation. It also markedly improves translation quality for low-resource languages.
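
For readers who want to try the released model, below is a minimal sketch using the Hugging Face transformers integration (an assumption on my part: it presumes transformers >= 4.35 and the facebook/hf-seamless-m4t-medium checkpoint; the example sentence and language codes are illustrative, not from the announcement):

```python
# Minimal sketch: text-to-text and text-to-speech translation with SeamlessM4T
# via the Hugging Face transformers integration. Assumes transformers >= 4.35
# and the facebook/hf-seamless-m4t-medium checkpoint; the sentence and
# language codes are illustrative.
from transformers import AutoProcessor, SeamlessM4TModel

processor = AutoProcessor.from_pretrained("facebook/hf-seamless-m4t-medium")
model = SeamlessM4TModel.from_pretrained("facebook/hf-seamless-m4t-medium")

# Tokenize an English source sentence.
text_inputs = processor(text="Hello, how are you?", src_lang="eng",
                        return_tensors="pt")

# Text-to-text: generate_speech=False makes generate() return token ids.
output_tokens = model.generate(**text_inputs, tgt_lang="fra",
                               generate_speech=False)
translated_text = processor.decode(output_tokens[0].tolist()[0],
                                   skip_special_tokens=True)
print(translated_text)

# Text-to-speech: the same call without generate_speech=False returns a
# waveform (a 16 kHz audio array) in the target language.
audio = model.generate(**text_inputs, tgt_lang="fra")[0].cpu().numpy().squeeze()
```

The same generate() interface covers the speech-input tasks as well: passing processed audio instead of text to the processor yields speech-to-text or speech-to-speech translation, which is what makes the single model "all-in-one".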