Alibaba's DAMO Academy has open-sourced a multilingual large language model, Babel, aiming to bridge the language gap and enable AI to communicate in languages spoken by over 90% of the global population.

Many current large language models favor resource-rich languages such as English, French, and German. Much as speakers of less-represented languages are often overlooked at international conferences, languages with vast user bases, including Hindi, Bengali, and Urdu, are frequently neglected in AI development.

Alibaba's Babel aims to change this. It supports the 25 most widely spoken languages, covering over 90% of the world's population, and notably includes Swahili, Javanese, Burmese, and other languages rarely explored in open-source LLMs. This coverage could bring more accessible, higher-quality AI language services to billions of people.

Unlike conventional continued pre-training, Babel uses a layer-expansion technique to grow its capacity: additional transformer layers are inserted into the pretrained stack, giving the model room to absorb new languages while preserving the original weights and keeping computational costs manageable. The research team has released two models: Babel-9B, optimized for efficient single-GPU inference and fine-tuning, and Babel-83B, an 83-billion-parameter model intended to set a new benchmark for open-source multilingual LLMs.
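To make the idea concrete, here is a minimal PyTorch sketch of one common layer-expansion recipe: duplicating existing transformer blocks and interleaving the copies into the pretrained stack, then relying on continued multilingual pre-training to specialize the new capacity. The function name and insertion interval are illustrative assumptions, not Babel's exact procedure.

```python
import copy
import torch.nn as nn

def expand_layers(layers: nn.ModuleList, insert_every: int = 4) -> nn.ModuleList:
    """Interleave duplicated blocks into a pretrained transformer stack.

    Hypothetical sketch: after every `insert_every` blocks, a copy of the
    preceding block is inserted. The copies add trainable capacity that
    continued pre-training on multilingual data can specialize, while the
    original weights are kept as the starting point.
    """
    expanded = []
    for i, block in enumerate(layers):
        expanded.append(block)
        if (i + 1) % insert_every == 0:
            expanded.append(copy.deepcopy(block))  # newly inserted block
    return nn.ModuleList(expanded)

# Toy example: an 8-layer stack grows to 10 layers (one insert per 4 blocks).
# A real model would expand attention/MLP blocks, not these stand-in layers.
toy_stack = nn.ModuleList(nn.Linear(16, 16) for _ in range(8))
print(len(expand_layers(toy_stack)))  # -> 10
```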

To validate Babel's capabilities, the research team conducted evaluations across multiple multilingual tasks. The results are impressive: both Babel-9B and Babel-83B outperformed other open-source models of comparable size on benchmarks spanning world knowledge (MMMLU, M3Exam), reasoning (MGSM, XCOPA), understanding (XNLI), and translation (Flores-200).

Particularly noteworthy is that Babel achieved a 5% to 10% accuracy improvement over previous multilingual LLMs on low-resource languages, showing that it raises quality in underserved languages rather than merely expanding nominal coverage.

Even more exciting, after supervised fine-tuning (SFT) on a dataset of over one million conversations, Babel's chat versions, Babel-9B-Chat and Babel-83B-Chat, demonstrated strong conversational abilities. Their performance is comparable to some top commercial AI models, with Babel-83B-Chat even rivaling GPT-4 on certain tasks. This result energizes the open-source community, showing that open models can reach leading multilingual performance.
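For readers who want to try the chat models, the sketch below loads Babel-9B-Chat with the Hugging Face transformers library and asks it a question in Swahili. The Hub model ID is an assumption based on the project's release; confirm the exact name and recommended generation settings on the GitHub page linked below.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed Hub ID; check the project repository for the current model name.
model_id = "Tower-Babel/Babel-9B-Chat"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Build a chat prompt using the model's own chat template.
# Prompt means "Write a short greeting in Swahili."
messages = [{"role": "user", "content": "Andika salamu fupi kwa Kiswahili."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=128)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```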

Project: https://babel-llm.github.io/babel-llm/

GitHub: https://github.com/babel-llm/babel-llm