Shenzhen Yuanxiang Information Technology Co., Ltd. (XVERSE) recently announced the release of XVERSE-MoE-A36B, which it describes as the largest open-source Mixture-of-Experts (MoE) large model in China. The release marks a significant advance for China's AI field, bringing domestic open-source technology to a globally leading level.
XVERSE-MoE-A36B has 255 billion total parameters, of which 36 billion are activated per token, yet it delivers performance rivaling models with over 100 billion parameters, a cross-tier leap in capability. The model cuts training time by 30% and doubles inference performance, significantly lowering the cost per token and making low-cost deployment of AI applications feasible.
Yuanxiang XVERSE's "High-Performance Toolkit" series of models is fully open-source and unconditionally free for commercial use, giving small and medium-sized enterprises, researchers, and developers more options. The MoE architecture combines multiple expert models specialized in different domains, breaking through the limits of traditional scaling laws: it expands model scale and maximizes performance while keeping training and inference compute costs down.
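To illustrate the idea behind sparse activation, here is a minimal, generic sketch of an MoE layer in PyTorch. It is not XVERSE's actual implementation; the expert count, top-k routing, and dimensions are illustrative assumptions. A router selects a small number of experts per token, so only a fraction of the total parameters (the "active" parameters) is used for any given input.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class MoELayer(nn.Module):
    """Generic sparse Mixture-of-Experts layer (illustrative, not XVERSE's code)."""

    def __init__(self, hidden_dim: int, ffn_dim: int, num_experts: int = 8, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        # Each expert is an ordinary feed-forward block.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(hidden_dim, ffn_dim), nn.GELU(), nn.Linear(ffn_dim, hidden_dim))
            for _ in range(num_experts)
        )
        # The router scores every expert for every token.
        self.router = nn.Linear(hidden_dim, num_experts)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (num_tokens, hidden_dim)
        scores = F.softmax(self.router(x), dim=-1)           # (tokens, experts)
        weights, indices = scores.topk(self.top_k, dim=-1)   # keep only the top-k experts per token
        weights = weights / weights.sum(dim=-1, keepdim=True)

        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = indices == e                               # which tokens routed to expert e
            if mask.any():
                token_ids, slot = mask.nonzero(as_tuple=True)
                out[token_ids] += weights[token_ids, slot].unsqueeze(-1) * expert(x[token_ids])
        return out


# Toy usage: 8 experts exist, but each token only activates 2 of them.
layer = MoELayer(hidden_dim=64, ffn_dim=256)
tokens = torch.randn(10, 64)
print(layer(tokens).shape)  # torch.Size([10, 64])
```

This is the mechanism behind the "255 billion total, 36 billion active" figures above: total parameter count grows with the number of experts, while per-token compute is bounded by the few experts the router activates.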
In several authoritative evaluations, Yuanxiang's MoE significantly outperforms comparable models, including the domestic hundred-billion-parameter MoE model Skywork-MoE, the established MoE leader Mixtral-8x22B, and Grok-1-A86B, an open-source MoE model with 314 billion parameters.
Free Download of Large Models
Hugging Face: https://huggingface.co/xverse/XVERSE-MoE-A36B
ModelScope: https://modelscope.cn/models/xverse/XVERSE-MoE-A36B
Inquiry: opensource@xverse.cn
Official Website: chat.xverse.cn
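For readers who want to try the model from the Hugging Face repository listed above, the following is a minimal loading sketch using the transformers library. The exact arguments (precision, device mapping, trust_remote_code) are assumptions based on common practice for large open-source models; consult the model card at the URL above for authoritative instructions and hardware requirements.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "xverse/XVERSE-MoE-A36B"  # repository listed above

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",      # let transformers choose an appropriate precision
    device_map="auto",       # shard across available GPUs; the model is very large
    trust_remote_code=True,  # assumption: the repo may ship custom modeling code
)

inputs = tokenizer("Introduce the Mixture-of-Experts architecture.", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```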