As artificial intelligence develops rapidly, large models keep getting more capable, but the efficiency challenges of inference systems have become increasingly apparent. Handling heavy inference loads, reducing inference costs, and shortening response times have become pressing issues for the entire industry.


Kimi, in collaboration with the MADSys laboratory at Tsinghua University, developed Mooncake, a KVCache-centric inference system design that was officially released in June 2024.

The Mooncake inference system significantly improves inference throughput through an innovative prefill/decode (PD) disaggregation architecture and a KVCache-centric design, attracting widespread industry attention. To further promote the adoption of this framework, Kimi, together with the MADSys laboratory at Tsinghua University and several partners including 9#AISoft, Alibaba Cloud, and Huawei Storage, has launched the open-source Mooncake project. On November 28, the Mooncake technical framework was officially published on GitHub.
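To give a feel for what PD disaggregation means, here is a minimal, purely illustrative Python sketch: the prompt is prefilled once on a compute-heavy worker, the resulting KV cache is handed off through a shared pool, and a separate decode worker generates tokens against it. All names here are hypothetical stand-ins for explanation only; this is not Mooncake's actual API.

```python
# Illustrative sketch of prefill/decode (PD) separation with a shared
# KVCache pool. Hypothetical names; not Mooncake's actual interfaces.

from dataclasses import dataclass, field

@dataclass
class KVCachePool:
    """A shared pool mapping request IDs to prefilled KV blocks."""
    blocks: dict = field(default_factory=dict)

    def put(self, request_id: str, kv_blocks: list) -> None:
        self.blocks[request_id] = kv_blocks

    def get(self, request_id: str) -> list:
        return self.blocks.pop(request_id)

class PrefillWorker:
    """Compute-heavy stage: processes the full prompt once and
    writes the resulting KV cache into the shared pool."""
    def __init__(self, pool: KVCachePool):
        self.pool = pool

    def prefill(self, request_id: str, prompt_tokens: list) -> None:
        # Stand-in for the real attention prefill pass.
        kv_blocks = [f"kv({tok})" for tok in prompt_tokens]
        self.pool.put(request_id, kv_blocks)

class DecodeWorker:
    """Memory-bound stage: fetches the prefilled KV cache and
    generates tokens one at a time, reusing it on every step."""
    def __init__(self, pool: KVCachePool):
        self.pool = pool

    def decode(self, request_id: str, max_new_tokens: int) -> list:
        kv_blocks = self.pool.get(request_id)
        out = []
        for step in range(max_new_tokens):
            # Stand-in for one decode step over the cached KV blocks.
            token = f"tok{step}<ctx={len(kv_blocks)}>"
            kv_blocks.append(f"kv({token})")
            out.append(token)
        return out

pool = KVCachePool()
PrefillWorker(pool).prefill("req-1", ["Hello", ",", "world"])
print(DecodeWorker(pool).decode("req-1", max_new_tokens=3))
```

The point of the separation is that the two stages have very different resource profiles, so running them on different machines (with the KV cache transferred between them) lets each side be scaled and scheduled independently.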

The Mooncake open-source project is built around a large-scale KVCache pool and plans to open-source Mooncake Store, its high-performance multi-level KVCache cache, in phases. The project will also provide compatibility with various inference engines and with underlying storage and transfer resources.
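The general idea behind a multi-level KVCache cache can be sketched as a fast tier backed by a larger, slower capacity tier, with eviction and promotion between them. The toy two-tier cache below is purely illustrative and is not Mooncake Store's actual design or API.

```python
# Toy two-tier KV cache illustrating the general multi-level idea
# (small fast tier + large capacity tier). Not Mooncake Store's design.

from collections import OrderedDict

class TwoTierCache:
    def __init__(self, fast_capacity: int):
        self.fast = OrderedDict()   # e.g., GPU/DRAM tier, few slots
        self.slow = {}              # e.g., SSD/remote tier, large capacity
        self.fast_capacity = fast_capacity

    def put(self, key: str, value: bytes) -> None:
        self.fast[key] = value
        self.fast.move_to_end(key)
        # Spill least-recently-used entries from the fast tier into
        # the capacity tier instead of dropping them.
        while len(self.fast) > self.fast_capacity:
            old_key, old_value = self.fast.popitem(last=False)
            self.slow[old_key] = old_value

    def get(self, key: str):
        if key in self.fast:
            self.fast.move_to_end(key)    # refresh LRU position
            return self.fast[key]
        if key in self.slow:
            # Promote back to the fast tier on a slow-tier hit.
            value = self.slow.pop(key)
            self.put(key, value)
            return value
        return None                        # miss: prefill must be recomputed

cache = TwoTierCache(fast_capacity=2)
cache.put("prompt-a", b"kv-a")
cache.put("prompt-b", b"kv-b")
cache.put("prompt-c", b"kv-c")            # "prompt-a" spills to the slow tier
assert cache.get("prompt-a") == b"kv-a"   # slow-tier hit, promoted back
```

A cache hit at any tier means the expensive prefill computation for that prompt can be skipped, which is exactly why pooling and tiering KVCache pays off under high load.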

Currently, part of the Transfer Engine has already been open-sourced on GitHub. The ultimate goal of the Mooncake project is to establish a new standard interface for high-performance, memory-semantic storage in the era of large models, along with reference implementations.

Xu Xinran, Vice President of Engineering at Kimi, said: "Through close cooperation with the MADSys laboratory at Tsinghua University, we have jointly created Mooncake, a disaggregated large-model inference architecture that deeply optimizes the use of inference resources. Mooncake not only improves the user experience but also reduces costs, offering an effective solution for handling long texts and high-concurrency workloads." He hopes more companies and research institutions will join the Mooncake project to explore more efficient model inference system architectures, so that products built on large-model technology, such as AI assistants, can benefit a wider audience.

Project link: https://github.com/kvcache-ai/Mooncake

Key points:  

🌟 Kimi and Tsinghua University jointly released the Mooncake inference system to enhance AI inference efficiency.  

🔧 The Mooncake project has been open-sourced on GitHub, aiming to build a standard interface for high-performance, memory-semantic storage.  

🤝 We look forward to more companies and research institutions participating to jointly promote the advancement of AI technology.