Domestic artificial intelligence company Wenwen Xinqun has announced the open sourcing of its latest developed end-side full-modal understanding AI model — Megrez-3B-Omni. This model is the world's first open source project of its kind, marking the company's innovative development in the AI field. At the same time, Wenwen Xinqun also launched a pure language version of the model, Megrez-3B-Instruct, to further enrich its product line. Founded in May 2023, the founding team comes from the Department of Electronic Engineering at Tsinghua University. The company is committed to creating efficiency.
Kimi Technology Co., Ltd. and Tsinghua University's MADSys laboratory have jointly released an open-source project called Mooncake, aimed at collaboratively building a large model inference architecture centered around KVCache. In June 2024, both parties released the design plan for the Mooncake inference system based on Kimi, which significantly enhances inference throughput through PD separation and a storage-computation architecture, garnering widespread attention in the industry.
In an era of rapid development in artificial intelligence, the intelligence level of large models continues to improve, but the challenges of inference system efficiency are becoming increasingly apparent. Addressing high inference loads, reducing inference costs, and shortening response times has become an important issue faced by the industry. Kimi has collaborated with the MADSys laboratory at Tsinghua University to launch the KVCache-based Mooncake inference system design plan, which will be officially released in June 2024. The Mooncake inference system enhances efficiency through its innovative approach.
During the 2024 World Internet Conference in Wuzhen, Alibaba Group CEO Wu Yongming delivered a keynote speech at the Internet Entrepreneurs Forum on November 21, emphasizing the profound impact of artificial intelligence (AI) on the internet industry. He pointed out that the biggest change in the internet industry this year is still the rapid development of AI technology. Wu Yongming stated that the greatest value of AI is not merely the development of one or two super apps on mobile phones, but in driving productivity transformation across various industries. The development of AI requires a prosperous ecosystem of technology, products, and market.