On the first day of its open-source week, DeepSeek officially released FlashMLA, an efficient Multi-head Latent Attention (MLA) decoding kernel designed specifically for NVIDIA's Hopper-architecture GPUs. The kernel is optimized for variable-length sequence serving and significantly improves the inference performance of large models.


Key technical features of FlashMLA include full BF16 support and a paged key-value (KV) cache with a block size of 64, which allows cache memory to be managed at a finer granularity. In terms of performance, on CUDA 12.6 FlashMLA achieves impressive results on the H800 SXM5 GPU: up to 3000 GB/s of memory bandwidth in memory-bound configurations and up to 580 TFLOPS of compute throughput in compute-bound configurations.
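For readers unfamiliar with paged KV caches, the idea mirrors virtual memory: the cache is split into fixed-size physical blocks, and a per-sequence block table maps logical token positions to those blocks, so variable-length sequences can grow without reserving large contiguous buffers. The PyTorch sketch below illustrates the concept only; the shapes, names, and `gather_kv` helper are hypothetical and are not FlashMLA's actual API.

```python
import torch

# Illustration of the paged-KV-cache idea: the cache is stored as
# fixed-size blocks (block size 64, matching FlashMLA's announced value),
# and a per-sequence block table maps logical positions to physical blocks.
BLOCK_SIZE = 64

def gather_kv(kv_cache: torch.Tensor,     # [num_blocks, BLOCK_SIZE, head_dim]
              block_table: torch.Tensor,  # [blocks_per_seq] physical block ids
              seq_len: int) -> torch.Tensor:
    """Reassemble one sequence's contiguous KV tensor from paged blocks."""
    num_blocks = (seq_len + BLOCK_SIZE - 1) // BLOCK_SIZE
    # Look up the physical blocks used by this sequence, then flatten
    # them into a contiguous [seq_len, head_dim] view.
    blocks = kv_cache[block_table[:num_blocks].long()]
    return blocks.reshape(-1, kv_cache.shape[-1])[:seq_len]

# Toy example: 8 physical blocks; a 100-token sequence happens to live
# in non-contiguous blocks 5 and 2.
kv_cache = torch.randn(8, BLOCK_SIZE, 128, dtype=torch.bfloat16)
block_table = torch.tensor([5, 2], dtype=torch.int32)
kv = gather_kv(kv_cache, block_table, seq_len=100)
print(kv.shape)  # torch.Size([100, 128])
```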

The project has already been validated in production environments and has demonstrated excellent stability. The development team states that FlashMLA's design draws on best practices from FlashAttention 2 and 3 as well as NVIDIA's CUTLASS project, building on that foundation with its own innovations.

Developers can deploy FlashMLA quickly: executing "python setup.py install" completes the installation, after which the test script "python tests/test_flash_mla.py" can be run to benchmark its performance, as shown below.
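The two commands from the announcement, in order (run from a clone of the repository; a Hopper GPU and CUDA 12.6 are assumed):

```bash
# Clone the repository (URL in the link below), then build and install.
git clone https://github.com/deepseek-ai/FlashMLA
cd FlashMLA
python setup.py install

# Run the bundled test script to verify the install and measure performance.
python tests/test_flash_mla.py
```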

Open-source repository: https://github.com/deepseek-ai/FlashMLA