On March 24, 2025, DeepSeek, a Chinese artificial intelligence research lab, unexpectedly released the latest version of its flagship language model, DeepSeek-V3-0324, on the Hugging Face platform. This "quietly powerful" update quickly sparked heated discussion within the tech community, with numerous developers and AI enthusiasts sharing their initial experiences and expectations. The following is an in-depth report compiled from community feedback.
I. Mysterious Release: A 685 Billion Parameter Giant Appears Silently
DeepSeek maintained its characteristic low-key approach. According to the tech community, the new model quietly went live on Hugging Face on the morning of March 24th, without any official announcement or press conference. The new version boasts 685 billion parameters, compared with the 671 billion cited in the DeepSeek-V3 technical report released last December. This discrepancy has fueled speculation about potential architectural adjustments. Although the company hasn't disclosed detailed technical specifications, this "surprise attack" was enough to excite the community.
Multiple sources confirmed that DeepSeek only announced the upgrade through a group message, stating that the model was open-sourced on Hugging Face for everyone to download for free. Reports also indicate that third-party platforms quickly provided API support, showcasing the community's rapid response to the new model.
II. Performance Leap: Significant Improvement in Math and Programming Capabilities
The core highlight of this update is the performance gain. Although the company positioned it as a "minor update," initial tests show noticeable progress in mathematics and front-end design. Many technical evaluators reported that the model's programming capability now approaches the level of Claude 3.5 Sonnet. Some evaluators shared screenshots of front-end output generated by V3-0324, describing the initial results as "quite good."
Furthermore, early feedback suggests that, in addition to improvements in technical tasks, the new model may offer a more human-like conversational experience. However, since the company hasn't released benchmark data, these initial assessments require further verification.
III. New Open-Source Stance: Enthusiastic Community Response Under the MIT License
Unlike previous versions, DeepSeek-V3-0324 uses the more permissive MIT open-source license, a change widely viewed as positive. Tech commentators point out that, in addition to significantly enhanced programming capabilities, the model now ships under far less restrictive licensing terms. The 685-billion-parameter model is freely available on open platforms, reflecting DeepSeek's increasingly open attitude toward the open-source community.
The enthusiastic response in the Hugging Face comments section validates this observation. The dual advantages of open licensing and performance improvements position DeepSeek-V3-0324 as a potential industry game-changer, capable of challenging closed-source models such as OpenAI's GPT-4 and Anthropic's Claude 3.5 Sonnet.
IV. User Experience: Seamless Transition from Website to API
DeepSeek also optimized the user experience in this update. According to tech reports, users can directly use the V3-0324 version by simply turning off the "Deep Thinking" function on the official website, while the API interface and usage remain unchanged. This seamless transition design lowers the barrier to entry and has been well-received by the community.
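Because the API interface is unchanged, existing client code keeps working against the upgraded model. The sketch below builds the JSON body for an OpenAI-compatible chat-completion request; the `deepseek-chat` model identifier and the `https://api.deepseek.com` endpoint are assumptions drawn from DeepSeek's public API documentation and should be verified before use.

```python
import json

# Assumed OpenAI-compatible endpoint for DeepSeek's hosted API.
# Verify against the current API reference before sending real requests.
API_URL = "https://api.deepseek.com/chat/completions"

def build_chat_request(prompt: str, model: str = "deepseek-chat") -> dict:
    """Build the JSON body for an OpenAI-compatible chat completion call.

    Since V3-0324 was swapped in behind the same model name, no
    client-side change is needed to reach the new version.
    """
    return {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": prompt},
        ],
        "stream": False,
    }

payload = build_chat_request("Write a quicksort function in Python.")
print(json.dumps(payload, indent=2))
```

Sending the request additionally requires an `Authorization: Bearer <API key>` header; only the payload construction is shown here.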
V. Future Outlook: A Prelude to R2?
Despite being labeled a "minor upgrade," the impact of this update far exceeded expectations. Many in the tech community speculate whether this paves the way for the upcoming DeepSeek-R2. Previously, DeepSeek's R1 model competed with OpenAI's o1 model in logical reasoning and mathematical tasks, and the release of V3-0324 is seen as building a technical foundation for the next-generation reasoning model. Although DeepSeek hasn't confirmed a specific release date for R2, community anticipation is clearly rising.
VI. Summary: A Powerful Rise Understated
The release of DeepSeek-V3-0324 continues the company's consistent style: a low-key release with outstanding performance. From its 685 billion parameters to its significant improvements in mathematical and programming capabilities, and its open-source strategy under the MIT license, this model undoubtedly injects new vitality into the AI field. As one technical evaluator described it: "Quiet on the surface, but powerful like a tiger." Even before the technical details are fully disclosed, developers and researchers have eagerly engaged in testing, attempting to unlock the full potential of this "silent giant."
As more evaluation results emerge, whether DeepSeek-V3-0324 can truly shake up the existing AI landscape remains a focal point of attention in the coming weeks. What is certain is that DeepSeek is steadily advancing in the global AI race in its own unique way.