Chinese artificial intelligence company DeepSeek recently released a groundbreaking open-source large language model, DeepSeek V3. With 671 billion parameters, the model not only surpasses Meta's Llama 3.1 in scale but also outperforms mainstream closed-source models, including GPT-4, on several benchmarks.
A standout feature of DeepSeek V3 is that its strong performance came out of an unusually efficient development process. The model outperformed rivals on problems from the competitive-programming platform Codeforces and led the field on the Aider Polyglot test, which measures a model's ability to write code that integrates correctly into existing codebases. It was trained on an enormous dataset of 14.8 trillion tokens, and its 671 billion parameters are roughly 1.6 times the 405 billion of Llama 3.1.
Remarkably, DeepSeek completed training in about two months at a cost of roughly $5.5 million, far below the investment typically required for comparable models.
DeepSeek is backed by the Chinese quantitative hedge fund High-Flyer Capital Management, which built a server cluster of 10,000 Nvidia A100 GPUs valued at approximately $138 million. High-Flyer's founder, Liang Wenfeng, has said that open-source AI will ultimately break the monopoly of today's closed models.
DeepSeek V3 is released under a permissive license that allows developers to download, modify, and use it for a wide range of applications, including commercial ones. Running the full model still requires substantial hardware, but the release marks a significant step forward for open innovation in AI.
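For readers who want to experiment, the sketch below shows one plausible way to load and prompt the model with the Hugging Face transformers library. The repository name deepseek-ai/DeepSeek-V3, the prompt, and the generation settings here are illustrative assumptions rather than official instructions from DeepSeek, and the full 671-billion-parameter checkpoint needs a multi-GPU server, not a consumer machine.

```python
# Illustrative sketch: loading an open checkpoint with Hugging Face transformers.
# Assumes the weights are published under "deepseek-ai/DeepSeek-V3" and that
# enough GPU memory is available to hold the full model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/DeepSeek-V3"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # halve memory use versus float32
    device_map="auto",           # shard layers across available GPUs
    trust_remote_code=True,      # the repo ships custom architecture code
)

prompt = "Write a Python function that reverses a string."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

In practice, developers without server-class hardware would more likely run a quantized variant or call a hosted API, but the permissive license is what makes self-hosting like this possible at all.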