A Breakthrough in Domestic Large Models! DeepSeek R1 Open-Sourced, Performance Rivals OpenAI, Ushering in a New Era of AI Equality

AIbase基地

Published inAI News · 3 min read · Jan 21, 2025

950

DeepSeek has officially released and open-sourced its latest large language model, R1. This model performs exceptionally well and is considered comparable to OpenAI's official version o1. This initiative not only marks another significant breakthrough in domestic AI technology but also provides new options for global AI developers.

DeepSeek R1 has extensively applied reinforcement learning techniques during the post-training phase, significantly enhancing the model's reasoning capabilities even with very few labeled data. In key tasks such as mathematics, coding, and natural language reasoning, DeepSeek R1's performance is on par with OpenAI's official version o1, demonstrating its powerful capabilities.

To give back to the open-source community, DeepSeek has also open-sourced two models, DeepSeek-R1 and DeepSeek-R1-Zero, both with a parameter size of 660B. Additionally, DeepSeek has open-sourced six smaller models through model distillation technology, including models with 32B and 70B parameters. These smaller models surpass OpenAI's o1-mini in multiple capabilities, further enriching the open-source ecosystem.

In terms of API pricing, DeepSeek has also demonstrated an open approach: a cache hit costs only 1 yuan per million input tokens, while a miss costs 4 yuan; output tokens are priced at 16 yuan per million, making the overall pricing more competitive.

Importantly, DeepSeek R1 is licensed under the standard MIT License, allowing users unrestricted commercial use. Additionally, DeepSeek encourages users to utilize the outputs of R1 to train other models, further promoting the popularization and development of AI technology. The open-sourcing of DeepSeek R1 will undoubtedly provide global developers with more powerful tools and inject new vitality into the innovation and application of AI technology, signaling the accelerated arrival of an era of AI technology democratization.

Paper: https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf

API Documentation: https://api-docs.deepseek.com/en/guides/reasoning_model

DeepSeek R1 OpenAI Large Language Models

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

A Daily: Kimi Open Platform Launches Kimi Playground; OpenAI Unveils Major Release of ChatGPT Agent; Suno Introduces Voice Replacement Feature

[AI Daily Summary] Today's AI field saw multiple breakthroughs: 1) Moonshot AI's Kimi Open Platform launched Playground, upgrading AI from a conversational assistant to an intelligent assistant; 2) OpenAI released ChatGPT Agent, capable of performing tasks autonomously; 3) Suno v4.5+ introduced innovative music features such as voice replacement; 4) Google's Veo3 video generation model opened its API but at a high cost; 5) The first real-time video conversion AI model MirageLSD was introduced; 6) VSC

Jul 18, 2025

130

Perplexity Enters India: New Strategy to Challenge OpenAI in the AI Race

Perplexity partners with Bharti Airtel in India, offering 360M users free Pro service for a year. Downloads surged 600%, MAUs up 640%. Collaborating with Paytm, it aims to lead India's AI market despite monetization challenges.....

Jul 18, 2025

Head of ByteDance's Visual Large Model, Yang Jianchao, Announces Temporary Leave; Zhou Chang Takes Over, Drawing Attention

Yang Jianchao, the head of ByteDance's Visual Large Model team, announced a temporary leave due to family reasons, with Zhou Chang, former technical leader of Alibaba's Tongyi Qianwen, taking over. This personnel change occurred during a period of adjustment in ByteDance's AI department, sparking concerns about the stability of the technical roadmap. Yang Jianchao's work information remains in the internal system, and Zhou Chang will lead the global Seed team to continue research on visual multimodal generation. The company emphasized its continued investment in basic research and hopes that the new leader will bring innovative energy. This change highlights the importance of balancing work and health in the fast-paced tech industry.

Jul 18, 2025

VSCode's AI Programming Tool Traycer Excels at Handling Large Codebases

Traycer, a VSCode AI assistant, enhances coding with task breakdown, multi-agent collaboration, and real-time error detection. Offers 14-day trial, excels in large codebases.....

Jul 18, 2025

OpenAI Advisory Board Calls for Strengthened Nonprofit Regulation to Ensure Artificial Intelligence Benefits All Humanity

OpenAI advisory report advocates nonprofit AI governance for democratic participation, suggesting transition to a public benefit corporation to balance profit and social goals, with increased public interest funding.....

Jul 18, 2025

OpenAI Launches ChatGPT Agent: It Can Think Proactively, Browse, Shop, and Create Presentations!

OpenAI launches ChatGPT Agent, enabling AI to autonomously execute tasks like web browsing and form filling. Powered by GPT-4o, it achieves 71.3% accuracy in investment modeling. Currently available for Pro/Plus/Team users, with enterprise expansion planned.....

Jul 18, 2025

170

AI Influences Language Communication! Our Daily Conversations Contain More GPT Vocabulary

German study finds AI like ChatGPT is altering human language, creating 'GPT words'. Analysis shows AI-preferred terms like 'in-depth study' surge in usage across media, revealing unconscious human mimicry of AI speech patterns and raising concerns about technology's cognitive impact.....

Jul 17, 2025

100

API Price is Only 1/25 of Claude Opus, Kimi K2 Strongly Attracts Cursor Users

Cursor's regional restrictions drive developers to adopt Chinese AI Kimi K2, hitting 10B daily tokens. Its API is 5-25x cheaper than Claude, gaining global traction in coding and content creation.....

Jul 17, 2025

130

Google DeepMind Launches MoR Architecture: Expected to Significantly Improve the Efficiency of Large Language Models

DeepMind's Mixture-of-Recursions (MoR) enhances model efficiency via dynamic token routing and recursive depth allocation, outperforming Transformers with fewer parameters. Its selective caching reduces memory pressure, proving especially effective above 360M scale, offering optimized AI deployment solutions.....

Jul 17, 2025

300

New Breakthrough in Medical AI: OpenMed Launches Over 380 Open-Source Models to Revolutionize Global Medical Technology

OpenMed released 380+ free medical NER models on Hugging Face (Apache2.0). These models (109M-568M params) rival paid alternatives, integrated into major AI ecosystems. Addressing global healthcare shortages, the project allows free use/modification, with COVID screening API previously developed in 5 days. Team plans to expand model library for open-source medical AI.....

Jul 17, 2025

120

Product Finder

Product Submit

AI Models Finder

MCP Servers

MCP Client

MCP Inspector

Case Tutorials

Latest AI News

AI Daily Brief

A Breakthrough in Domestic Large Models! DeepSeek R1 Open-Sourced, Performance Rivals OpenAI, Ushering in a New Era of AI Equality

AIbase基地

This article is from AIbase Daily

AI News Recommendations

A Daily: Kimi Open Platform Launches Kimi Playground; OpenAI Unveils Major Release of ChatGPT Agent; Suno Introduces Voice Replacement Feature

Perplexity Enters India: New Strategy to Challenge OpenAI in the AI Race

Head of ByteDance's Visual Large Model, Yang Jianchao, Announces Temporary Leave; Zhou Chang Takes Over, Drawing Attention

VSCode's AI Programming Tool Traycer Excels at Handling Large Codebases

OpenAI Advisory Board Calls for Strengthened Nonprofit Regulation to Ensure Artificial Intelligence Benefits All Humanity

OpenAI Launches ChatGPT Agent: It Can Think Proactively, Browse, Shop, and Create Presentations!

AI Influences Language Communication! Our Daily Conversations Contain More GPT Vocabulary

API Price is Only 1/25 of Claude Opus, Kimi K2 Strongly Attracts Cursor Users

Google DeepMind Launches MoR Architecture: Expected to Significantly Improve the Efficiency of Large Language Models

New Breakthrough in Medical AI: OpenMed Launches Over 380 Open-Source Models to Revolutionize Global Medical Technology