The SiliconCloud platform is pleased to announce the official launch of batch inference for the DeepSeek-R1 & V3 API. Users can now send requests to SiliconCloud through the batch API, free from real-time rate limits, and complete large-scale data processing jobs within an expected 24-hour window.
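For readers who want a concrete picture, the workflow follows the familiar upload-then-submit pattern of OpenAI-compatible batch APIs: write one request per line to a JSONL file, upload it, create a batch job with a 24-hour completion window, and poll for results. The sketch below assumes an OpenAI-compatible client; the base URL, model identifier, and field values are illustrative assumptions rather than confirmed SiliconCloud parameters.

```python
# Hypothetical sketch of a batch submission via an OpenAI-compatible client.
# Base URL, model name, and purpose/endpoint values are assumptions.
import json
from openai import OpenAI

client = OpenAI(
    base_url="https://api.siliconflow.cn/v1",  # assumed endpoint
    api_key="YOUR_API_KEY",
)

# 1. Write each request as one JSON line; custom_id lets you match outputs back to inputs.
requests = [
    {
        "custom_id": f"task-{i}",
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": "deepseek-ai/DeepSeek-V3",  # assumed model identifier
            "messages": [{"role": "user", "content": text}],
        },
    }
    for i, text in enumerate(["Summarize report A", "Clean and normalize record B"])
]
with open("batch_input.jsonl", "w", encoding="utf-8") as f:
    for req in requests:
        f.write(json.dumps(req, ensure_ascii=False) + "\n")

# 2. Upload the file, then create the batch with a 24-hour completion window.
batch_file = client.files.create(file=open("batch_input.jsonl", "rb"), purpose="batch")
batch = client.batches.create(
    input_file_id=batch_file.id,
    endpoint="/v1/chat/completions",
    completion_window="24h",
)

# 3. Poll the job status; when it completes, download the output file by its file ID.
status = client.batches.retrieve(batch.id)
print(status.status)  # e.g. "validating", "in_progress", "completed"
```

Once the job reports completed, results come back as a JSONL output file in which each line carries the original `custom_id`, so responses can be joined back to the submitted requests.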

A major highlight of this update is a significant price cut: DeepSeek-V3 batch inference is priced 50% lower than real-time inference. Better still, from March 11 to March 18, DeepSeek-R1 batch inference carries a 75% discount, with input at ¥1 per million tokens and output at ¥4 per million tokens.


The introduction of batch inference is meant to help users handle large-scale data processing tasks, such as report generation and data cleaning, more efficiently and at lower cost. The feature is particularly well suited to data analysis and model performance evaluation scenarios that do not require real-time responses.

It is also worth mentioning that the DeepSeek-R1 & V3 API previously added support for Function Calling, JSON Mode, Prefix, and FIM. In addition, the TPM (tokens per minute) limit for the Pro version of the DeepSeek-R1 & V3 API has been raised from 10,000 to 1,000,000.
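As a quick illustration of one of those capabilities, the following is a minimal sketch of requesting structured output with JSON Mode on an OpenAI-compatible chat completions endpoint; the base URL and model identifier are assumptions, and the `response_format` parameter follows the OpenAI convention.

```python
# Hypothetical JSON Mode sketch against an OpenAI-compatible endpoint.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.siliconflow.cn/v1",  # assumed endpoint
    api_key="YOUR_API_KEY",
)

resp = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-V3",  # assumed model identifier
    messages=[
        {"role": "system", "content": "Reply with a JSON object containing 'title' and 'summary'."},
        {"role": "user", "content": "Summarize the Q1 sales report in one sentence."},
    ],
    response_format={"type": "json_object"},  # JSON Mode: constrains output to valid JSON
)
print(resp.choices[0].message.content)
```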