2025-03-12 16:44:44 · AIbase
SiliconFlow: DeepSeek-R1 & V3 APIs Now Support Batch Inference, R1 Price Cut by 75%
SiliconFlow announced on its official WeChat account that, effective immediately, the DeepSeek-R1 and DeepSeek-V3 APIs on its SiliconCloud platform support batch inference. Users can submit requests to SiliconCloud through the batch API, where they are not subject to the rate limits that apply to real-time inference; tasks are expected to complete within 24 hours. DeepSeek-V3 batch inference is priced 50% lower than real-time inference. From March 11 to March 18, DeepSeek-R1 is priced 75% lower.
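To make the discount arithmetic concrete, here is a minimal Python sketch. The announcement gives only the percentages, so the base price and token count below are hypothetical placeholders, not SiliconCloud's actual rates, and the function name is illustrative.

```python
def discounted_cost(tokens: int, price_per_million: float, discount: float) -> float:
    """Return the cost of `tokens` tokens after a fractional discount (0.5 = 50% off)."""
    if not 0.0 <= discount <= 1.0:
        raise ValueError("discount must be between 0 and 1")
    return tokens / 1_000_000 * price_per_million * (1.0 - discount)


# Hypothetical real-time price per million tokens, for illustration only.
EXAMPLE_PRICE = 8.0
EXAMPLE_TOKENS = 10_000_000

# 50% off, the discount announced for DeepSeek-V3 batch inference.
v3_batch_cost = discounted_cost(EXAMPLE_TOKENS, EXAMPLE_PRICE, 0.50)

# 75% off, the discount announced for DeepSeek-R1 from March 11 to March 18.
r1_promo_cost = discounted_cost(EXAMPLE_TOKENS, EXAMPLE_PRICE, 0.75)

print(f"50% off: {v3_batch_cost:.2f}")
print(f"75% off: {r1_promo_cost:.2f}")
```

With these placeholder figures, a workload that would cost 80.00 at the full example price costs 40.00 at 50% off and 20.00 at 75% off. The actual savings depend on the platform's published per-token prices.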