The parameter scale of large models has grown a hundredfold, now surpassing the trillion threshold, bringing heavy resource consumption and escalating costs for storage, inference, operations, and deployment. Large model companies are therefore pursuing a "cost reduction" drive along four lines. First, scaling up data, so that economies of scale improve its marginal returns. Second, compressing models, so they run with faster inference, lower latency, and a smaller resource footprint without sacrificing performance. Third, raising computational efficiency, by improving the performance of chips and computing clusters. Fourth, stratifying the business, with distinct commercial paths emerging for models of different sizes, capabilities, and orientations. For large models to deliver long-term, sustainable service, "cost reduction" is a journey they must take.
After Reaching Trillion Scale, Large Models Enter a New Phase of "Cost Reduction"

脑极体
This article is from AIbase Daily