Information

Latest AI News

Explore AI Frontiers, Master Industry Trends

AI Daily Brief

Your Daily AI Brief - Never Miss What's Next

Information

AI Product Finder

Smart Product Discovery - Comprehensive Market Intelligence

AI Product Rankings

AI Product Power Rankings - Performance, Buzz & Trends

AI Product Submit

Submit Your AI Product - Amplify Reach & Drive Growth

Tools

AI Tools Directory

Discover The Best AI Websites & Tools

Tools

GEO Brand Visibility

All-in-One GEO Brand Insights Platform

AI Visibility Audit

Quickly check how your brand is perceived and presented in AI-powered search results.

AI Search Visibility Checker

Detect brand's visibility on AI platforms

AI Conversation Insight

Discover trending questions users ask AI to guide content strategy

GEO Promotion Link Detection

Quickly evaluate the citation of promotion articles on AI platforms

Service

GEO Ranking Optimization System

Own your own GEO system and become a professional GEO optimization service provider.

GEO Ranking Optimization

Achieve Dominant Visibility in AI Search for Your Business or Brand with GEO Services

Information

MCP Servers

Discover Popular AI-MCP Services - Find Your Perfect Match Instantly

MCP Client

Easy MCP Client Integration - Access Powerful AI Capabilities

MCP Case Tutorials

Master MCP Usage - From Beginner to Expert

MCP Ranking

Top MCP Service Performance Rankings - Find Your Best Choice

MCP Service Submission

Publish & Promote Your MCP Services

Tools

MCP Playground

Test MCP Services Freely - Quick Online Experience

MCP Inspector

Quick MCP Service Testing - Fast Deployment

Information

LLM API Hub

One-stop integration for all major LLM APIs.

AI Models Finder

Comprehensive AI Models Collection for All Your Development & Research Needs

Model Providers

Discover Trusted AI Model Partners - Guaranteed Reliable Support

LLM Leaderboard

AI LLM Power Rankings - Performance, Buzz & Trends

Tools

Compare LLMs

Multi-Dimensional Large Model Comparison - Find Your Perfect Match

LLM Cost Calculator

Calculate AI Model Costs Accurately - Optimize Your Budget

LLM Arena

Multi-Model Real-Time Evaluation & Quick Output Comparison

AI Model Compatibility Checker

Free PC Hardware Test for DeepSeek & Llama

AI Deployment Calculator

Enter Your Large Model Computing Requirements for Instant GPU, Memory & Server Configuration Recommendations

AI Marketplace

Doubao Large Model Family Upgraded, Launches Powerful Visual Understanding Model

AIbase基地

Published inAI News · 6 min read · Dec 18, 2024

650

At the Volcano Engine FORCE Power Conference on December 18, 2024, Volcano Engine announced a comprehensive upgrade to the Doubao large model family and launched a brand new visual understanding model.

Tan Dai, the president of Volcano Engine, stated that the daily token usage of the Doubao large model has surged in the past few months, exceeding 4 trillion tokens, a 33-fold increase compared to its release in May. This growth trend indicates the widespread use of the Doubao large model across various application scenarios.

With the launch of the visual understanding model, users can input both text and image questions simultaneously, allowing the model to comprehend and provide accurate answers. This innovation will significantly simplify the application development process and unlock the potential of large models in more scenarios.

The visual understanding model possesses enhanced content recognition capabilities, allowing it to identify basic elements such as object categories and shapes in images, as well as understand the relationships between objects, spatial layouts, and the overall meaning of scenes. For example, it can recognize shadows and understand natural knowledge.

The visual understanding model also features stronger understanding and reasoning abilities, enabling it to better recognize content and perform complex logical calculations based on the identified text and image information, such as chart reasoning and physical reasoning.

Additionally, it has a more refined visual description capability, allowing for detailed descriptions of the content presented in images and enabling various forms of creative writing, such as image creation and image poetry.

The Doubao visual understanding model shows broad application prospects in various fields such as education, tourism, and e-commerce. For instance, in education, the model can help students optimize their essays and scientific knowledge; in tourism, it can provide translations of foreign menus and explanations of architectural backgrounds for tourists; in e-commerce marketing, it can assist merchants in detailing product features, thus improving advertising effectiveness.

The usage cost of the visual understanding model is very affordable, with a price of 0.003 yuan per thousand tokens, which is 85% lower than the industry average. This pricing allows for the processing of up to 284 images at 720P for every yuan spent, marking the entry of visual understanding technology into the "cent era." Furthermore, Volcano Engine offers up to 15,000 initial traffic supports for enterprises and developers to better utilize this technology.

At this conference, Volcano Engine not only launched the visual understanding model but also upgraded several other models. The comprehensive task handling capability of the Doubao general model pro has improved by 32% since May, with significant enhancements in reasoning, instruction following, coding, and mathematics. Meanwhile, the Doubao video generation model will be available for external service in January 2025, and enterprises can make reservations to use it.

To enhance enterprises' information acquisition and search recommendation capabilities, Volcano Engine also launched a comprehensive AI search service, helping businesses better connect information with user needs and facilitating the intelligent transformation of various industries.

Key Points:
🔍 The daily token usage of the Doubao large model has reached 4 trillion, a 33-fold increase since May.
💡 The newly launched visual understanding model supports simultaneous input of text and images, applicable in education, tourism, and e-commerce.
💰 The usage cost is only 0.003 yuan per thousand tokens, significantly lower than the industry average.

FireMountainGuide BeanBagLargeModel VisualInterpretationModel AINewVocabulary

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Volcano Engine Launches Next-Generation Automotive AI Solution, Over 7 Million Vehicles Equipped with Doubao Large Model

At the 2026 Beijing Auto Show, Volcano Engine unveiled a next-generation automotive AI solution based on Agentic AI architecture, including AI cockpit suites and a Doubao cockpit assistant. This solution aims to upgrade smart cockpits from 'voice interaction' to an autonomous 'car brain' with thinking and execution capabilities. Yang Liwei, VP of Volcano Engine, stated that the upgrade leverages three core engines to break existing cockpit bounda....

Apr 28, 2026

440

Dongfeng Motor and Volcano Engine Reach Strategic Cooperation to Promote the Application of AI Technology in the Automotive Industry

Dongfeng Motor and ByteDance's Volcano Engine have signed a strategic cooperation agreement, combining Dongfeng's expertise in vehicle R&D and manufacturing with Volcano Engine's AI and cloud computing strengths. They will focus on intelligent cockpits, enterprise digital transformation, and AI cloud platform development, promoting the application of the Doubao large model across the automotive supply chain.....

Apr 24, 2026

280

Daily Consumption Exceeds 12 Trillion! ByteDance's Douyin Large Model Becomes the Traffic King: Surged 1000 Times in Two Years

Doubao AI model's daily token usage exceeds 120 trillion, setting a new industry record and showcasing ByteDance's strong penetration in AI applications. Usage has doubled in the past three months, with significant growth compared to two years ago.....

Apr 2, 2026

700

Volcano Engine FORCE Conference Unleashes: Douba Large Model 1.8 + Seedance 1.5 Pro Released, Daily 50 Trillion Tokens Reach the Top of China

ByteDance unveils Doubao 1.8 and Seedance 1.5 Pro at Volcano Engine Conference, launching 'AI Savings Plan' to cut enterprise costs, with enhanced reasoning, multilingual support, and improved video generation.....

Dec 18, 2025

2.3k

Volcanic Engine Reveals the Achievements of Doubao Large Model, 417-Time Growth Opens the Era of AI Mass Production

By December 2025, Doubao AI model's daily token usage exceeded 50 trillion, a 417-fold increase since May 2024, with over 100 enterprises utilizing it via Volcano Engine.....

Dec 18, 2025

970

The New Era of AI Has Arrived! Yangcong Academy Launches the Self-Learning Breakthrough Plan to Open a New Chapter in Autonomous Learning

Onion Academy launched the 'Self-Study Breakthrough Plan 1.0' in Beijing, using AI-driven multi-agent collaboration to cultivate students' autonomous learning and drive transformative education reform.....

Nov 6, 2025

560

ByteDance Releases Dou Bao Large Model 1.6: The First Domestic Model Supporting Adjustable Thinking Depth

ByteDance's Volcano Engine launches Doubao 1.6, China's first adjustable-length AI model. Features four thinking-depth options to balance output quality and response speed. Key innovation: 77.5% fewer tokens consumed in low-speed mode.....

Oct 17, 2025

1.4k

Douyin's Duanbao Large Model: Daily Calls Exceed 30 Trillion Tokens, Rapid Growth is Remarkable!

Doubao model's usage surged from 120B tokens in May 2024 to over 30T tokens by Sept 2025, a 253x growth, reflecting rapid adoption across industries.....

Oct 16, 2025

990

B站 Launches AI Voice Translation Feature: Retains Uploader's Voice Tone, Solving the Challenge of Anime Culture Going Global

Aug 4, 2025

1.2k

ByteDance Doubao Large Model Daily Usage Surges 137 Times, Launches Multiple New Products

Volc Engine announced that the daily tokens usage of Doubao Large Model exceeded 1.6 trillion, growing 137 times from last year, with a market share of 46.4%. At the launch event, new products such as Image Editing 3.0 and Simultaneous Interpretation 2.0 were introduced, and the large model was upgraded to version 1.6. President Tan Dai pointed out that AI is shifting from a tool to an intelligent agent, which will significantly change ways of living and working. IDC predicts that the public cloud large model usage in China will reach 114.2 trillion tokens in 2024. Volc Engine expects revenue to exceed 12 billion yuan in 2024. New products...

Jul 31, 2025

1.6k