AI Daily: Surpassing o1! Domestic Large Model DeepSeek R1 Open-Sourced; Kimi Multimodal Thinking Model k1.5 Debuts; Qingying 2.0 Launches on Zhipu Qingyan

站长之家

Published in AI News · 16 min read · Jan 21, 2025

Welcome to the 【AI Daily】 column, your daily guide to the world of artificial intelligence. Each day we cover the hottest topics in AI with a focus on developers, helping you keep up with technology trends and innovative AI product applications.

Discover new AI products: https://top.aibase.com/

1. Breakthrough for Domestic Large Models! DeepSeek R1 is Open-Sourced, Performance Rivals OpenAI, Ushering in a New Era of AI Equality

DeepSeek has released and open-sourced its latest large language model, R1, a significant breakthrough for domestic AI technology. The model performs comparably to the official release of OpenAI's o1, particularly on mathematics, coding, and natural language reasoning tasks.

【AiBase Highlights:】
🌟 DeepSeek R1 applies reinforcement learning techniques during post-training, significantly enhancing reasoning capabilities.
📊 DeepSeek open-sourced the 660B-parameter DeepSeek-R1 and DeepSeek-R1-Zero models along with 6 smaller models, enriching the open-source ecosystem.
💰 API pricing is competitive: input tokens cost only 1 yuan per million on a cache hit, encouraging commercial use.
Details link: https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf
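To put the quoted cache-hit rate in concrete terms, here is a minimal arithmetic sketch. It uses only the figure given in the highlights (1 yuan per million input tokens on a cache hit); the function name and the sample token count are illustrative assumptions, and cache-miss or output-token pricing is not covered because the digest does not state it.

```python
from decimal import Decimal

# Rate quoted in the highlights: 1 yuan per million input tokens on a cache hit.
CACHE_HIT_YUAN_PER_MILLION = Decimal("1")


def cache_hit_input_cost(input_tokens: int) -> Decimal:
    """Yuan cost of input tokens billed at the cache-hit rate (hypothetical helper)."""
    return CACHE_HIT_YUAN_PER_MILLION * Decimal(input_tokens) / Decimal(1_000_000)


# Example: a 250,000-token prompt served entirely from cache.
print(cache_hit_input_cost(250_000))
```

At this rate, a 250,000-token cached prompt costs a quarter of a yuan.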

2. Moonshot AI Releases New-Generation SOTA Model k1.5 with Upgraded Multimodal Reasoning

Moonshot AI has launched the k1.5 multimodal thinking model, a significant advance in both multimodal and general reasoning. The model can handle text, images, and sound simultaneously, which improves its understanding of and responses to complex tasks. Its strong general reasoning ability lets k1.5 perform well in scenarios such as programming and mathematical problem-solving.

【AiBase Highlights:】
🌟 The k1.5 model possesses outstanding multimodal reasoning capabilities, able to process text, images, and sound information simultaneously.
🤖 Its strong general reasoning ability makes k1.5 suitable for a variety of tasks, such as programming and mathematics, offering high flexibility.
📱 The preview version of the k1.5 model is now live on Kimi.com and the Kimi Smart Assistant App, allowing users to experience new features.

3. Free Trial! Zhipu's AI Video Product Qingying 2.0 Is Now Fully Available on Zhipu Qingyan

Beijing Zhipu Huazhang Technology Co., Ltd. has launched Qingying 2.0, a comprehensive upgrade of its AI video product that significantly improves model capabilities and video generation quality. The new version generates natural, smooth motion and striking visuals, letting users create complex scenes from simple prompts. Qingying 2.0 also broadens its range of artistic styles, supporting video generation in many different styles.

【AiBase Highlights:】
🚀 The foundational model capability of Qingying 2.0 has improved by 38%, generating natural and smooth video content.
🎨 The new version supports video generation in various artistic styles, enhancing visual appeal.
💡 Users can achieve complex scenes with simple prompts, showcasing creativity and stability.
Details link: https://chatglm.cn/video?lang=zh

4. Doubao App Launches New Voice Mode, Ahead of GPT-4o for Singing and Role-Playing

The latest release of the Doubao App introduces an "end-to-end" large voice model and a significant update to real-time voice calls, a major step forward in voice interaction. The new model integrates speech recognition, understanding, and generation, producing human-like expression and emotional output that makes conversations feel more intelligent. New personality modes make interactions more fun and extend Doubao's use in emotional companionship and psychological counseling.

【AiBase Highlights:】
🎶 The new "end-to-end" voice large model integrates voice recognition, understanding, and generation, improving conversation fluency.
🌟 The newly added "Soul Singer" and "Versatile Star" modes allow Doubao to sing and role-play, showcasing unique personality.
🤖 The new personality modes "Angry Little Bag" and "Compliment Master" enhance interaction fun, expanding AI's application scenarios.

5. OpenAI to Launch AI Tool "Operator" That Can Control Computers

OpenAI is developing an AI tool called "Operator," expected to be released in January 2025. This tool can autonomously control personal computers, performing tasks such as coding and travel booking. Although it performs well in certain safety assessments, its success rate in task execution is still lower than that of humans, and experts express concerns about its potential safety risks. Market analysis predicts that the AI agent market will grow rapidly in the coming years.

【AiBase Highlights:】
🔍 OpenAI's "Operator" tool will have the capability to autonomously control computers and perform various tasks.
🛠️ Although "Operator" performs well in certain assessments, its task success rate is still lower than that of humans.
⚠️ Experts express concerns about the potential safety risks of "Operator," even though it performs well in safety assessments.

6. Support for Chinese Fonts! Meitu WHEE's "AI Poster" Feature Coming Soon

Meitu has recently announced the upcoming launch of the "AI Poster" feature in the WHEE app, aiming to simplify the poster-making process through AI technology. Users can generate various styles of posters by simply entering a sentence, with strong support for Chinese fonts to meet personalized needs. Additionally, this feature offers powerful custom layout capabilities, covering multiple core scenarios to help users design efficiently.

【AiBase Highlights:】
🎨 Users can generate various styles of posters through simple input, supporting Chinese fonts.
🛠️ Provides powerful custom layout capabilities suitable for multiple scenarios such as movies and e-commerce.
✨ The "No-Cut Material" feature is now live, supporting the generation of customized PNG materials in various styles.

7. Baidu Wenku's AI Function Monthly Active Users Exceed 90 Million, Paid Users Over 40 Million

During Baidu's recent AI Open Day event, Baidu Vice President Wang Ying shared significant progress in the application of AI technology in Baidu Wenku. The platform's monthly active users have surpassed 90 million, with paid users exceeding 40 million, demonstrating the strong appeal of AI features. In the past year, Baidu Wenku has added over 100 AI features, including intelligent PPT and full-network search tools, greatly enhancing users' document processing and learning experiences.

【AiBase Highlights:】
📈 Monthly active users exceed 90 million, with daily active users increasing by 230% year-on-year, showcasing the platform's strong appeal.
🛠️ Over 100 new AI features added, including intelligent PPT and full-network search, meeting diverse user needs and improving document processing efficiency.
🎨 The 'Free Canvas' feature is now in public beta, supporting multi-task parallel processing, simplifying the creation process, and enhancing user experience.

8. The World's First Chatbot ELIZA Revived, Originating from 60-Year-Old Code

A research team from the United States and the UK has successfully revived the code of ELIZA, the first electronic chatbot in history. The code was originally written by MIT professor Joseph Weizenbaum in the 1960s. After discovering the original code, the researchers made technical adjustments to get it running again, although some issues remain, such as the program crashing when numbers are entered.

【AiBase Highlights:】
🗨️ ELIZA is the first electronic chatbot, with code written by Joseph Weizenbaum in the 1960s.
💻 The research team successfully revived this code and resolved multiple technical issues, allowing it to run normally.
📜 ELIZA holds significant importance in computer history, being regarded as the pioneer of chatbots.

9. Chinese Research Team Releases VideoChat-Flash, Long Video Processing Speed Increased by 100 Times

A Chinese research team has launched the VideoChat-Flash system, which uses HiCo, a hierarchical video token compression technique, to make long video processing significantly more efficient. The technique reduces redundant information, lowers computational demands, and improves the model's understanding capabilities. Experimental results show that the system performs strongly across multiple benchmarks, making it an advanced model in the field of long video processing.

【AiBase Highlights:】
🌟 Researchers proposed HiCo, a hierarchical video token compression technique that significantly reduces the computational demands of long video processing.
📹 The "VideoChat-Flash" system employs a multi-stage learning approach, training with both short and long videos to enhance the model's understanding capabilities.
🔍 Experimental results show that this method has achieved new performance standards across multiple benchmark tests, becoming an advanced model in the field of long video processing.
Details link: https://arxiv.org/abs/2501.00574

10. Say Goodbye to Traditional Crawlers! Firecrawl Extract Requires No Coding to Easily Scrape Data from Any Website

The launch of Firecrawl Extract signals a shift away from hand-written web crawlers. Because it accepts natural language prompts, users no longer need to write crawler scripts and can focus on data analysis and application instead, significantly improving work efficiency. The tool makes data scraping smarter and simpler, driving further development in data collection technology.

【AiBase Highlights:】
🛠️ Firecrawl Extract allows users to extract website data simply by typing prompts, eliminating the tedious programming process through natural language processing technology.
🌍 This tool supports data scraping from multilingual and international websites, capable of handling dynamically rendered JavaScript content, ensuring accurate data acquisition.
🔗 Provides API interfaces for easy integration with other applications, supporting large-scale data processing to meet big data analysis needs.
Details link: https://github.com/mendableai/firecrawl

11. Over 25% of Laptops Shipped in 2024 Will Feature Generative AI Capabilities

Counterpoint's latest market research report indicates that the global PC market will grow in 2024, with shipments expected to reach 253 million units, a 2.6% increase from 2023. The growth is driven primarily by the end of support for Windows 10 and the launch of a new generation of AI laptops. Shipments in the fourth quarter of 2024 are projected to grow 3.7% year-on-year as demand for enterprise IT system upgrades increases, and AI laptops are expected to transform the user experience and drive market development.

【AiBase Highlights:】
🌍 Global PC shipments in 2024 are expected to reach 253 million units, a year-on-year increase of 2.6%.
💻 Over 25% of new laptops will feature generative AI capabilities, driving market upgrades.
📈 By 2025, AI laptops are expected to capture nearly 60% of the market share, with commercial orders likely to increase.
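The shipment figures above imply a 2023 baseline that the digest does not state. The sketch below backs it out from the two numbers that are given (253 million units and 2.6% growth); the variable names are illustrative, and the result is only an approximation derived from rounded inputs, not a figure taken from the Counterpoint report.

```python
# Figures quoted in the digest.
SHIPMENTS_2024_MILLIONS = 253.0   # expected 2024 global PC shipments
GROWTH_OVER_2023 = 0.026          # 2.6% year-on-year increase

# If 2024 = 2023 * (1 + growth), then 2023 = 2024 / (1 + growth).
implied_2023_millions = SHIPMENTS_2024_MILLIONS / (1 + GROWTH_OVER_2023)
implied_increase_millions = SHIPMENTS_2024_MILLIONS - implied_2023_millions

print(f"Implied 2023 shipments: about {implied_2023_millions:.1f} million units")
print(f"Implied increase: about {implied_increase_millions:.1f} million units")
```

Under these assumptions the 2023 baseline works out to roughly 246.6 million units, so the 2.6% growth corresponds to about 6.4 million additional PCs.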

Aliyun's Open-Source Unified Scientific Large Model LOGOS Surpasses Microsoft with Only 1/56th of the Parameters

Alibaba's ATH-Token Foundry and Renmin University's Gaoling School of AI open-source LOGOS, a science foundation model. Using unified scientific grammar and pure sequence modeling, it matches or surpasses specialized methods on six tasks. LOGOS-1B with 1B parameters outperforms Microsoft's 8×7B model, showing extreme efficiency.....

OpenAI CEO Altman Cancels South Korea and Japan Visit Following the Birth of His Second Daughter

Sam Altman canceled his South Korea and Japan trips due to his second daughter's premature birth, dispelling speculation about government investigations or new model launches. The move reflects Silicon Valley's emphasis on work-life balance. His planned visits aimed to deepen regional cooperation.....

Tongyi Lab Collaborates to Open Source the First Unified Scientific Large Model LOGOS, 1B Parameters Outperform NatureLM

Tongyi Lab collaborated with the Gaoling School of Artificial Intelligence at Renmin University to open-source the scientific foundation model LOGOS in 2026. It pioneered a 'unified scientific grammar' that encodes heterogeneous objects such as proteins, molecules, materials, and chemical reactions into discrete token sequences, breaking through the 'one task, one expert model' barrier in traditional AI4S and enabling cross-domain knowledge transfer and unified modeling.

Report: DeepSeek Completes A-Round Financing of 51 Billion Yuan, with Giants Like Tencent and JD.com Participating

DeepSeek completed an A-round financing of approximately 51 billion yuan, with its valuation surging to 400 billion yuan. Due to the highly promising market prospects, the financing competition was intense, and it has shifted from seeking investors to companies screening participation qualifications, with a top-tier investment team ultimately investing.

South Korea Joins Forces with OpenAI: Global AI Safety Assessment Framework Expands

The South Korean Ministry of Science and ICT signed a memorandum of understanding with OpenAI, becoming the fourth country to establish AI safety cooperation with it. The two parties will work together with the South Korea Artificial Intelligence Security Institute to jointly build a scientific and standardized global artificial intelligence security evaluation framework.

Intense Battle Among AI Giants: SpaceX Acquires Cursor for $60 Billion, OpenAI Suffered a $38.5 Billion Loss Last Year

The AI industry is experiencing accelerated capital realignment. SpaceX announced it would acquire Anysphere, the parent company of the AI coding tool Cursor, through a $6 billion stock-only deal with no cash involved, highlighting the tech giants' strong demand for AI coding capabilities. This move also reflects the financial pressure faced by top model companies despite their high growth.

Big Companies Can't Afford the Huge AI Bills! Microsoft's Intelligent Agent Considers Switching to DeepSeek's Phantombase

Due to the high costs of top-tier AI models, Microsoft is shifting its Copilot Cowork intelligent agent to a pay-per-use model and plans to introduce DeepSeek's V4 fine-tuned version from a Chinese company to reduce costs for enterprise customers and accelerate the adoption of AI tools in enterprises.


This article is from AIbase Daily

AI News Recommendations

Noam Shazeer, the Core Author of Transformer, Joins OpenAI; Google's Huge Investment Could Not Keep Him

Aliyun's Open-Source Unified Scientific Large Model LOGOS Surpasses Microsoft with Only 1/56th of the Parameters

OpenAI CEO Altman Cancels South Korea and Japan Visit Following the Birth of His Second Daughter

Tongyi Lab Collaborates to Open Source the First Unified Scientific Large Model LOGOS, 1B Parameters Outperform NatureLM

Report: DeepSeek Completes A-Round Financing of 51 Billion Yuan, with Giants Like Tencent and JD.com Participating

South Korea Joins Forces with OpenAI: Global AI Safety Assessment Framework Expands

Intense Battle Among AI Giants: SpaceX Acquires Cursor for $60 Billion, OpenAI Suffered a $38.5 Billion Loss Last Year

From Passive Q&A to Proactive Execution: ChatGPT Launches Scheduled Tasks, Accelerating the Evolution of Intelligent Assistants

OpenAI Exposed as Preparing to Launch New Dual-Directional Voice Model GPT-Bidi-1

Big Companies Can't Afford the Huge AI Bills! Microsoft's Intelligent Agent Considers Switching to DeepSeek's Phantombase
