Recently, Moonshot AI, the Beijing-based company behind the intelligent assistant Kimi, announced a major technological upgrade, introducing the new k1.5 multimodal thinking model. The model reaches industry-leading levels in both multimodal reasoning and general reasoning, marking another breakthrough for Kimi in artificial intelligence.

The k1.5 multimodal thinking model is the third major upgrade to Kimi's k series of reinforcement learning models in just three months, following the k0-math mathematical model released last November and the k1 visual thinking model released in December. The k1.5 model performs strongly in benchmark tests. In short-CoT mode, its mathematics, coding, visual multimodal, and general capabilities significantly surpass those of GPT-4o and Claude 3.5 Sonnet, the globally recognized short-reasoning SOTA models, leading by as much as 550%. In long-CoT mode, its mathematics, coding, and multimodal reasoning capabilities also reach the level of the long-reasoning SOTA model, the official OpenAI o1 release, making Moonshot AI the first company outside OpenAI to achieve multimodal reasoning performance on par with the official o1.

This upgrade is the result of the relentless efforts and innovations of the Kimi technical team. The team has publicly released a detailed model training technical report titled "Kimi k1.5: Scaling Reinforcement Learning with Large Language Models," documenting the exploration of model training under the new technological paradigm.


The report highlights the key innovations behind k1.5: long-context scaling, where partial rollout techniques improve training efficiency and increasing the context length continues to raise model performance; improved policy optimization methods; and a streamlined framework design, which together underpin the model's strong results. Notably, k1.5 is trained jointly on text and visual data, enabling joint reasoning across the two modalities. It is especially strong in mathematics, although geometry problems that depend on interpreting figures remain a challenge.
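The report is summarized here only at a high level. As a rough, hypothetical illustration of how partial rollouts can raise training throughput, the sketch below gives each long chain-of-thought a fixed token budget per training iteration and carries unfinished trajectories over to the next iteration instead of letting one very long generation stall the batch. The names Rollout, generate_tokens, and TOKEN_BUDGET, as well as the dummy token generator, are invented for this example and are not taken from the report.

```python
import random
from dataclasses import dataclass, field

TOKEN_BUDGET = 512        # assumed: max new tokens per trajectory per iteration
MAX_TOTAL_TOKENS = 8192   # assumed: overall cap on chain-of-thought length


@dataclass
class Rollout:
    prompt: str
    generated: list = field(default_factory=list)  # tokens produced so far
    finished: bool = False


def generate_tokens(prompt, prefix, budget):
    """Stand-in for the policy model: emits dummy tokens and sometimes an EOS."""
    n = random.randint(1, budget)
    return [f"tok{i}" for i in range(n)], random.random() < 0.3


def step_partial_rollouts(active, new_prompts):
    """Advance each rollout by at most TOKEN_BUDGET tokens; split finished/unfinished."""
    active = active + [Rollout(p) for p in new_prompts]
    completed, carried = [], []
    for r in active:
        tokens, hit_eos = generate_tokens(r.prompt, r.generated, TOKEN_BUDGET)
        r.generated.extend(tokens)
        r.finished = hit_eos or len(r.generated) >= MAX_TOTAL_TOKENS
        (completed if r.finished else carried).append(r)
    # Completed rollouts would go to reward scoring and the policy update;
    # unfinished ones carry their partial chain-of-thought into the next iteration.
    return completed, carried


if __name__ == "__main__":
    carried = []
    for it in range(3):
        done, carried = step_partial_rollouts(carried, [f"problem-{it}"])
        print(f"iteration {it}: {len(done)} finished, {len(carried)} carried over")
```

The point of the scheme is that the cost of very long generations is amortized over several iterations, so batch throughput is not dominated by the slowest trajectory.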

To further strengthen short-chain reasoning, the team also proposes an effective long2short method that uses Long-CoT techniques to improve the Short-CoT model. It achieves significant gains on benchmarks such as AIME, MATH500, and LiveCodeBench, far surpassing existing short-chain reasoning models such as GPT-4o and Claude 3.5 Sonnet.
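The article does not spell out the long2short recipe. One plausible, assumed instantiation is sketched below: sample several Long-CoT solutions per problem, keep the shortest one that a verifier marks correct, and use those compact solutions as fine-tuning targets for the Short-CoT model. The functions sample_long_cot, is_correct, and build_long2short_targets are hypothetical stand-ins, not Kimi's actual pipeline.

```python
def sample_long_cot(problem: str, n: int) -> list[str]:
    """Stand-in for the Long-CoT model: returns n candidate solutions."""
    return [f"reasoning of length {i} for {problem}" for i in range(1, n + 1)]


def is_correct(problem: str, solution: str) -> bool:
    """Stand-in for an answer verifier (e.g. exact match against a reference)."""
    return True


def build_long2short_targets(problems: list[str], n_samples: int = 8) -> dict[str, str]:
    """For each problem, keep the shortest verified-correct Long-CoT sample."""
    targets = {}
    for p in problems:
        correct = [s for s in sample_long_cot(p, n_samples) if is_correct(p, s)]
        if correct:
            targets[p] = min(correct, key=len)  # shortest correct reasoning wins
    # These problem/solution pairs would then supervise fine-tuning of the Short-CoT model.
    return targets


if __name__ == "__main__":
    print(build_long2short_targets(["AIME-style problem 1"]))
```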

The preview version of the k1.5 multimodal thinking model will gradually roll out on the Kimi.com website and in the latest version of the Kimi intelligent assistant app; users can try the upgraded model via the model switch button. The k1.5 model excels at deep reasoning, helping users tackle complex coding issues, mathematical problems, and work-related challenges.

Moonshot AI stated that in 2025 it will continue to accelerate upgrades to the k series reinforcement learning models along its established roadmap, adding more modalities, capabilities in more domains, and stronger general abilities to unlock more possibilities for users.

GitHub report link: https://github.com/MoonshotAI/kimi-k1.5