DeepSeek springs a late-night surprise with the launch of its new multimodal model, Janus-Pro

AIbase基地

Published in AI News · 3 min read · Jan 28, 2025

Chinese AI developer DeepSeek has launched Janus-Pro, a new multimodal large model, marking its official entry into text-to-image generation. The release is a significant step forward for DeepSeek in multimodal AI.

In the GenEval and DPG-Bench benchmarks, Janus-Pro-7B surpassed OpenAI's DALL-E 3 and also outperformed popular models such as Stable Diffusion and Emu3-Gen. Janus-Pro is released under the MIT open-source license, which permits commercial use. DeepSeek describes Janus-Pro as an advanced version of its earlier Janus model; it follows JanusFlow, which was released on November 13, 2024.

DeepSeek releases a new multimodal large model, outperforming OpenAI

Compared to its predecessor, Janus-Pro uses an optimized training strategy, more training data, and a larger model. These changes bring significant gains in multimodal understanding and in following text-to-image instructions, and they make text-to-image generation more stable.

Although Janus-Pro currently handles images only at a resolution of 384x384, the quality it achieves is impressive given its compact model size.

As a multimodal model, Janus-Pro can not only generate images but also describe them, identify landmarks, recognize text within images, and answer knowledge-based questions about what an image depicts.

Key Points:
🌟 DeepSeek releases the Janus-Pro multimodal large model, entering the text-to-image field.
📈 In benchmark tests, Janus-Pro-7B outperforms popular models such as OpenAI's DALL-E 3.
✅ Janus-Pro is released under the MIT open-source license, which permits commercial use.

This article is from AIbase Daily


