Alibaba's Image Generation Model Qwen2vl-Flux Open Sourced, Supporting Image Fusion and Style Transfer

AIbase基地

Published in AI News · 4 min read · Nov 27, 2024


Alibaba recently announced the open-source release of its latest image generation model, Qwen2vl-Flux. The model supports image editing, merging, and blending, and it can also generate entirely new images that closely resemble the images or text a user provides.

Qwen2vl-Flux offers image transformation (image variation) capabilities. A user can supply a single image with no text prompt, and the model generates multiple similar images based on the original. For example, given a photo of a person, the model can produce several renditions of that person from different angles, showing varied perspectives and emotions.

The model also supports text-guided image blending. When a user supplies an image together with a text prompt, Qwen2vl-Flux combines the input image with the text content to create new visual effects.

Qwen2vl-Flux also supports image-guided image blending. Users can combine two different images to create character merges or scene transitions. For example, given a character and a separate background, the model can blend the two seamlessly into a new image.

The model's grid style transfer feature gives users fine-grained control over an image. Users can modify specific parts of an image rather than the whole. For instance, in an image that combines high technology with a natural environment, users can add bioluminescent technology details or a morning-mist effect in the forest to enrich the result.

Project link: https://huggingface.co/Djrango/Qwen2vl-Flux?continueFlag=3e2a3aabe53334260b255e6d52dad793
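For readers who want to fetch the model, the project link above can be reduced to a plain repository identifier. The following is a minimal sketch using only the Python standard library; the helper names `repo_id_from_link` and `clean_link` are hypothetical, and treating `continueFlag` as a removable query parameter is an assumption based on its position in the URL, not something the article states.

```python
from urllib.parse import urlsplit, urlunsplit

# Project link exactly as given in the article.
PROJECT_LINK = (
    "https://huggingface.co/Djrango/Qwen2vl-Flux"
    "?continueFlag=3e2a3aabe53334260b255e6d52dad793"
)


def repo_id_from_link(link: str) -> str:
    """Return the 'owner/name' part of a Hugging Face model URL (hypothetical helper)."""
    parts = [p for p in urlsplit(link).path.split("/") if p]
    if len(parts) < 2:
        raise ValueError(f"not a model repository URL: {link}")
    return "/".join(parts[:2])


def clean_link(link: str) -> str:
    """Drop the query string and fragment, keeping scheme, host, and path."""
    s = urlsplit(link)
    return urlunsplit((s.scheme, s.netloc, s.path, "", ""))


if __name__ == "__main__":
    print(repo_id_from_link(PROJECT_LINK))
    print(clean_link(PROJECT_LINK))
```

If the `huggingface_hub` package is installed, the resulting identifier could presumably be passed to `huggingface_hub.snapshot_download(repo_id=...)` to download the repository files; the article itself does not describe an installation procedure.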

Key Points:

🌟 Qwen2vl-Flux is open-source and possesses powerful image generation and editing capabilities.

🖼️ Supports image transformation and text-guided image blending to create new visual effects.

🔍 Provides image-guided image blending and grid style transfer, giving users fine-grained control.


This article is from AIbase Daily
