Can You Lose 90% Weight and Still Fly? ControlNeXt Lets Iron Man Dance with Beautiful Moves with Finger Precision

AIbase基地

Published inAI News · 3 min read · Aug 18, 2024

251

The latest release from the team of Professor Jia JiaYa at CUHK, ControlNeXt, is nothing short of a "weight-loss miracle" in the AI world! This open-source image/video generation guidance tool is not only compact in size, perfectly compatible with common models from the Stable Diffusion family such as SDXL, SD1.5, etc., but also plug-and-play, significantly simplifying the usage process.

ControlNeXt supports a variety of control modes, including edge guidance, pose control, masking, and depth of field control. It can even make Iron Man perform a beautiful dance with precise movements down to the fingers, showcasing its powerful control capabilities.

The "weight-loss secret" of ControlNeXt lies in its clever removal of the "glutton" control branches from ControlNet, replacing them with a "light meal" consisting of a few ResNet blocks. This compact module, although only one-tenth the size of the original, can perfectly extract features of various control conditions.

QQ截图20240818145321.png

Moreover, ControlNeXt is a "learning prodigy." It can master new skills in just 400 steps, whereas ControlNet requires several thousand steps. In terms of generation speed, ControlNeXt is unrivaled, only adding a 10.4% delay, compared to the 41.9% delay of ControlNet.

Another "signature move" of ControlNeXt is cross-normalization. This technique is like throwing a "networking party" for the features, aligning their data distributions to avoid sensitivity to parameter initialization, and allowing control conditions to take effect early in the training process.

ControlNeXt is like the "Transformer" in the AI world, compact and flexible yet powerful. It can not only make two-dimensional girls perfectly align with control lines but also create characters from different dimensions with unique styles. With this miraculous tool, we can soon expect to see more astonishing AI art works!

Project Homepage: https://pbihao.github.io/projects/controlnext/index.html

ControlNeXt AI Field Weight Loss Tool StableDiffusion

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Synthesia 3.0 Major Update: Introducing Video Avatars for Real-Time Interaction with Viewers, Dialogue, and Q&A

Synthesia launches version 3.0 of its video avatar platform, featuring a core new function called "Video Avatar." These virtual avatars can interact with viewers in real-time during videos, including dialogue, answering questions, and raising queries, and can access company-specific information, significantly enhancing the practicality and realism of scenarios such as corporate training and customer service.

Oct 6, 2025

Major Policy Adjustment by Meta: User Conversations with AI Assistants Will Be Used for Advertising and Content Delivery Across the Platform

Meta announced that starting December 16, 2025, all text or voice conversations between users and Meta AI will be integrated into its advertising and content algorithms. This means that interactions in AI chats will directly influence the ads, posts, and group content that users see on platforms such as Facebook and Instagram. For example, after discussing hiking, users will receive more related ads and content in their feeds.

Oct 6, 2025

Anthropic Language Model Emerges as a New Force in Cybersecurity: Claude 4.5 Demonstrates Significant Improvement in Vulnerability Discovery

Anthropic demonstrates breakthroughs of its large language model in the field of cybersecurity. The latest Claude Sonnet4.5 has a 5% probability of discovering software vulnerabilities, a significant increase from 2% in its predecessor Sonnet4. It has been proven through CyberGym tests that AI can efficiently enhance network defense, highlighting the potential of technological advancements.

Oct 6, 2025

Computing Bottlenecks and Privacy Dilemmas: OpenAI's New AI Device May Be Delayed

OpenAI collaborates with LoveFrom to develop AI hardware, aiming to surpass Echo with proactive learning and natural integration, inspired by 'Her'.....

Oct 6, 2025

Google Makes a Big Move! Gemini CLI Integrates with MCP, Developers Say Goodbye to Configuration Hell

Google's open-source tool Gemini CLI is deeply integrated with the FastMCP framework, allowing developers to install and configure an MCP server with a single command, significantly simplifying the tedious development process that traditionally required manual environment configuration, dependency handling, and debugging of transmission channels.

Oct 4, 2025

330

The AI Design Tool Invested by Sequoia Has Fallen! Acquired by Perplexity and Shut Down 90 Days Later

Visual Electric, an AI design startup, was acquired by Perplexity. The product will shut down in 90 days, with the team joining Perplexity's new 'Agent Experience' division. Deal terms undisclosed.....

Oct 4, 2025

120

How Developers Can Use Apple's Local AI Models in iOS 26

iOS 26's Foundation Models enables offline AI model usage, boosting apps like Lil Artist with AI storytelling features.....

Oct 4, 2025

110

OpenAI New App Sora Rises to the Top of Apple App Store in Four Days

OpenAI's new video generation app, Sora, topped the Apple App Store free chart within four days of its release, surpassing Google Gemini and its own ChatGPT. The app allows users to generate, edit, and share short videos. It is currently available for invitation-only testing by iOS users in the US and Canada. Market reactions indicate strong demand for AI video tools.

Oct 4, 2025

260

Mickey Mouse Goes Offline! Character.AI Faces Legal Letter from Disney, Removes All Disney-related Characters

Disney sent a legal notice to Character.AI, demanding removal of Mickey Mouse and other characters, citing copyright infringement. The characters were removed within 24 hours. Disney accused the company of exploiting its century-old brand reputation.....

Oct 3, 2025

190

Free Browser Add-on is Here! Perplexity Brings Comet, Which Costs $200/Month, to Everyone. An AI Assistant That Helps You Browse the Web, Write Emails, Book Tickets, and Compare Prices is Now Available

Perplexity's free AI browser Comet features a sidebar assistant for multi-tasking like flight comparisons and email replies without tab-switching. Initially paywalled, its global free launch caused a download surge, briefly crashing servers. Designed to boost efficiency and reduce user workload.....

Oct 3, 2025

250

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

Can You Lose 90% Weight and Still Fly? ControlNeXt Lets Iron Man Dance with Beautiful Moves with Finger Precision

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Synthesia 3.0 Major Update: Introducing Video Avatars for Real-Time Interaction with Viewers, Dialogue, and Q&A

Major Policy Adjustment by Meta: User Conversations with AI Assistants Will Be Used for Advertising and Content Delivery Across the Platform

Anthropic Language Model Emerges as a New Force in Cybersecurity: Claude 4.5 Demonstrates Significant Improvement in Vulnerability Discovery

Computing Bottlenecks and Privacy Dilemmas: OpenAI's New AI Device May Be Delayed

Google Makes a Big Move! Gemini CLI Integrates with MCP, Developers Say Goodbye to Configuration Hell

The AI Design Tool Invested by Sequoia Has Fallen! Acquired by Perplexity and Shut Down 90 Days Later

How Developers Can Use Apple's Local AI Models in iOS 26

OpenAI New App Sora Rises to the Top of Apple App Store in Four Days

Mickey Mouse Goes Offline! Character.AI Faces Legal Letter from Disney, Removes All Disney-related Characters

Free Browser Add-on is Here! Perplexity Brings Comet, Which Costs $200/Month, to Everyone. An AI Assistant That Helps You Browse the Web, Write Emails, Book Tickets, and Compare Prices is Now Available

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services​

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

Can You Lose 90% Weight and Still Fly? ControlNeXt Lets Iron Man Dance with Beautiful Moves with Finger Precision

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Synthesia 3.0 Major Update: Introducing Video Avatars for Real-Time Interaction with Viewers, Dialogue, and Q&A

Major Policy Adjustment by Meta: User Conversations with AI Assistants Will Be Used for Advertising and Content Delivery Across the Platform

Anthropic Language Model Emerges as a New Force in Cybersecurity: Claude 4.5 Demonstrates Significant Improvement in Vulnerability Discovery

Computing Bottlenecks and Privacy Dilemmas: OpenAI's New AI Device May Be Delayed

Google Makes a Big Move! Gemini CLI Integrates with MCP, Developers Say Goodbye to Configuration Hell

The AI Design Tool Invested by Sequoia Has Fallen! Acquired by Perplexity and Shut Down 90 Days Later

How Developers Can Use Apple's Local AI Models in iOS 26

OpenAI New App Sora Rises to the Top of Apple App Store in Four Days

Mickey Mouse Goes Offline! Character.AI Faces Legal Letter from Disney, Removes All Disney-related Characters

Free Browser Add-on is Here! Perplexity Brings Comet, Which Costs $200/Month, to Everyone. An AI Assistant That Helps You Browse the Web, Write Emails, Book Tickets, and Compare Prices is Now Available

GEO Services