Shengshu Technology Releases Vidu 1.5 Video Generation Model, Overcoming the 'Multi-Subject Consistency' Challenge

AIbase基地

Published in AI News · 4 min read · Nov 13, 2024

Just over 100 days after Vidu's launch, Shengshu Technology has announced the release of Vidu 1.5. The company describes the new version as a world-leading advance, particularly in understanding diverse inputs and in overcoming the "consistency" challenge in video generation.

According to the company, the introduction of Vidu 1.5 marks the entry of visual models into a new "contextual" era and accelerates progress toward Artificial General Intelligence (AGI). Since its global launch, Vidu has been able to generate consistent characters by locking in facial features, addressing a key pain point in video generation. In September, Vidu introduced a "subject consistency" feature, which the company says was a global first. The feature extended facial consistency to full-body consistency and broadened the scope to any subject, including animals, objects, and virtual characters. Vidu's technical advances fall into three main areas: precise control over complex subjects, natural consistency of facial features and dynamic expressions, and multi-subject consistency.

Vidu 1.5 is said to demonstrate a new "emergence of intelligence" in visual models through its contextual learning capabilities: the model can not only understand and imagine, but also manage memory during generation. It retains the generation speed of earlier versions, producing a video in under 30 seconds. Like Large Language Models (LLMs), Vidu follows a philosophy of universality: it treats every task as a visual input and output problem, uses a single Transformer to model variable-length inputs and outputs, and derives its intelligence from compressing video data.

Beyond improving the controllability of video models, Vidu 1.5 accepts flexible, diverse inputs and generates output that stays consistent across multiple angles, multiple subjects, and multiple elements. Vidu is no longer just a high-quality, efficient video generator; it can also integrate contextual information and memory during generation, which the company presents as a significant leap in visual intelligence. On this view, visual models will gain stronger cognitive abilities and become a crucial piece of the AGI puzzle.

Try it at: www.vidu.studio

This article is from AIbase Daily