Aisi Technology's AIsphere Launches Video Generation Product PixVerse V2: Single Clip Up to 8 Seconds, Multi-Clip Up to 40 Seconds

AIbase基地

Published inAI News · 3 min read · Jul 25, 2024

309

AIS Technology recently unveiled its video generation product, PixVerse V2, an innovative tool based on an AI video large model aimed at helping users unleash their creative potential. PixVerse V2 adopts the Diffusion+Transformer (DiT) foundational architecture and has undergone technological innovations in multiple aspects, making video generation smoother, more consistent, and more engaging.

WeChat Screenshot_20240725084713.png

Key features include:

Spatio-temporal attention mechanism: PixVerse V2 introduces a proprietary spatio-temporal attention mechanism that enhances the perception of space and time, especially in handling complex scenes.
Text comprehension: With a multimodal model, PixVerse V2 can more accurately align text information with video information, strengthening the model's understanding and expressive capabilities.
Optimized model training: On the basis of the traditional flow model, PixVerse V2 promotes faster and better convergence of the model through weighted loss, improving overall training efficiency.
Video generation capability: PixVerse V2 supports the generation of multiple video clips at once, with a single clip reaching up to 8 seconds and multiple clips up to 40 seconds, while maintaining consistency between clips.
User-friendly features: PixVerse V2 allows for the one-click generation of 1-5 continuous video segments, with consistency in subject image, screen style, and scene elements between segments. Additionally, users can edit the generated results a second time, flexibly replacing and adjusting video content.

The AIS Technology team plans to conduct multiple iterative upgrades within the next three months to provide an even better AI video generation experience. The goal of PixVerse V2 is to make AI video creation more convenient and efficient, whether for recording daily life or telling video stories, it can be easily achieved.

AishiTechnology PixVerseV2 AIVideoLargeModel SpatiotemporalAttentionMechanism

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

OpenAI's Mental Health Safety Lead Joins Anthropic, Revealing the Debate on AI Model Emotional Defense

AI chatbots are deeply involved in human emotional lives, and addressing user psychological crises has become an urgent ethical challenge in the industry. Recently, Andrea Volonino, the former head of model policy at OpenAI, left the company to join her former supervisor at competitor Anthropic. During her time at OpenAI, she was responsible for the safety policies of GPT-4 and the next-generation reasoning models, and her departure highlights the unprecedented ethical dilemmas in the field of AI emotional interaction.

Jan 16, 2026

130

OpenAI Secret Hardware Plan! Prototype Audio Device 'Sweetpea' Exposed, Designed by Jony Ive, Targeting 50 Million Units Shipped in the First Year

The company plans to launch the AI audio device 'Sweetpea' in September 2026, targeting 40-50 million units in its first year. It features an oval metal shell, dual-capsule rear-hook design, a 2nm AI chip, and multi-modal components like EMG sensors.....

Jan 14, 2026

230

Huatu Shanding AI Breakthrough in Civil Service Exam Essay Grading: Accurate Scoring in 2 Minutes, OMO Model Redefining the Education and Training Experience

Huatu Shanding utilizes its self-developed AI technology to revolutionize the essay grading process in civil service exam training. Traditional manual grading faces pain points such as slow feedback, high costs, and inconsistent standards. This AI system transforms subjective evaluation into quantifiable and traceable intelligent assessment, promoting the education and training services towards efficiency, accuracy, and personalization.

Jan 13, 2026

180

2 Months Generates 1 Billion Images! Google Nano Banana Pro Becomes a Global Sensation with Studio-Level Image Quality

The Google Gemini3Pro image generation model has generated over 1 billion images in two months. It supports local editing, lens adjustment, and lighting control, and can output 2K/4K multi-language text images, significantly enhancing creative control.

Jan 13, 2026

190

Lightricks Open-Sources AI Video Model LTX-2 for High-Speed Audio-Visual Integration of Up to 20 Seconds

Lightricks launches LTX-2, an AI system generating 20-second HD videos with synchronized audio from text, using dual-stream architecture and 19B parameters for enhanced realism.....

Jan 12, 2026

170

GPT-5.2 Surpasses Humans! ARC-AGI-2 Sets a New Record, Triggering an Era of Excessive Capabilities: The Bottleneck of AI Lies Not in the Model, but in Humans

GPT-5.2 surpasses human average (60%) with 75% accuracy in ARC-AGI-2, marking a key breakthrough in AI general intelligence, yet highlighting the performance gap between testing and real-world application.....

Jan 12, 2026

170

GPT-5.2 Performance Exceeds Human Benchmark for the First Time: OpenAI Warns of the Era of Excessive Large Model Capabilities

OpenAI's GPT-5.2 surpasses human baseline in ARC-AGI-2, excelling in abstract reasoning and generalization, marking a step toward expert-level AI.....

Jan 12, 2026

190

Rumors of DeepSeek V4 Release During Spring Festival: Focus on AI Programming, Core Capabilities May Exceed Claude

Chinese AI company DeepSeek is about to release its new large model DeepSeek V4, focusing on enhancing code generation capabilities and targeting the competitive AI programming market.

Jan 12, 2026

270

From Electronic Frame to Family Smart Hub: Skylight Calendar 2 Released, Redefining Family Collaboration with AI

Skylight transitions from digital frames to a home digital assistant, launching Calendar2 at CES 2026. It features customizable colored frames and an AI-driven system for unified family collaboration.....

Jan 9, 2026

210

Open-Source Version of Veo 3 Is Here: LTX-2 Officially Released - Generate a 20-Second 4K AI Video with Synchronized Audio and Video in One Go - Run Smoothly on Local Graphics Cards

Lightricks open-sources LTX-2 model, enabling 20-second 4K video generation with seamless synchronization of visuals, audio, lip movements, ambient sounds, and music. Full model weights, training code, benchmarks, and toolkit are available on GitHub, receiving enthusiastic community response.....

Jan 7, 2026

240

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Brand Visibility

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

Aisi Technology's AIsphere Launches Video Generation Product PixVerse V2: Single Clip Up to 8 Seconds, Multi-Clip Up to 40 Seconds

AIbase基地

This article is from AIbase Daily

AI News Recommendations

OpenAI's Mental Health Safety Lead Joins Anthropic, Revealing the Debate on AI Model Emotional Defense

OpenAI Secret Hardware Plan! Prototype Audio Device 'Sweetpea' Exposed, Designed by Jony Ive, Targeting 50 Million Units Shipped in the First Year

Huatu Shanding AI Breakthrough in Civil Service Exam Essay Grading: Accurate Scoring in 2 Minutes, OMO Model Redefining the Education and Training Experience

2 Months Generates 1 Billion Images! Google Nano Banana Pro Becomes a Global Sensation with Studio-Level Image Quality

Lightricks Open-Sources AI Video Model LTX-2 for High-Speed Audio-Visual Integration of Up to 20 Seconds

GPT-5.2 Surpasses Humans! ARC-AGI-2 Sets a New Record, Triggering an Era of Excessive Capabilities: The Bottleneck of AI Lies Not in the Model, but in Humans

GPT-5.2 Performance Exceeds Human Benchmark for the First Time: OpenAI Warns of the Era of Excessive Large Model Capabilities

Rumors of DeepSeek V4 Release During Spring Festival: Focus on AI Programming, Core Capabilities May Exceed Claude

From Electronic Frame to Family Smart Hub: Skylight Calendar 2 Released, Redefining Family Collaboration with AI

Open-Source Version of Veo 3 Is Here: LTX-2 Officially Released - Generate a 20-Second 4K AI Video with Synchronized Audio and Video in One Go - Run Smoothly on Local Graphics Cards

AI News Recommendations

OpenAI's Mental Health Safety Lead Joins Anthropic, Revealing the Debate on AI Model Emotional Defense

OpenAI Secret Hardware Plan! Prototype Audio Device 'Sweetpea' Exposed, Designed by Jony Ive, Targeting 50 Million Units Shipped in the First Year

Huatu Shanding AI Breakthrough in Civil Service Exam Essay Grading: Accurate Scoring in 2 Minutes, OMO Model Redefining the Education and Training Experience

2 Months Generates 1 Billion Images! Google Nano Banana Pro Becomes a Global Sensation with Studio-Level Image Quality

Lightricks Open-Sources AI Video Model LTX-2 for High-Speed Audio-Visual Integration of Up to 20 Seconds

GPT-5.2 Surpasses Humans! ARC-AGI-2 Sets a New Record, Triggering an Era of Excessive Capabilities: The Bottleneck of AI Lies Not in the Model, but in Humans

GPT-5.2 Performance Exceeds Human Benchmark for the First Time: OpenAI Warns of the Era of Excessive Large Model Capabilities

Rumors of DeepSeek V4 Release During Spring Festival: Focus on AI Programming, Core Capabilities May Exceed Claude

From Electronic Frame to Family Smart Hub: Skylight Calendar 2 Released, Redefining Family Collaboration with AI

Open-Source Version of Veo 3 Is Here: LTX-2 Officially Released - Generate a 20-Second 4K AI Video with Synchronized Audio and Video in One Go - Run Smoothly on Local Graphics Cards

GEO Services