The ModelScope community recently announced the official open-source release of CogVideoX-5B, a larger version of the domestically developed, Sora-style open-source video generation model CogVideoX.

Compared to the previous CogVideoX-2B, the new model delivers significantly better video quality and visual fidelity.


CogVideoX-5B is a large-scale DiT (diffusion transformer) model designed specifically for text-to-video generation. It employs a 3D causal variational autoencoder (3D causal VAE) and an expert Transformer architecture, fusing text and video embeddings, applying 3D-RoPE for positional encoding, and performing joint spatiotemporal modeling with a 3D full-attention mechanism.
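
To make the positional encoding concrete, below is a simplified PyTorch sketch of how a 3D rotary embedding can be applied to flattened video tokens: the attention head dimension is partitioned across the temporal, height, and width axes of the latent grid, and each slice is rotated by its own axis position. The function names and the split ratio are illustrative assumptions and do not reproduce CogVideoX's exact implementation.

```python
import torch

def rope_angles(positions, dim, base=10000.0):
    # Rotary frequencies for one axis; returns (len(positions), dim // 2) angles.
    inv_freq = 1.0 / (base ** (torch.arange(0, dim, 2, dtype=torch.float32) / dim))
    return torch.outer(positions.to(torch.float32), inv_freq)

def rotate(x, angles):
    # Apply a rotary embedding to the last dimension of x (size 2 * angles.shape[-1]).
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., 0::2], x[..., 1::2]
    out = torch.empty_like(x)
    out[..., 0::2] = x1 * cos - x2 * sin
    out[..., 1::2] = x1 * sin + x2 * cos
    return out

def apply_3d_rope(q, t, h, w):
    """q: (t*h*w, head_dim) flattened video tokens for one attention head.
    The head dimension is split across the three axes; the (1/4, 3/8, 3/8)
    ratio used here is illustrative, not the model's exact partition."""
    head_dim = q.shape[-1]
    d_t = head_dim // 4
    d_h = 3 * head_dim // 8
    d_w = head_dim - d_t - d_h

    # Per-token positions along each axis of the (t, h, w) latent grid.
    grid_t, grid_h, grid_w = torch.meshgrid(
        torch.arange(t), torch.arange(h), torch.arange(w), indexing="ij"
    )
    pos_t, pos_h, pos_w = grid_t.reshape(-1), grid_h.reshape(-1), grid_w.reshape(-1)

    q_t, q_h, q_w = q.split([d_t, d_h, d_w], dim=-1)
    q_t = rotate(q_t, rope_angles(pos_t, d_t))
    q_h = rotate(q_h, rope_angles(pos_h, d_h))
    q_w = rotate(q_w, rope_angles(pos_w, d_w))
    return torch.cat([q_t, q_h, q_w], dim=-1)

# Example: 8 latent frames of a 30x45 latent grid, 64-dim attention head.
q = torch.randn(8 * 30 * 45, 64)
q_rot = apply_3d_rope(q, t=8, h=30, w=45)
```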

Additionally, the model incorporates progressive training techniques, enabling it to generate coherent, longer-duration videos with pronounced motion.

Model Link:

https://modelscope.cn/models/ZhipuAI/CogVideoX-5b
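
For readers who want to try the model, the sketch below shows one possible way to download the weights from ModelScope and run text-to-video inference, assuming a recent version of the diffusers library that includes CogVideoX support and the modelscope package for downloading; the prompt and sampling parameters are illustrative only.

```python
import torch
from modelscope import snapshot_download
from diffusers import CogVideoXPipeline
from diffusers.utils import export_to_video

# Fetch the model weights from the ModelScope hub (link above).
model_dir = snapshot_download("ZhipuAI/CogVideoX-5b")

pipe = CogVideoXPipeline.from_pretrained(model_dir, torch_dtype=torch.bfloat16)
pipe.enable_model_cpu_offload()  # trade speed for lower GPU memory usage

# Generate a short clip from a text prompt (illustrative settings).
video = pipe(
    prompt="A panda playing an acoustic guitar in a sunlit bamboo forest",
    num_frames=49,
    num_inference_steps=50,
    guidance_scale=6.0,
).frames[0]

export_to_video(video, "cogvideox_5b_sample.mp4", fps=8)
```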