ZhipuAI has announced the open-sourcing of its video generation model, CogVideoX, aiming to accelerate the development and application of video generation technology. Built on advanced large-scale model technology, CogVideoX meets the demands of commercial-grade applications. The currently open-sourced CogVideoX-2B version requires only 18GB of GPU memory for inference at FP16 precision and 40GB for fine-tuning, so it can run inference on a single RTX 4090 and be fine-tuned on a single A6000.
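
For reference, the sketch below shows what FP16 inference might look like through the Hugging Face diffusers integration of CogVideoX-2B; the prompt, sampling parameters, and memory-saving calls are illustrative assumptions, so consult the repository README for the exact, up-to-date API.

```python
import torch
from diffusers import CogVideoXPipeline
from diffusers.utils import export_to_video

# Load the 2B checkpoint in FP16, matching the memory figures quoted above
pipe = CogVideoXPipeline.from_pretrained(
    "THUDM/CogVideoX-2b",
    torch_dtype=torch.float16,
)

# Optional memory savings so the pipeline fits on a single consumer GPU
pipe.enable_model_cpu_offload()
pipe.vae.enable_tiling()

# Illustrative prompt and sampling settings
video = pipe(
    prompt="A panda playing guitar in a bamboo forest",
    num_inference_steps=50,
    guidance_scale=6.0,
).frames[0]

export_to_video(video, "output.mp4", fps=8)
```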

CogVideoX employs a 3D Variational Autoencoder (3D VAE), which compresses the spatial and temporal dimensions of a video simultaneously through 3D convolutions, achieving higher compression rates and better reconstruction quality. The model comprises an encoder, a decoder, and a latent space regularizer, with temporally causal convolutions ensuring that information flows only forward in time. On top of this, an expert Transformer processes the encoded video latents together with text inputs to generate high-quality video content.
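
As a rough illustration of the temporally causal convolution mentioned above, the following sketch pads only toward the past on the time axis so that frame t never sees later frames; layer names and sizes are assumptions for illustration, not the actual CogVideoX implementation, and the real encoder additionally downsamples in space and time.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CausalConv3d(nn.Module):
    """3D convolution that pads only toward the past on the time axis,
    so each output frame depends solely on current and earlier frames."""
    def __init__(self, in_ch, out_ch, kernel_size=3):
        super().__init__()
        self.time_pad = kernel_size - 1      # all temporal padding on the "past" side
        self.space_pad = kernel_size // 2    # symmetric padding in H and W
        self.conv = nn.Conv3d(in_ch, out_ch, kernel_size)

    def forward(self, x):                    # x: (B, C, T, H, W)
        x = F.pad(
            x,
            (self.space_pad, self.space_pad,   # W
             self.space_pad, self.space_pad,   # H
             self.time_pad, 0),                # T: causal, pad past only
        )
        return self.conv(x)

# Toy example: a 16-frame clip passed through two causal 3D conv layers
x = torch.randn(1, 3, 16, 64, 64)
y = nn.Sequential(CausalConv3d(3, 8), nn.SiLU(), CausalConv3d(8, 8))(x)
print(y.shape)  # time dimension preserved, no leakage from future frames
```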


To train CogVideoX, ZhipuAI developed a method for screening high-quality video data, excluding videos that are heavily edited or have inconsistent motion, to ensure the quality of the training set. In addition, a pipeline from image captioning to video captioning was built to compensate for the lack of textual descriptions in video data.
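
A hedged sketch of that image-caption-to-video-caption idea is shown below: caption sampled frames with an off-the-shelf image captioner, then merge the frame captions into a single video-level description. The specific models and function names here are assumptions for illustration, not ZhipuAI's actual pipeline.

```python
from transformers import pipeline

# Off-the-shelf components chosen for illustration only
frame_captioner = pipeline("image-to-text", model="Salesforce/blip-image-captioning-base")
summarizer = pipeline("summarization", model="facebook/bart-large-cnn")

def caption_video(frame_paths):
    # 1. Dense per-frame captions from an image captioning model
    frame_captions = [frame_captioner(p)[0]["generated_text"] for p in frame_paths]
    # 2. Merge the frame captions into one video-level description
    merged = " ".join(frame_captions)
    return summarizer(merged, max_length=60, min_length=15)[0]["summary_text"]

print(caption_video(["frame_000.jpg", "frame_008.jpg", "frame_016.jpg"]))
```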

In performance evaluations, CogVideoX excels on multiple metrics, including benchmarks for human actions, scenes, and dynamic level, as well as evaluation tools focused on video dynamics. ZhipuAI says it will continue to explore innovations in video generation, including new model architectures, video information compression, and the fusion of text and video content.

Code Repository:

https://github.com/THUDM/CogVideo

Model Download:

https://huggingface.co/THUDM/CogVideoX-2b

Technical Report:

https://github.com/THUDM/CogVideo/blob/main/resources/CogVideoX.pdf