Diffusion as Shader
A unified architectural model supporting various video generation control tasks.
Tags: Video, Video generation, 3D perception
Diffusion as Shader (DaS) is a video generation control model that achieves diverse control through a 3D-aware diffusion process. It takes 3D tracking videos as control inputs and supports multiple control tasks under a single architecture, including mesh-to-video generation, camera control, motion transfer, and object manipulation. The main advantage of DaS is its 3D awareness, which significantly improves the temporal consistency of generated videos and delivers strong controllability with minimal data and short fine-tuning times. Developed collaboratively by research teams from institutions including the Hong Kong University of Science and Technology, the model aims to advance video generation technology and provide more flexible, efficient solutions for fields such as filmmaking and virtual reality.
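The core idea described above — conditioning a video diffusion model on a rendered 3D tracking video — can be sketched in miniature. The sketch below is a toy illustration, not the DaS implementation: the tensor shapes, the random-projection "encoder", and the additive, ControlNet-style injection of the condition are all simplifying assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy latent shapes: (frames, channels, height, width). Purely illustrative.
F, C, H, W = 8, 4, 16, 16

def encode_tracking_video(tracking_rgb):
    """Project a rendered 3D tracking video (colored 3D point trajectories)
    into the latent channel count so it can condition the denoiser.
    A fixed random projection stands in for a learned encoder."""
    proj = rng.standard_normal((tracking_rgb.shape[1], C)) * 0.1
    return np.einsum('fchw,cd->fdhw', tracking_rgb, proj)

def denoise_step(latents, cond, t):
    """One toy denoising step. The tracking condition is injected
    additively into the noise prediction, ControlNet-style; the linear
    map here is a stand-in for the actual denoising network."""
    predicted_noise = 0.5 * latents + cond
    return latents - (1.0 / t) * predicted_noise

tracking = rng.standard_normal((F, 3, H, W))  # rendered 3D tracks (RGB)
cond = encode_tracking_video(tracking)
latents = rng.standard_normal((F, C, H, W))   # noisy video latents

# Every frame's update sees the same 3D track features, which is what
# ties the frames together temporally in this toy picture.
for t in range(10, 0, -1):
    latents = denoise_step(latents, cond, t)

print(latents.shape)  # (8, 4, 16, 16)
```

Because the same 3D tracking condition constrains every frame, different control tasks (camera moves, motion transfer, object manipulation) reduce to producing different tracking videos for the same conditioning interface — which is how a single architecture can cover all of them.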