AI News

AI News

Don't miss any moment of global AI innovation

AI Daily

Daily three-minute AI industry trends

AI Timeline

AI industry milestones

Al Hardware

Lists all AI hardware products.

AI Monetization Guide

Latest Cases

AI monetization case sharing

Image Collection

AI image creation monetization cases

Video Collection

AI video creation monetization cases

Audio Collection

AI audio creation monetization cases

Content Collection

AI content writing monetization cases

AI Tutorials

Latest Tutorials

Free sharing of the latest AI tutorials

AI Product Rankings

AI Product Ranking

Shows total visits ranking of AI websites

AI Traffic Growth Ranking

Track fastest growing AI websites by traffic

AI Traffic Decline Ranking

Focus on AI websites with significant traffic drops

AI Weekly Ranking

Shows weekly visits ranking of AI websites

Popular Country Rankings

United States

AI websites most popular with US users

China

AI websites most popular with Chinese users

India

AI websites most popular with Indian users

Brazil

AI websites most popular with Brazilian users

Popular Category Rankings

Image Generation

Total visits ranking of AI image generation websites

Personal Assistant

Total visits ranking of AI personal assistant websites

Character Generation

Total visits ranking of AI character generation websites

Video Generation

Total visits ranking of AI video generation websites

Popular Open Source Data Rankings

AI Project Ranking

GitHub popular AI projects by total stars

AI Project Growth Ranking

GitHub popular AI projects by growth rate

AI Developer Ranking

GitHub popular AI developer ranking

AI Organization Ranking

GitHub popular AI organization ranking

Popular Open Source Categories

Deepseek

GitHub popular deepseek open source projects

TTS

GitHub popular TTS open source projects

LLM

GitHub popular LLM open source projects

ChatGPT

GitHub popular ChatGPT open source projects

AI Open Source Project Library

Overview

Overview of GitHub popular AI open source projects

Product Library Tool Navigation

Masked Diffusion Transformer (MDT)

Masked Diffusion Transformer is the latest technology in image synthesis, achieving SOTA (State of the Art) at ICCV 2023.

CommonProductImageImageImage Synthesis

MDT explicitly enhances the ability of diffusion probability models (DPMs) to learn relationships between object parts in images by introducing a masked latent model scheme. MDT operates in the latent space during training, masking certain tokens, and then designs an asymmetrical diffusion transformer to predict masked tokens from unmasked tokens while maintaining the diffusion generation process. MDTv2 further improves the performance of MDT through more efficient macro network structures and training strategies.

Masked Diffusion Transformer (MDT)

Masked Diffusion Transformer (MDT) Visit Over Time

Monthly Visits

521149929

Bounce Rate

35.96%

Page per Visit

6.1

Visit Duration

00:06:29

Masked Diffusion Transformer (MDT) Visit Trend

Masked Diffusion Transformer (MDT) Visit Geography

Masked Diffusion Transformer (MDT) Traffic Sources

Masked Diffusion Transformer (MDT) Alternatives

Masked Diffusion Transformer (MDT) — Masked Diffusion Transformer is the latest technology in image synthesis, achieving SOTA (State of the Art) at ICCV 2023.

•Image•Image Synthesis

Sana-1.6B — Linear diffusion transformer for high-resolution image synthesis

•Image Synthesis•Deep Learning

Sana — High-efficiency high-resolution image synthesis framework

•Image Synthesis•Text to Image

FILM — Frame interpolation model for large-scale action scenes

•Image•Video

d1 — Improving the reasoning capabilities of diffusion large language models using reinforcement learning.

•Reasoning•Reinforcement Learning

Wan2.1-FLF2V-14B — Open-source video generation model supporting multiple generation tasks.

ChineseSelection

•Video Generation•Deep Learning

FramePack — A next-frame prediction model for video generation.

•Video Generation•AI Technology

Liquid — A multimodal generative model integrating visual understanding and generation.

•Multimodal•Generative Model

GLM-4-32B — A powerful language model supporting various natural language processing tasks.

ChineseSelection

•Natural Language Processing•Deep Learning

Pusa — Pusa is a novel video diffusion model that supports various video generation tasks.

•Video Generation•Open Source

UNO — A tool that improves the consistency of image generation through a generative model.

•Image Generation•Open Source

VisualCloze — A general-purpose image generation framework that learns through visual context.

•Image Generation•Visual Learning

SkyReels-A2 — A framework for synthesizing any content in a video diffusion transformer.

•Video Generation•Deep Learning

MegaTTS 3 — A highly efficient speech synthesis model that supports Chinese, English, and speech cloning.

•Speech Synthesis•Deep Learning

EasyControl — Provides an efficient and flexible control framework for Diffusion Transformer.

•Diffusion Transformer•Image Generation

DreamActor-M1 — A human image animation framework based on DiT, achieving fine-grained control and long-term consistency.

•Human Animation•Video Generation

QVQ-Max — An advanced visual reasoning model that can analyze image and video content.

ChineseSelection

•Visual Reasoning•Deep Learning

Video-T1 — Significantly improves video generation quality through test-time scaling.

•Video Generation•Test-Time Scaling

RF-DETR — RF-DETR is a real-time object detection model developed by Roboflow.

•Object Detection•Deep Learning

HunYuan T1

HunYuan T1 — The industry's first ultra-large-scale hybrid Mamba reasoning model, with strong reasoning capabilities.

ChineseSelection

•Reasoning Model•Artificial Intelligence

HunYuan T1 — An industry-leading deep reasoning large model, optimized for human preferences.

ChineseSelection

•Deep Learning•Reasoning Model

InfiniteYou — Achieve flexible and high-fidelity image generation while preserving identity characteristics.

•Image Generation•Identity Preservation

Pruna — Pruna is a model optimization framework that helps developers deliver models quickly and efficiently.

•Model Optimization•Machine Learning

Long Context Tuning (LCT) — A technology that enhances scene-level video generation capabilities.

•Video Generation•Deep Learning

Thera — An aliasing-free arbitrary-scale super-resolution method.

•Super-resolution•Image processing

IMM — Inductive Moment Matching is a novel generative model for high-quality image generation.

•Generative Model•Image Generation

MIDI — Generates high-fidelity 3D scenes from a single image using a multi-instance diffusion model.

•3D Modeling•Image Processing

R1-Omni — R1-Omni is a full-modality emotion recognition model incorporating reinforcement learning, focusing on improving the interpretability of multimodal emotion recognition.

•Multimodal•Emotion Recognition

VideoPainter

VideoPainter — VideoPainter is a tool that supports video repair and editing of any length, using a text-guided plug-in framework.

•Video Repair•Text-guided

Bytedance Flux — Flux is a fast communication overlap library for tensor/expert parallelism on GPUs.

•Deep Learning•Parallel Computing