FLOAT is an audio-driven avatar video generation method built on a flow matching generative model. It shifts generative modeling from a pixel-based latent space to a learned motion latent space, enabling the design of temporally consistent motion. The method introduces a transformer-based vector field predictor together with a simple yet effective frame-wise conditioning mechanism. Additionally, FLOAT supports speech-driven emotion enhancement, enabling the natural incorporation of expressive motion. Extensive experiments demonstrate that FLOAT outperforms existing audio-driven avatar methods in visual quality, motion fidelity, and efficiency.
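To make the core idea concrete, the sketch below illustrates generic conditional flow matching over a sequence of motion latents: training pairs interpolate between Gaussian noise and data along a straight path (with the constant velocity as the regression target), and sampling integrates the learned vector field from noise to data. This is a minimal NumPy illustration, not FLOAT's implementation; the transformer predictor is replaced by a dummy vector field, and all sizes and names (`T`, `d`, `audio_cond`) are assumptions for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: T frames of d-dimensional motion latents (illustrative only).
T, d = 10, 8

def fm_training_pair(x1, rng):
    """Build one flow-matching training example for a motion-latent sequence x1.

    x_t interpolates between Gaussian noise x0 and data x1; the regression
    target is the constant velocity x1 - x0 of the straight path.
    """
    x0 = rng.standard_normal(x1.shape)   # noise endpoint
    t = rng.uniform()                    # one time value for the whole clip
    xt = (1.0 - t) * x0 + t * x1         # point on the straight path
    target_v = x1 - x0                   # conditional vector field target
    return t, xt, target_v

def euler_sample(v_field, cond, steps, rng):
    """Integrate dx/dt = v(x, t, cond) from t=0 (noise) to t=1 (motion latents)."""
    x = rng.standard_normal((T, d))
    dt = 1.0 / steps
    for i in range(steps):
        t = i * dt
        x = x + dt * v_field(x, t, cond)
    return x

def dummy_field(x, t, cond):
    # Stand-in for the transformer predictor: drives x toward cond (illustration only).
    return cond - x

audio_cond = rng.standard_normal((T, d))   # stand-in for per-frame audio features
t, xt, v = fm_training_pair(rng.standard_normal((T, d)), rng)
sample = euler_sample(dummy_field, audio_cond, steps=100, rng=rng)
print(np.abs(sample - audio_cond).mean())  # residual gap shrinks as steps grow
```

In a real model, `dummy_field` would be the transformer-based vector field predictor, conditioned frame-by-frame on audio features, and the training loss would be the mean squared error between its prediction at `(xt, t)` and `target_v`.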