AIbase
Product LibraryTool Navigation

LoVA

Public

The code and weight for LoVA. LoVA is a novel model for Long-form Video-to-Audio generation. Based on the Diffusion Transformer (DiT) architecture, LoVA proves to be more effective at generating long-form audio compared to existing autoregressive models and UNet-based diffusion models.

Creat2024-11-27T15:58:47
Update2025-02-27T16:49:26
https://ceaglex.github.io/LoVA.github.io/
12
Stars
0
Stars Increase

Related projects