LoVA

Public

The code and weight for LoVA. LoVA is a novel model for Long-form Video-to-Audio generation. Based on the Diffusion Transformer (DiT) architecture, LoVA proves to be more effective at generating long-form audio compared to existing autoregressive models and UNet-based diffusion models.

audio-generation multimodal video-to-audio

Creat：2024-11-27T15:58:47

Update：2025-02-27T16:49:26

https://ceaglex.github.io/LoVA.github.io/

Stars

Stars Increase

Related projects

Stable Diffusion Webui

Hot

Stable Diffusion web UI

158833

1年前

+73today

Deep Live Cam

Hot

real time face swap and one-click video deepfake with only a single image

76195

10个月前

+63today

Gpt Engineer

CLI platform to experiment with codegen. Precursor to: https://lovable.dev

55097

9个月前

+10today

Anything Llm

Hot

agent-framework-javascript

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.

51978

8个月前

+113today

Llm App

chatbot

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. ?Docker-friendly.?Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

47713

1年前

+16today

LocalAI

Hot

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

39967

8个月前

+190today

AppSmith

api

Tree view based UML C# MS-SQL OpenApi Database Api table modeler - Code Generator. Saves model to files no db required. Text editors to copy and paste to/from.

38590

8个月前

+25today

Langchain Chatchat

chatbot

Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain

36740

10个月前

+21today

NewPipe

A libre lightweight streaming front-end for Android.

35866

8个月前

+39today

Retrieval Based Voice Conversion WebUI

audio-analysis

Easily train a good VC model with voice data <= 10 mins!

33271

8个月前

+37today

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Services

AI Model Compatibility Checker

AI Deployment Calculator

LoVA

Related projects

Stable Diffusion Webui

Deep Live Cam

Gpt Engineer

Anything Llm

Llm App

LocalAI

AppSmith

Langchain Chatchat

NewPipe

Retrieval Based Voice Conversion WebUI

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

LoVA

Related projects

Stable Diffusion Webui

Deep Live Cam

Gpt Engineer

Anything Llm

Llm App

LocalAI

AppSmith

Langchain Chatchat

NewPipe

Retrieval Based Voice Conversion WebUI

GEO Services