LaVi-Bridge
Connects different language models and generative visual models for text-to-image generation
CommonProductImageText-to-Image GenerationLanguage Models
LaVi-Bridge is a bridge model designed for text-to-image diffusion models, enabling the connection of various pre-trained language models and generative visual models. It utilizes LoRA and adapters, providing a flexible and plug-and-play approach without modifying the weights of the original language and visual models. Compatible with a variety of language and generative visual models, it accommodates different architectures. Within this framework, we demonstrate that integrating more advanced modules (such as more sophisticated language models or generative visual models) can significantly improve capabilities like text alignment or image quality. The model has been extensively evaluated, confirming its effectiveness.
LaVi-Bridge Visit Over Time
Monthly Visits
1370
Bounce Rate
43.76%
Page per Visit
1.0
Visit Duration
00:00:00