2025-01-15 08:41:23 · AIbase
Alibaba Damo Academy Launches E-commerce Multi-modal Large Model Valley 2
Alibaba Damo Academy recently launched Valley 2, a multi-modal large language model designed for e-commerce scenarios. Built on a scalable vision-language architecture, it aims to improve performance across domains and to extend what such models can do in e-commerce and short-video applications. Valley 2 uses Qwen 2.5 as its LLM backbone and SigLIP-384 as its visual encoder, combining MLP layers and convolution for efficient feature transformation.
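
To give a rough sense of why convolution can make feature transformation more efficient, the sketch below counts how many visual tokens reach the language model before and after a strided convolution. It is an illustration under stated assumptions, not Valley 2's actual configuration: the patch size (16), the convolution kernel and stride (2 and 2), and the function names are all hypothetical. Only the 384-pixel input resolution comes from the encoder's name.

```python
def conv_output_size(size: int, kernel: int, stride: int, padding: int = 0) -> int:
    """Standard output-length formula for a convolution along one axis."""
    return (size + 2 * padding - kernel) // stride + 1


def visual_token_count(image_size: int, patch_size: int,
                       conv_kernel: int, conv_stride: int) -> tuple[int, int]:
    """Return (tokens from the encoder, tokens after the convolution).

    All arguments except image_size are illustrative assumptions.
    """
    # A ViT-style encoder splits the image into a square grid of patches,
    # producing one token per patch.
    grid = image_size // patch_size
    # A strided convolution over that grid shrinks it along each axis.
    reduced = conv_output_size(grid, conv_kernel, conv_stride)
    return grid * grid, reduced * reduced


# Hypothetical settings: 384x384 input, 16-pixel patches, 2x2 conv with stride 2.
before, after = visual_token_count(384, 16, conv_kernel=2, conv_stride=2)
print(before, after)  # 576 144
```

Under these assumed settings the convolution cuts the number of visual tokens by a factor of four, which shortens the sequence the LLM backbone has to process for each image.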