2025-02-21 17:23:48.AIbase.15.6k
Ali International Open Source Ovis2 Series Multimodal Large Language Model with Six Versions
Ovis2 is the latest version of the Ovis series models proposed by Alibaba's international team. Compared to the previous version 1.6, Ovis2 has significant improvements in data construction and training methods. It not only enhances the capacity density of small models but also greatly improves chain of thought (CoT) reasoning capabilities through instruction fine-tuning and preference learning. Additionally, Ovis2 introduces video and multi-image processing capabilities, and enhances multilingual abilities and OCR capabilities in complex scenarios, significantly increasing the model's practicality.