2024-11-06 09:53:34.AIbase.13.0k
Chinese Team Releases the World's Largest Open-source Multimodal Dataset, Achieving Record Performance with 2B Parameter Model
Recently, research teams from several Chinese scientific institutions have launched the super large-scale multimodal dataset named Infinity-MM, and based on this dataset, they have trained an outstanding AI model called Aquila-VL-2B. This breakthrough injects new momentum into the development of multimodal AI. The Infinity-MM dataset is astonishing in size, containing four major types of data: 10 million image descriptions, 24.4 million general visual instruction data, 6 million selected high-quality instruction data, and 3 million generated by G