Hugging Face has launched SmolVLM, a vision language model small enough to run on mobile devices and other compact hardware, yet it outperforms predecessor models that required large data centers.

The SmolVLM-256M model requires less than 1GB of GPU memory, yet its performance surpasses that of its predecessor, Idefics 80B, a model roughly 300 times its size. This marks a significant advancement in practical AI deployment.

According to Andrés Marafioti, a machine learning research engineer at Hugging Face, SmolVLM does not simply enter the market; it significantly reduces computational costs for businesses. "Idefics 80B, which we released in August 2023, was the first open-source vision language model, and SmolVLM achieves a 300-fold reduction in size while improving performance," Marafioti said in an interview with VentureBeat.

The release of SmolVLM comes at a critical time, when businesses face high computational costs in deploying AI systems. The new model is available in 256M and 500M parameter sizes and can process images and understand visual content at speeds previously out of reach for models this small. The smallest version can process 16 images per second using only 15GB of memory, making it particularly suitable for companies that need to handle large volumes of visual data. For a medium-sized company processing 1 million images per month, this translates to significant annual savings in computational costs.
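As a rough back-of-the-envelope check (plain arithmetic on the figures quoted above, not an official benchmark), the quoted throughput implies that a month's worth of 1 million images needs well under a day of single-GPU time:

```python
# Back-of-envelope estimate using the throughput quoted above (assumptions,
# not official benchmarks): 16 images/second on a single GPU with ~15GB memory.
images_per_month = 1_000_000
images_per_second = 16

seconds_needed = images_per_month / images_per_second   # 62,500 seconds
gpu_hours_per_month = seconds_needed / 3600              # ~17.4 GPU-hours

print(f"{gpu_hours_per_month:.1f} GPU-hours per month")  # -> 17.4 GPU-hours per month
```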

Additionally, IBM has partnered with Hugging Face to integrate the 256M model into its document processing software, Docling. Although IBM has abundant computational resources, using a smaller model allows it to process millions of documents more efficiently and at a lower cost.
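For context, Docling's public Python API centers on a `DocumentConverter` class; a minimal conversion call looks roughly like the sketch below. The PDF path is a hypothetical placeholder, and which vision model Docling uses internally (such as a SmolVLM-based pipeline) is a configuration detail not shown here.

```python
# Minimal Docling usage sketch (assumes `pip install docling`); the PDF path
# is a hypothetical placeholder. The underlying vision model is selected by
# Docling's pipeline configuration, not by this call.
from docling.document_converter import DocumentConverter

converter = DocumentConverter()
result = converter.convert("example_report.pdf")   # parse a local PDF

# Export the parsed document to Markdown for downstream processing
print(result.document.export_to_markdown())
```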

The Hugging Face team reduced the model size without sacrificing performance through technical innovations in both the vision and language components. They replaced the original 400M-parameter vision encoder with a 93M-parameter version and applied more aggressive token compression. These innovations enable small businesses and startups to launch sophisticated computer vision products quickly, at significantly lower infrastructure cost.
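To illustrate how lightweight this is in practice, the model can be loaded through the standard `transformers` vision-to-sequence API. A minimal sketch follows, assuming the checkpoint name `HuggingFaceTB/SmolVLM-256M-Instruct` and a local image file as placeholders:

```python
# Minimal inference sketch with Hugging Face transformers. The checkpoint name
# and image path are assumptions for illustration, not taken from the article.
from PIL import Image
from transformers import AutoProcessor, AutoModelForVision2Seq

model_id = "HuggingFaceTB/SmolVLM-256M-Instruct"   # assumed checkpoint name
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(model_id)

image = Image.open("invoice.png")  # hypothetical input image

# Build a chat-style prompt with one image and one text turn
messages = [{"role": "user",
             "content": [{"type": "image"},
                         {"type": "text", "text": "Describe this document."}]}]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)

inputs = processor(text=prompt, images=[image], return_tensors="pt")
generated_ids = model.generate(**inputs, max_new_tokens=256)
print(processor.batch_decode(generated_ids, skip_special_tokens=True)[0])
```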

The training dataset for SmolVLM includes 170 million examples, with nearly half dedicated to document processing and image captioning. These developments not only reduce costs but also open new application possibilities for businesses, raising their visual search capabilities to levels previously out of reach.

This advancement by Hugging Face challenges conventional assumptions about the relationship between model size and capability. SmolVLM demonstrates that small, efficient architectures can deliver outstanding performance, suggesting that the future of AI development may depend less on ever-larger models and more on flexible, efficient systems.

Blog post: https://huggingface.co/blog/smolervlm

Key Points:

🌟 The SmolVLM model launched by Hugging Face can run on mobile devices and outperforms the 300-times-larger Idefics 80B model.

💰 The SmolVLM model helps businesses significantly reduce computational costs, processing up to 16 images per second.

🚀 The model's technical innovations allow small businesses and startups to launch complex computer vision products quickly.