2024-11-08 13:59:34.AIbase.13.1k
Compact and Powerful! Pocket-sized Visual AI Model Moondream2: Just 1.6 Billion Parameters, Runs on Mobile Phones
Recently, a startup in Seattle named Moondream launched a compact visual language model called moondream2. Despite its small size, the model has shown outstanding performance in various benchmark tests and has garnered significant attention. As an open-source model, moondream2 is expected to enable local image recognition capabilities on smartphones. Moondream2 was officially released in March and can process both text and image inputs, possessing functions such as answering questions, text extraction (OCR), object counting, and item classification.