Recently, Meta AI introduced quantized versions of its Llama 3.2 model, available in 1B and 3B sizes, designed for fine-tuning, distillation, and deployment across a wide range of devices.
Previously, models like Llama 3 achieved impressive results in natural language understanding and generation, but their sheer size and computational demands put them out of reach for many organizations. Long training times, high energy consumption, and dependence on expensive hardware widened the gap between tech giants and smaller businesses.
One of the key features of the quantized Llama 3.2 models is efficient multilingual text processing. After quantization, the 1B and 3B models shrink by an average of 56% and use 41% less memory, while running 2-4x faster, making them well suited to mobile devices and edge computing environments.
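A back-of-the-envelope calculation shows where savings of this magnitude come from. The sketch below uses illustrative numbers only (a round 1B parameter count, and weight storage alone); real quantized checkpoints also carry per-group scales, embeddings, and other overhead, which is why the reported average reduction is 56% rather than the raw 75% that pure 4-bit weights would suggest.

```python
# Illustrative weight-memory estimate for a quantized model.
# Numbers are assumptions for the sketch, not Meta's exact figures.
def weight_bytes(n_params: int, bits_per_weight: float) -> float:
    """Bytes needed to store n_params weights at a given bit width."""
    return n_params * bits_per_weight / 8

n_params = 1_000_000_000            # roughly the 1B model
bf16 = weight_bytes(n_params, 16)   # 16-bit floating-point baseline
int4 = weight_bytes(n_params, 4)    # 4-bit quantized weights

print(f"bf16 weights: {bf16 / 1e9:.1f} GB")  # 2.0 GB
print(f"int4 weights: {int4 / 1e9:.1f} GB")  # 0.5 GB
print(f"reduction:    {1 - int4 / bf16:.0%}")  # 75% for weights alone
```

The gap between this idealized 75% and the reported 56% average is the bookkeeping that quantization adds: scale factors, layers kept at higher precision, and metadata.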
Specifically, these models employ 8-bit and 4-bit quantization schemes, lowering the precision of weights and activations from the original 16-bit (BF16) floating-point format and thereby cutting memory and compute requirements substantially. As a result, the quantized Llama 3.2 models can run on standard consumer-grade GPUs, or even CPUs, with only a small loss in quality.
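To make the idea concrete, here is a minimal sketch of symmetric per-tensor int8 quantization, the simplest form of the 8-bit scheme described above (production systems like Llama 3.2's use more sophisticated per-group schemes; the function names here are illustrative, not Meta's API):

```python
import numpy as np

def quantize_int8(w):
    """Symmetric per-tensor int8 quantization: map floats into [-127, 127]."""
    scale = np.abs(w).max() / 127.0          # one scale for the whole tensor
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover an approximate float tensor from int8 values and the scale."""
    return q.astype(np.float32) * scale

# A toy weight matrix the size of one transformer projection.
rng = np.random.default_rng(0)
w = rng.normal(0.0, 0.02, size=(4096, 4096)).astype(np.float32)

q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)

print(f"fp32 size: {w.nbytes / 1e6:.1f} MB")   # 67.1 MB
print(f"int8 size: {q.nbytes / 1e6:.1f} MB")   # 16.8 MB
print(f"max abs error: {np.abs(w - w_hat).max():.6f}")
```

Each weight is stored as a single byte plus a shared scale, which is where the memory savings come from; the rounding error per weight is bounded by half the scale, which is why accuracy degrades only slightly.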
Users can now run intelligent applications directly on their phones, such as summarizing a discussion in real time or invoking calendar tools, all powered by these lightweight models.
Meta AI has also partnered with industry leaders like Qualcomm and MediaTek to deploy these models on Arm-based systems-on-chip, ensuring efficient execution across a wide range of devices. Early tests show that quantized Llama 3.2 achieves about 95% of the full-precision model's performance on major natural language processing benchmarks, with nearly 60% less memory usage. This matters for businesses and researchers looking to adopt AI without heavy infrastructure investment.
The quantized Llama 3.2 models launched by Meta AI are a significant step toward making AI technology more accessible, and they address core obstacles to deploying large language models, such as cost and environmental impact. This trend toward efficient models is set to drive more sustainable and inclusive AI development.
Model access: https://www.llama.com/
Key Points:
🌟 Meta AI's quantized Llama 3.2 models, available in 1B and 3B sizes, significantly reduce model size and computational resource requirements.
⚡️ Inference speed is increased 2-4x, making the models suitable for consumer-grade hardware and real-time applications.
🌍 Quantized Llama 3.2 performs almost as well as the original model on natural language processing benchmarks, enabling businesses and researchers to deploy AI applications.