Model-Quantization

Public

Quantization is a technique to reduce the computational and memory costs of running inference by representing the weights and activations with low-precision data types like 8-bit integer (int8) instead of the usual 32-bit floating point (float32).
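
The description above can be made concrete with a minimal sketch of one common scheme, affine (scale plus zero-point) quantization from float32 to int8. This is an illustrative assumption, not necessarily the method this repository uses; the function names `quantize_int8` and `dequantize` are hypothetical, and only the Python standard library is used.

```python
from array import array
import random

QMIN, QMAX = -128, 127  # representable range of a signed 8-bit integer


def quantize_int8(values):
    """Map float values to int8 with an affine (scale + zero-point) scheme."""
    lo = min(min(values), 0.0)  # include 0.0 so that zero stays exactly representable
    hi = max(max(values), 0.0)
    scale = (hi - lo) / (QMAX - QMIN) or 1.0  # fall back to 1.0 for all-zero input
    zero_point = round(QMIN - lo / scale)
    # Round to the nearest integer step, shift by the zero point, clamp to int8.
    q = array("b", (max(QMIN, min(QMAX, round(v / scale) + zero_point)) for v in values))
    return q, scale, zero_point


def dequantize(q, scale, zero_point):
    """Recover approximate float32 values from the int8 representation."""
    return array("f", ((v - zero_point) * scale for v in q))


random.seed(0)
weights = array("f", (random.gauss(0.0, 1.0) for _ in range(16)))  # stand-in for model weights

q, scale, zero_point = quantize_int8(weights)
restored = dequantize(q, scale, zero_point)
max_error = max(abs(a - b) for a, b in zip(weights, restored))

print("bytes per value:", weights.itemsize, "->", q.itemsize)
print("max round-trip error:", max_error, "(quantization step:", scale, ")")
```

Each stored value shrinks from 4 bytes to 1, at the cost of a round-trip error on the order of one quantization step. Production toolchains typically add per-channel scales and calibration data on top of this basic idea.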

ml model-quantization quantization

Created: 2023-08-07T12:38:21

Updated: 2025-02-05T11:01:43

Related projects

TensorFlow

deep-learning

An Open Source Machine Learning Framework for Everyone

192497

2 years ago

+14 today

Transformers

Hot

bert

Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

152763

2 years ago

+57 today

Generative AI for Beginners

21 Lessons, Get Started Building with Generative AI: https://microsoft.github.io/generative-ai-for-beginners/

102038

9 months ago

+28 today

LLMs From Scratch

Hot

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

79046

1 year ago

+69 today

vLLM

Hot

amd

A high-throughput and memory-efficient inference and serving engine for LLMs

63529

1 year ago

+80 today

LLaMA Factory

Hot

agent

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

62793

8 months ago

+86 today

YOLOv5

coreml

YOLOv5 in PyTorch > ONNX > CoreML > TFLite

56096

1 year ago

+20 today

Made With ML

data-engineering

Learn how to design, develop, deploy and iterate on production-grade ML applications.

44466

1 year ago

+16 today

TTS

deep-learning

A deep learning toolkit for Text-to-Speech, battle-tested in research and production

43524

10 months ago

+9 today

ColossalAI

Making large AI models cheaper, faster and more accessible

41252

8 months ago

+2 today
