AIbase
Product LibraryTool Navigation

Embedding-Quantization

Public

To make LLM faster we need faster retrieval system. Here comes Embedding Quantization. Embedding quantization is great technique to save cost on Vector DB, significantly faster retrieval while preserving retrieval performance.

Creat2024-04-30T18:48:55
Update2025-02-28T13:41:38
https://huggingface.co/spaces/SwastikM/Embedding-Quantization
6
Stars
0
Stars Increase