
Edge-LLM

Public

Qwen2.5-3B quantized with GPTQ, reducing model size from 5.75 GB to 1.93 GB and improving inference speed. Suited to efficient edge AI deployments.
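
To put the reported sizes in perspective, here is a minimal sketch that computes the compression ratio and percentage reduction from the two figures above. The function name `compression_stats` is hypothetical and not part of the Edge-LLM project; only the 5.75 GB and 1.93 GB values come from the description.

```python
def compression_stats(original_gb: float, quantized_gb: float) -> tuple[float, float]:
    """Return (compression ratio, percent size reduction) for two model sizes.

    Hypothetical helper for illustration only; not taken from the project.
    """
    if original_gb <= 0 or quantized_gb <= 0:
        raise ValueError("sizes must be positive")
    ratio = original_gb / quantized_gb
    reduction_pct = (original_gb - quantized_gb) / original_gb * 100
    return ratio, reduction_pct


# Sizes as reported in the project description.
ratio, reduction_pct = compression_stats(5.75, 1.93)
print(f"{ratio:.2f}x smaller, {reduction_pct:.1f}% size reduction")
```

With the reported figures this works out to roughly a threefold reduction in size, or about two thirds of the original footprint removed.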

Created: 2025-03-30T22:34:05
Updated: 2025-04-04T00:59:34
Stars: 0
Stars increase: 0
