AIbase
Product LibraryTool Navigation

NanoLLM

Public

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

Creat2024-04-01T05:38:34
Update2025-03-22T03:55:10
https://dusty-nv.github.io/NanoLLM/
255
Stars
3
Stars Increase