Nemotron-Mini-4B-Instruct
A compact language model designed for role-playing, retrieval-augmented generation, and function invocation.
CommonProductProductivityCompact language modelDistillation
Nemotron-Mini-4B-Instruct is a compact language model developed by NVIDIA, optimized through distillation, pruning, and quantization for improved speed and ease of deployment on devices. It is a fine-tuned version of nvidia/Minitron-4B-Base, derived from Nemotron-4 15B via NVIDIA's large language model compression techniques. This instructional model is optimized for role-playing, retrieval-augmented question answering (RAG QA), and function invocation, supporting a context length of 4096 tokens and ready for commercial use.
Nemotron-Mini-4B-Instruct Visit Over Time
Monthly Visits
20899836
Bounce Rate
46.04%
Page per Visit
5.2
Visit Duration
00:04:57