Nemotron-Mini-4B-Instruct

A compact language model designed for role-playing, retrieval-augmented generation, and function calling.

Tags: Common Product, Productivity, Compact language model, Distillation
Nemotron-Mini-4B-Instruct is a compact language model developed by NVIDIA and optimized through distillation, pruning, and quantization for faster inference and easier on-device deployment. It is a fine-tuned version of nvidia/Minitron-4B-Base, which was itself derived from Nemotron-4 15B using NVIDIA's large language model compression techniques. This instruction-tuned model is optimized for role-playing, retrieval-augmented question answering (RAG QA), and function calling. It supports a context length of 4096 tokens and is ready for commercial use.
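
Because the context length is limited to 4096 tokens, a RAG QA pipeline has to decide how many retrieved passages fit into a single prompt. The following is a minimal sketch of that budgeting step, not part of NVIDIA's tooling. The names select_passages, rough_token_count, and reserve_for_answer are hypothetical, and the whitespace-based token count is only an assumed stand-in for the model's real tokenizer.

```python
from typing import Callable, List

# Context window stated in the description above.
CONTEXT_LENGTH = 4096


def rough_token_count(text: str) -> int:
    """Assumed stand-in for a tokenizer: counts whitespace-separated words."""
    return len(text.split())


def select_passages(
    question: str,
    passages: List[str],
    reserve_for_answer: int = 512,  # hypothetical head-room for the reply
    count_tokens: Callable[[str], int] = rough_token_count,
) -> List[str]:
    """Keep passages, in ranked order, until the token budget is used up."""
    budget = CONTEXT_LENGTH - reserve_for_answer - count_tokens(question)
    chosen: List[str] = []
    for passage in passages:
        cost = count_tokens(passage)
        if cost > budget:
            break  # stop at the first passage that no longer fits
        chosen.append(passage)
        budget -= cost
    return chosen
```

In practice the count_tokens argument could be replaced with a function backed by the model's own tokenizer, for example one that returns the length of tokenizer.encode(text) from the Hugging Face transformers library, so that the budget reflects real token counts rather than word counts.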

Nemotron-Mini-4B-Instruct Visits Over Time

Monthly Visits: 19,075,321
Bounce Rate: 45.07%
Pages per Visit: 5.5
Visit Duration: 00:05:32
