nemotron-mini

nemotron-mini:latest

83.5K Downloads Updated 8 months ago

A commercial-friendly small language model by NVIDIA optimized for roleplay, RAG QA, and function calling.

tools 4b

Updated 8 months ago

8 months ago

ed76ab18784f · 2.7GB

model

archnemotron

parameters4.19B

quantizationQ4_K_M

2.7GB

template

{{- if (or .Tools .System) }}<extra_id_0>System {{ if .System }}{{ .System }} {{ end }} {{- if .To

773B

license

NVIDIA AI Foundation Models Community License Agreement IMPORTANT NOTICE – PLEASE READ AND AGREE B

15kB

Readme

Nemotron-Mini-4B-Instruct is a model for generating responses for roleplaying, retrieval augmented generation, and function calling. It is a small language model (SLM) optimized through distillation, pruning and quantization for speed and on-device deployment.

This instruct model is optimized for roleplay, RAG QA, and function calling in English. It supports a context length of 4,096 tokens. This model is ready for commercial use.

References

Blog

HuggingFace