Mistral Nemo 2407

Instruct

The Mistral-Nemo-Instruct-2407 Large Language Model is an instruct fine-tuned version of the Mistral-Nemo-Base-2407, a 12B pretrained generative text model. Trained jointly by Mistral AI and NVIDIA, it significantly outperforms existing models smaller or similar in size. The FP8 quantization decreases the model's memory requirements and enhances its speed, with minimal impact on accuracy.

For instructions on accessing this model or initializing it via API, please refer to our docs.

Configuration

For more details about _model_provder--model_name, visit the model's page on Hugging Face.

NVIDIA L40S x 1

Slider

Context defines the maximum tokens the model can process at once. Smaller values improve speed but risk truncating input. Adjust it to balance performance and input needs.

This website requires your consent to use cookies for traffic analytics. Read more in our privacy policy.