The Mistral-Nemo-Instruct-2407 Large Language Model is an instruct fine-tuned version of the Mistral-Nemo-Base-2407, a 12B pretrained generative text model. Trained jointly by Mistral AI and NVIDIA, it significantly outperforms existing models smaller or similar in size. The FP8 quantization decreases the model's memory requirements and enhances its speed, with minimal impact on accuracy.
For instructions on accessing this model or initializing it via API, please refer to our docs.