SKLM Llama-3 70B is a German fine-tune of the powerful Meta-Llama-3-70B-Instruct. The base Llama-3 instruction tuned models are optimized for dialogue use cases and outperform many of the available open source chat models on common industry benchmarks, while the Sauerkraut fine-tune offers advanced support for German. The GPTQ quantization of the Llama 3 model offers a significant memory requirement reduction with a slight trade off in inference quality.
For instructions on accessing this model or initializing it via API, please refer to our docs.