Hermes 3 Llama 405B Instruct
Hermes 3 405B is a full-parameter finetune of the Llama 3.1 405B foundation model. Compared with Hermes 2, it offers major gains in multi-turn dialogue, role-play realism, code generation, and structured outputs—including reliable function calling. The model features advanced agentic capabilities and strong steering hooks, giving users fine-grained control while preserving long-context consistency. Designed as a versatile generalist assistant, Hermes 3 405B aligns closely with user intent across a wide range of tasks.
Provider | Context Size | Max Output | Latency | Speed | Cost

Data reflects historical performance over the past days.

API Usage

Integrate the API into your project in two steps:

  1. Generate your API key from your profile.
  2. Copy the example code and replace the placeholder with your API key. See our documentation for further details.

You can choose from three automatic provider selection preferences:

  • speed – Prioritizes the provider with the fastest response time.
  • cost – Selects the most cost-efficient provider.
  • balanced – Offers an optimal mix of speed and cost.
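
The three preferences above can be illustrated with a minimal sketch. It is not the service's actual routing logic, which is not documented here. The `Provider` class, the provider names, the `latency_s` and `cost_per_mtok` fields, and the scoring rule for `balanced` (the sum of min-max normalized latency and cost) are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provider:
    # All fields are hypothetical; the real service may track other metrics.
    name: str
    latency_s: float      # assumed: typical response latency in seconds
    cost_per_mtok: float  # assumed: price per million tokens


def _normalize(value: float, lo: float, hi: float) -> float:
    """Scale value into [0, 1]; return 0.0 when every value is identical."""
    return 0.0 if hi == lo else (value - lo) / (hi - lo)


def select_provider(providers: list[Provider], preference: str = "balanced") -> Provider:
    """Pick a provider according to a 'speed', 'cost', or 'balanced' preference."""
    if not providers:
        raise ValueError("no providers available")
    if preference == "speed":
        # Lowest latency wins.
        return min(providers, key=lambda p: p.latency_s)
    if preference == "cost":
        # Lowest price wins.
        return min(providers, key=lambda p: p.cost_per_mtok)
    if preference == "balanced":
        # Assumed rule: equal-weight sum of normalized latency and cost.
        latencies = [p.latency_s for p in providers]
        costs = [p.cost_per_mtok for p in providers]

        def score(p: Provider) -> float:
            return (_normalize(p.latency_s, min(latencies), max(latencies))
                    + _normalize(p.cost_per_mtok, min(costs), max(costs)))

        return min(providers, key=score)
    raise ValueError(f"unknown preference: {preference!r}")


# Made-up sample data, used only to exercise the function.
providers = [
    Provider("alpha", latency_s=0.4, cost_per_mtok=5.0),
    Provider("beta", latency_s=1.2, cost_per_mtok=1.0),
    Provider("gamma", latency_s=0.6, cost_per_mtok=2.0),
]

for pref in ("speed", "cost", "balanced"):
    print(pref, "->", select_provider(providers, pref).name)
```

With this sample data, `speed` picks the lowest-latency provider and `cost` the cheapest, while `balanced` picks a provider that is neither the fastest nor the cheapest but scores best on the combined measure. A real router would likely weight the two metrics differently or use live measurements.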
