GPT 5 mini

Instruct
GPT 5 mini is designed to deliver the core instruction-following and safety-tuning benefits of GPT-5 while reducing latency and cost. As the successor to OpenAI's o4-mini model, it provides a faster and more efficient option for applications requiring well-defined tasks, precise prompts, and orchestrating tool calls in real time, such as customer support automation. This makes it an excellent choice for high-volume, cost-sensitive deployments where speed and responsiveness are critical.
Provider
Context Size
Throughput
Latency
Input Cost
Output Cost

Usage

Integrate our API into your project and let requests automatically route to the best endpoint. Just follow these simple steps:

  1. Generate your API key.
  2. Make your first request using the example code below.

For more details see our documentation.

>Enter ↵