GPT-4o mini is a compact, high-efficiency variant of OpenAI's multimodal GPT-4o model, ideal for applications that require low latency, cost-effective performance, and large-context processing. It excels in real-time scenarios such as customer support chat, is inexpensive enough to fan out across parallel API calls, and handles long inputs such as entire codebases or detailed conversation histories. The API currently supports text and vision, and GPT-4o mini is designed to expand to full multimodal capabilities, including image, audio, and video inputs and outputs, in future updates. With a 128K-token context window, improved non-English handling via GPT-4o's shared tokenizer, and a knowledge cutoff of October 2023, GPT-4o mini is a versatile model built for scalable and responsive AI solutions.
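For example, the vision capability can be exercised by passing an image reference alongside text in a single chat message. The sketch below assumes the official OpenAI Python SDK and an OPENAI_API_KEY environment variable; the image URL is a placeholder.

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# Combine a text prompt with an image reference in one user message.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe what is shown in this image."},
                {
                    "type": "image_url",
                    "image_url": {"url": "https://example.com/photo.jpg"},  # placeholder URL
                },
            ],
        }
    ],
    max_tokens=300,
)

print(response.choices[0].message.content)
```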
Per-provider metrics (provider, context size, max output, latency, speed, cost) reflect historical performance over recent days.
API Usage
Seamlessly integrate our API into your project in a few simple steps; a minimal example is sketched below.
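The sketch assumes the official OpenAI Python SDK and an OpenAI-compatible chat completions endpoint; the api_key and base_url are placeholders to replace with your provider's values (or omit base_url and set OPENAI_API_KEY when calling OpenAI directly).

```python
# pip install openai
from openai import OpenAI

# Placeholder credentials and endpoint; substitute your provider's values.
client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.example.com/v1",  # hypothetical OpenAI-compatible endpoint
)

response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "You are a helpful support assistant."},
        {"role": "user", "content": "Summarize my last three support tickets."},
    ],
    max_tokens=256,   # cap the response length
    temperature=0.2,  # keep answers focused for support-style tasks
)

print(response.choices[0].message.content)
```

Because GPT-4o mini is optimized for low latency and cost, a common pattern is to issue many such calls in parallel when processing batches of short requests, for example with asyncio and the SDK's AsyncOpenAI client.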