OpenAI: GPT-4o-mini

openai/gpt-4o-mini

Created Jul 18, 2024
128,000 context
$0.15/M input tokens
$0.6/M output tokens
$0.217/K input images

GPT-4o mini is OpenAI's newest model after GPT-4 Omni, supporting both text and image inputs with text outputs.

It is OpenAI's most advanced small model: many times more affordable than other recent frontier models and more than 60% cheaper than GPT-3.5 Turbo, while maintaining state-of-the-art (SOTA) intelligence.

GPT-4o mini scores 82% on MMLU and currently ranks higher than GPT-4 on common chat-preference leaderboards.

Check out the launch announcement to learn more.

#multimodal

Providers for GPT-4o-mini

OpenRouter routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

Context: 128K
Max Output: 16K
Input: $0.15/M tokens
Output: $0.6/M tokens
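
The listed prices translate into a per-request cost by simple arithmetic. The sketch below assumes billing is linear in tokens and images, with no minimum charge or rounding (an assumption, not something stated on this page). `estimate_cost` is a hypothetical helper, not part of any SDK.

```python
from decimal import Decimal

# Listed prices for openai/gpt-4o-mini, taken from the figures on this page.
INPUT_PRICE_PER_M = Decimal("0.15")   # USD per 1,000,000 input tokens
OUTPUT_PRICE_PER_M = Decimal("0.6")   # USD per 1,000,000 output tokens
IMAGE_PRICE_PER_K = Decimal("0.217")  # USD per 1,000 input images


def estimate_cost(input_tokens, output_tokens, input_images=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes purely linear pricing with no minimum charge or rounding.
    """
    token_cost = (
        INPUT_PRICE_PER_M * input_tokens + OUTPUT_PRICE_PER_M * output_tokens
    ) / Decimal(1_000_000)
    image_cost = IMAGE_PRICE_PER_K * input_images / Decimal(1_000)
    return token_cost + image_cost


# 10,000 input tokens and 2,000 output tokens, without and with 3 images.
print(estimate_cost(10_000, 2_000))
print(estimate_cost(10_000, 2_000, input_images=3))
```

`Decimal` is used instead of `float` so that small per-token amounts add up exactly rather than accumulating binary rounding error.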

Apps using GPT-4o-mini

Top public apps this week using this model

1. shapes inc (2.12B tokens): General purpose social agents
2. QnA3.AI (636M tokens, new)
3. Agility Writer (308M tokens): Generate ready-to-rank articles
4. ZimmWriter (280M tokens, new)
5. Cline (257M tokens): Autonomous coding agent right in your IDE
6. bothub.chat (235M tokens, new)
7. Galaxy.ai (210M tokens, new)
8. OpenRouter Ruby Client (199M tokens, new)
9. Gray Swan Arena (196M tokens, new)
10. Roo Code (168M tokens): A whole dev team of AI agents in your editor
11. liteLLM (165M tokens): Open-source library to simplify LLM calls
12. FlowData (157M tokens, new)
13. Cod3x (132M tokens, new)
14. LauncherIOS (79.5M tokens, new)

Recent activity on GPT-4o-mini

Tokens processed per day (chart: Jan 4 to Apr 4; y-axis 0 to 60B tokens)

Uptime stats for GPT-4o-mini

Uptime stats for GPT-4o-mini across all providers

When an error occurs in an upstream provider, we can recover by routing to another healthy provider, if your request filters allow it.

Learn more about our load balancing and customization options.
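
The recovery behavior described above can be pictured as trying providers in order until one succeeds. The sketch below is a generic illustration of that pattern, not OpenRouter's actual routing code. `ProviderError`, `route_with_fallback`, the `allow_fallbacks` flag, and the two demo providers are all hypothetical names introduced for the example.

```python
from __future__ import annotations

from typing import Callable, Sequence


class ProviderError(Exception):
    """Raised by a (hypothetical) provider callable when the upstream fails."""


def route_with_fallback(
    providers: Sequence[tuple[str, Callable[[dict], str]]],
    request: dict,
    allow_fallbacks: bool = True,
) -> tuple[str, str]:
    """Try providers in order; return (name, response) from the first success.

    If allow_fallbacks is False, only the first provider is attempted.
    """
    if not providers:
        raise ValueError("no providers configured")
    errors = []
    for name, call in providers:
        try:
            return name, call(request)
        except ProviderError as exc:
            errors.append(f"{name}: {exc}")
            if not allow_fallbacks:
                break
    raise ProviderError("all providers failed: " + "; ".join(errors))


# Demo with two stand-in providers: the first fails, the second answers.
def failing(request: dict) -> str:
    raise ProviderError("upstream timeout")


def healthy(request: dict) -> str:
    return "ok: " + request["model"]


name, response = route_with_fallback(
    [("primary", failing), ("backup", healthy)],
    {"model": "openai/gpt-4o-mini"},
)
print(name, response)  # prints "backup ok: openai/gpt-4o-mini"
```

Passing `allow_fallbacks=False` models a request whose filters forbid rerouting: the first failure is surfaced to the caller instead of being retried elsewhere.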

Sample code and API for GPT-4o-mini

OpenRouter normalizes requests and responses across providers for you.

OpenRouter provides an OpenAI-compatible completion API to 300+ models and providers, which you can call directly or through the OpenAI SDK. Some third-party SDKs are also available.

In the examples below, the OpenRouter-specific headers are optional. Setting them allows your app to appear on the OpenRouter leaderboards.

from openai import OpenAI

client = OpenAI(
  base_url="https://openrouter.ai/api/v1",
  api_key="<OPENROUTER_API_KEY>",
)

completion = client.chat.completions.create(
  extra_headers={
    "HTTP-Referer": "<YOUR_SITE_URL>", # Optional. Site URL for rankings on openrouter.ai.
    "X-Title": "<YOUR_SITE_NAME>", # Optional. Site title for rankings on openrouter.ai.
  },
  extra_body={},
  model="openai/gpt-4o-mini",
  messages=[
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What is in this image?"
        },
        {
          "type": "image_url",
          "image_url": {
            "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
          }
        }
      ]
    }
  ]
)
print(completion.choices[0].message.content)
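
Calling the API directly, without the SDK, might look like the sketch below, which uses only the Python standard library. It assumes the endpoint is the `base_url` from the SDK example plus the OpenAI-style `/chat/completions` path, and that the key is sent as a Bearer token, as in OpenAI-compatible APIs. `build_chat_request` and `send` are hypothetical helper names, and the example image URL is a placeholder.

```python
import json
import urllib.request

# Assumed endpoint: the SDK example's base_url plus "/chat/completions".
CHAT_COMPLETIONS_URL = "https://openrouter.ai/api/v1/chat/completions"


def build_chat_request(api_key, prompt, image_url=None):
    """Build (but do not send) a chat completion request for gpt-4o-mini."""
    content = [{"type": "text", "text": prompt}]
    if image_url is not None:
        # Same message shape as the SDK example: a text part plus an image part.
        content.append({"type": "image_url", "image_url": {"url": image_url}})
    payload = {
        "model": "openai/gpt-4o-mini",
        "messages": [{"role": "user", "content": content}],
    }
    return urllib.request.Request(
        CHAT_COMPLETIONS_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request):
    """Send the request and return the reply text (needs network and a valid key)."""
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]


req = build_chat_request(
    "<OPENROUTER_API_KEY>",
    "What is in this image?",
    image_url="https://example.com/boardwalk.jpg",  # placeholder image URL
)
print(req.get_method(), req.full_url)
```

Building the request separately from sending it keeps the payload easy to inspect; call `send(req)` with a real key to perform the request.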

Using third-party SDKs

For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.

See the Request docs for all possible fields, and Parameters for explanations of specific sampling parameters.
