Google: Gemini Pro Vision 1.0

google/gemini-pro-vision

Created Dec 13, 202316,384 context
$0.50/M input tokens$1.50/M output tokens$2.50/K input imgs

Google's flagship multimodal model, supporting image and video in text or chat prompts for a text or code response.

See the benchmarks and prompting guidelines from Deepmind.

Usage of Gemini is subject to Google's Gemini Terms of Use.

#multimodal

Providers for Gemini Pro Vision 1.0

OpenRouter routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.

Apps using Gemini Pro Vision 1.0

Top public apps this week using this model

Recent activity on Gemini Pro Vision 1.0

Tokens processed per day

Feb 6Feb 12Feb 18Feb 24Mar 2Mar 8Mar 14Mar 20Mar 26Apr 1Apr 7Apr 13Apr 19Apr 25May 1May 701.5M3M4.5M6M

Uptime stats for Gemini Pro Vision 1.0

Uptime stats for Gemini Pro Vision 1.0 across all providers

Sample code and API for Gemini Pro Vision 1.0

OpenRouter normalizes requests and responses across providers for you.

OpenRouter provides an OpenAI-compatible completion API to 300+ models & providers that you can call directly, or using the OpenAI SDK. Additionally, some third-party SDKs are available.

In the examples below, the OpenRouter-specific headers are optional. Setting them allows your app to appear on the OpenRouter leaderboards.

from openai import OpenAI

client = OpenAI(
  base_url="https://openrouter.ai/api/v1",
  api_key="<OPENROUTER_API_KEY>",
)

completion = client.chat.completions.create(
  extra_headers={
    "HTTP-Referer": "<YOUR_SITE_URL>", # Optional. Site URL for rankings on openrouter.ai.
    "X-Title": "<YOUR_SITE_NAME>", # Optional. Site title for rankings on openrouter.ai.
  },
  extra_body={},
  model="google/gemini-pro-vision",
  messages=[
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What is in this image?"
        },
        {
          "type": "image_url",
          "image_url": {
            "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
          }
        }
      ]
    }
  ]
)
print(completion.choices[0].message.content)

Using third-party SDKs

For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.

See the Request docs for all possible fields, and Parameters for explanations of specific sampling parameters.

More models from Google

    Google: Gemini Pro Vision 1.0 | OpenRouter