Google: Gemini 2.5 Flash Preview

google/gemini-2.5-flash-preview

Created Apr 17, 20251,048,576 context
$0.15/M input tokens$0.60/M output tokens$0.619/K input imgs

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling.

Note: This model is available in two variants: thinking and non-thinking. The output pricing varies significantly depending on whether the thinking capability is active. If you select the standard variant (without the ":thinking" suffix), the model will explicitly avoid generating thinking tokens.

To utilize the thinking capability and receive thinking tokens, you must choose the ":thinking" variant, which will then incur the higher thinking-output pricing.

Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).

Uptime stats for Gemini 2.5 Flash Preview

Uptime stats for Gemini 2.5 Flash Preview across all providers

When an error occurs in an upstream provider, we can recover by routing to another healthy provider, if your request filters allow it.

Learn more about our load balancing and customization options.

More models from Google

    Google: Gemini 2.5 Flash Preview – Uptime and Availability | OpenRouter