Google: Gemini 2.5 Flash
GoogleID: google/gemini-2.5-flash
Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater accuracy and nuanced context handling. Additionally, Gemini 2.5 Flash is configurable through the "max tokens for reasoning" parameter, as described in the documentation (https://openrouter.ai/docs/use-cases/reasoning-tokens#max-tokens-for-reasoning).
Pricing per 1M Tokens
| Input (Prompt) | $0.30 |
| Output (Completion) | $2.50 |
| Cache Read | $0.03 |
| Cache Write | $0.08 |
| Image | $0.30 |
Specifications
| Context Length | 1.0M |
| Max Output Tokens | 66K |
| Input Modalities | File + Image + Text + Audio + Video |
| Output Modalities | Text |
| Tokenizer | Gemini |
| Instruct Type | N/A |
| Top Provider Context | 1.0M |
| Top Provider Max Output | 66K |
| Moderated | No |
Compare this model
See how Google: Gemini 2.5 Flash stacks up against other models.
More from Google
Last updated: March 23, 2026
First tracked: March 23, 2026