Release · March 3, 2026 · 1 min read

Google Launches Gemini 3.1 Flash-Lite — Cheapest Frontier Model Yet

Gemini 3.1 Flash-Lite costs $0.25 per 1M input tokens and delivers 2.5x faster time-to-first-token than Gemini 2.5 Flash while matching its quality.

Google released its most cost-efficient model yet at just $0.25 per million input tokens.

Gemini 3.1 Flash-Lite delivers 2.5x faster time-to-first-token and 45% faster output than Gemini 2.5 Flash while matching its quality. It supports multimodal prompts with up to 1 million tokens of context.

For high-volume production workloads, this model offers an exceptional price-to-performance ratio. At $0.25/1M input tokens, it undercuts most competitors in its capability tier.
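To make the pricing concrete, here is a rough sketch of the input-side arithmetic. The only figure taken from this article is the $0.25 per 1M input tokens rate. The workload numbers (requests per day, tokens per request) are hypothetical, and output-token pricing is left out because it is not stated here.

```python
# Input price quoted in the article: $0.25 per 1 million input tokens.
INPUT_PRICE_PER_MILLION_USD = 0.25


def input_cost_usd(input_tokens: int,
                   price_per_million: float = INPUT_PRICE_PER_MILLION_USD) -> float:
    """Return the input-token cost in USD for a given number of tokens."""
    return input_tokens / 1_000_000 * price_per_million


# Hypothetical workload, chosen only for illustration.
requests_per_day = 100_000
input_tokens_per_request = 2_000

daily_tokens = requests_per_day * input_tokens_per_request
daily_cost = input_cost_usd(daily_tokens)
monthly_cost = daily_cost * 30  # assumes a 30-day month

print(f"Daily input tokens: {daily_tokens:,}")
print(f"Daily input cost:   ${daily_cost:,.2f}")
print(f"Monthly input cost: ${monthly_cost:,.2f}")
```

This covers input tokens only. A full estimate would also need the output-token rate and any discounts that apply to your account.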

Check our cost calculator to see how Flash-Lite compares for your usage patterns.
