ReleaseMarch 3, 20261 min read
Google Launches Gemini 3.1 Flash-Lite — Cheapest Frontier Model Yet
Gemini 3.1 Flash-Lite costs $0.25/1M input tokens, runs 2.5x faster time-to-first-token than Gemini 2.5 Flash while matching its quality.
Google released its most cost-efficient model yet at just $0.25 per million input tokens.
Gemini 3.1 Flash-Lite delivers 2.5x faster time-to-first-token and 45% faster output than Gemini 2.5 Flash while matching its quality. It supports multimodal prompts with up to 1 million tokens of context.
For high-volume production workloads, this model offers an exceptional price-to-performance ratio. At $0.25/1M input tokens, it undercuts most competitors in its capability tier.
Check our cost calculator to see how Flash-Lite compares for your usage patterns.