Updated March 26, 2026· Based on independent benchmark data
Gemini 3.1 Pro Preview leads in intelligence with a score of 57.2 vs 19.7. QwQ 32B is 3.0x cheaper at $0.66/1M tokens vs $2.00/1M. For speed, Gemini 3.1 Pro Preview wins at 113 tok/s vs 32 tok/s.
| Metric | QwQ 32B | Gemini 3.1 Pro Preview |
|---|---|---|
| Intelligence Score | 19.7 | 57.2 |
| Coding Score | N/A | 55.5 |
| Math Score | 29.0 | N/A |
| Speed (tok/s) | 32 tok/s | 113 tok/s |
| Latency (TTFT) | 0.45s | 23.84s |
| Input Price / 1M tokens | $0.66 | $2.00 |
| Output Price / 1M tokens | $1.00 | $12 |
| Context Window | N/A | 1.0M |
| Max Output Tokens | N/A | N/A |
| Input Modalities | Text | Audio + File + Image + Text + Video |
Gemini 3.1 Pro Preview outperforms QwQ 32B on the intelligence index with a score of 57.2 compared to 19.7.
Gemini 3.1 Pro Preview generates output significantly faster at 113 tok/s compared to QwQ 32B's 32 tok/s, making it 3.5x faster for streaming responses. Time to first token is 0.45s for QwQ 32B vs 23.84s for Gemini 3.1 Pro Preview, which affects perceived responsiveness in interactive applications.
QwQ 32B is more affordable at $0.66/1M input tokens ($1.00/1M output), while Gemini 3.1 Pro Preview costs $2.00/1M input ($12/1M output). That makes Gemini 3.1 Pro Preview 3.0x more expensive per token, which can add up significantly at scale. For a typical workload of 100 requests per day at 2,000 tokens each, QwQ 32B would cost approximately $3.96/month vs $12.00/month for Gemini 3.1 Pro Preview in input costs alone.
Choose QwQ 32B when you need lower cost. Choose Gemini 3.1 Pro Preview when you need higher intelligence (57.2), faster output (113 tok/s).
QwQ 32B is cheaper at $0.66/1M input tokens vs $2.00/1M for Gemini 3.1 Pro Preview.
Gemini 3.1 Pro Preview is faster, producing output at 113 tok/s compared to QwQ 32B's 32 tok/s.
No, QwQ 32B does not support image input. However, Gemini 3.1 Pro Preview does support images.
It depends on your priorities. Gemini 3.1 Pro Preview scores higher on intelligence (57.2), but QwQ 32B may be better for specific use cases like budget-conscious projects or speed-critical applications.
Data last synced: March 26, 2026
| Output Modalities |
| Text |
| Text |
| Free Tier | No | No |