Updated March 26, 2026· Based on independent benchmark data
Kimi K2.5 (Reasoning) and Qwen3.5 397B A17B (Reasoning) are virtually tied on intelligence (46.8 vs 45.0). For speed, Qwen3.5 397B A17B (Reasoning) wins at 53 tok/s vs 34 tok/s.
| Metric | Kimi K2.5 (Reasoning) | Qwen3.5 397B A17B (Reasoning) |
|---|---|---|
| Intelligence Score | 46.8 | 45.0 |
| Coding Score | 39.5 | 41.3 |
| Math Score | N/A | N/A |
| Speed (tok/s) | 34 tok/s | 53 tok/s |
| Latency (TTFT) | 1.40s | 1.46s |
| Input Price / 1M tokens | $0.60 | $0.60 |
| Output Price / 1M tokens | $3.00 | $3.60 |
| Context Window | N/A |
Kimi K2.5 (Reasoning) and Qwen3.5 397B A17B (Reasoning) perform similarly on overall intelligence, scoring 46.8 and 45.0 respectively. Their coding capabilities are closely matched (39.5 vs 41.3).
Qwen3.5 397B A17B (Reasoning) generates output significantly faster at 53 tok/s compared to Kimi K2.5 (Reasoning)'s 34 tok/s, making it 1.6x faster for streaming responses. Time to first token is 1.40s for Kimi K2.5 (Reasoning) vs 1.46s for Qwen3.5 397B A17B (Reasoning), which affects perceived responsiveness in interactive applications.
Qwen3.5 397B A17B (Reasoning) is more affordable at $0.60/1M input tokens ($3.60/1M output), while Kimi K2.5 (Reasoning) costs $0.60/1M input ($3.00/1M output). For a typical workload of 100 requests per day at 2,000 tokens each, Kimi K2.5 (Reasoning) would cost approximately $3.60/month vs $3.60/month for Qwen3.5 397B A17B (Reasoning) in input costs alone.
Choose Qwen3.5 397B A17B (Reasoning) when you need faster output (53 tok/s).
Qwen3.5 397B A17B (Reasoning) scores higher on coding benchmarks (41.3 vs 39.5), making it the better choice for programming tasks.
Qwen3.5 397B A17B (Reasoning) is cheaper at $0.60/1M input tokens vs $0.60/1M for Kimi K2.5 (Reasoning).
Qwen3.5 397B A17B (Reasoning) is faster, producing output at 53 tok/s compared to Kimi K2.5 (Reasoning)'s 34 tok/s.
No, Kimi K2.5 (Reasoning) does not support image input. Neither model supports image input.
Data last synced: March 26, 2026
| N/A |
| Max Output Tokens | N/A | N/A |
| Input Modalities | Text | Text |
| Output Modalities | Text | Text |
| Free Tier | No | No |
Both models perform similarly on intelligence benchmarks. Choose based on specific needs: pricing, speed, context window, or provider ecosystem.