Updated March 26, 2026 · Based on independent benchmark data
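Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) leads in intelligence with a score of 51.7 vs 46.8 for GLM-5-Turbo. Speed cannot be compared directly: Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) is measured at 71 tok/s, while GLM-5-Turbo is listed at 0 tok/s, which most likely indicates a missing measurement rather than an actual result.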
| Metric | GLM-5-Turbo | Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) |
|---|---|---|
| Intelligence Score | 46.8 | 51.7 |
| Coding Score | 36.8 | 50.9 |
| Math Score | N/A | N/A |
| Speed (tok/s) | 0 tok/s | 71 tok/s |
| Latency (TTFT) | 0.00s | 32.16s |
| Input Price / 1M tokens | Free | $3.00 |
| Output Price / 1M tokens | Free | $15.00 |
| Context Window | N/A | N/A |
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) outperforms GLM-5-Turbo on the intelligence index with a score of 51.7 compared to 46.8. For coding tasks, Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) has the edge with a coding score of 50.9 vs 36.8.
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) generates output at 71 tok/s. GLM-5-Turbo is listed at 0 tok/s, which most likely means no throughput was recorded, so no meaningful speed ratio can be computed. Time to first token is listed as 0.00s for GLM-5-Turbo, which likewise appears to be a missing value, vs 32.16s for Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort). A delay of that length is noticeable in interactive applications.
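To show how the throughput and time-to-first-token figures combine, here is a minimal sketch that estimates end-to-end response time as TTFT plus output tokens divided by throughput. The function name `estimated_response_seconds` and the 1,000-token response length are illustrative assumptions, as is the simplification that throughput stays constant after the first token. A throughput of zero or less is treated as "not measured" rather than as a real speed.

```python
from typing import Optional


def estimated_response_seconds(
    output_tokens: int, ttft_s: float, tok_per_s: float
) -> Optional[float]:
    """Estimate total response time: time to first token plus generation time.

    Returns None when throughput is zero or negative, since such a value
    is assumed here to mean the benchmark has no measurement.
    """
    if tok_per_s <= 0:
        return None
    return ttft_s + output_tokens / tok_per_s


# Figures from the comparison table; the 1,000-token length is hypothetical.
claude_estimate = estimated_response_seconds(1000, ttft_s=32.16, tok_per_s=71)
glm_estimate = estimated_response_seconds(1000, ttft_s=0.00, tok_per_s=0)

print(claude_estimate)  # roughly 46.2 seconds
print(glm_estimate)     # None: no throughput figure to estimate from
```

Under these assumptions, most of the wait for a response of that length comes from the 32.16s time to first token, not from token generation.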
GLM-5-Turbo is completely free, while Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) costs $3.00/1M input tokens and $15/1M output tokens.
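As a rough illustration of what the per-token prices mean in practice, the sketch below computes the cost of a workload from the listed rates. The function name `request_cost_usd` and the example token counts are assumptions chosen for illustration. Real bills may differ, for example if reasoning tokens are billed as output.

```python
def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Cost in USD given token counts and prices per 1M tokens."""
    return (
        input_tokens * input_price_per_m + output_tokens * output_price_per_m
    ) / 1_000_000


# Hypothetical workload: 200,000 input tokens and 50,000 output tokens.
claude_cost = request_cost_usd(
    200_000, 50_000, input_price_per_m=3.00, output_price_per_m=15.00
)
glm_cost = request_cost_usd(
    200_000, 50_000, input_price_per_m=0.0, output_price_per_m=0.0
)

print(f"Claude Sonnet 4.6: ${claude_cost:.2f}")  # $0.60 input + $0.75 output
print(f"GLM-5-Turbo: ${glm_cost:.2f}")
```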
Choose Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) when you need higher intelligence (51.7) or stronger coding performance (50.9). Its measured output speed is 71 tok/s; no comparable figure is available for GLM-5-Turbo.
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) scores higher on coding benchmarks (50.9 vs 36.8), making it the better choice for programming tasks.
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) produces output at 71 tok/s. GLM-5-Turbo is listed at 0 tok/s, which appears to be a missing measurement, so the data does not establish which model is faster.
Neither model supports image input. Both GLM-5-Turbo and Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) accept text only.
Which model is better depends on your priorities. Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) scores higher on intelligence (51.7 vs 46.8) and coding (50.9 vs 36.8), while GLM-5-Turbo is free and may suit cost-sensitive use cases. The available data does not support a speed comparison.
Data last synced: March 26, 2026
| Metric | GLM-5-Turbo | Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) |
|---|---|---|
| Max Output Tokens | N/A | N/A |
| Input Modalities | Text | Text |
| Output Modalities | Text | Text |
| Free Tier | Yes | No |