Compare pricing for 348+ AI models side by side. From free open-source models to premium enterprise options, find the right price-performance balance for your use case.
53 models
5 models
AI model pricing is based on tokens, which are the fundamental units of text that language models process. One token is roughly 3/4 of an English word, so 1 million tokens equals approximately 750,000 words or about 1,500 pages of text.
Most providers charge separately for input tokens (the text you send to the model) and output tokens (the text the model generates). Output tokens typically cost 2-4x more than input tokens because generating new text requires more computational work than reading existing text. Each output token must be produced sequentially through a full forward pass of the neural network.
To estimate your monthly cost, multiply your average daily requests by 30, then by your average tokens per request. For example, 100 requests per day at 2,000 tokens each equals 6 million tokens per month. At $3/1M tokens, that would be $18/month. Some providers also offer cached token pricing at a discount for repeated prompts, and batch processing at lower rates for non-real-time workloads.
Free models are a great starting point for experimentation and low-volume use cases. As your needs grow, budget and mid-range models offer an excellent balance of quality and cost. Premium and enterprise models deliver the highest benchmark scores for tasks demanding maximum accuracy, like complex coding or research analysis.
100 requests/day at 2,000 tokens each (input + output)
| Model | Provider | Input/1M | Output/1M | Est. Monthly |
|---|---|---|---|---|
| Gemma 3n E4B Instruct | $0.02 | $0.04 | $0.36/mo | |
| Granite 3.3 8B (Non-reasoning) | IBM | $0.03 | $0.25 | $1.68/mo |
| LFM2 24B A2B | Liquid AI | $0.03 | $0.12 | $0.90/mo |
| Nova Micro | Amazon | $0.04 | $0.14 | $1.05/mo |
| NVIDIA Nemotron Nano 9B V2 (Reasoning) |
| Model | Provider | Input/1M | Output/1M | Context | Intelligence |
|---|---|---|---|---|---|
| LFM2.5-1.2B-ThinkingFree | Liquid AI | Free | Free | N/A | 8.1 |
| Devstral Small 2Free | Mistral | Free | Free | N/A | 19.5 |
| LFM2 2.6BFree | Liquid AI | Free | Free | N/A |
AI models charge per token, where a token is roughly 3/4 of a word. Prices are quoted per million tokens. Most models charge separately for input (prompt) tokens and output (completion) tokens, with output typically costing 2-4x more than input.
Output tokens require the model to generate new text, which is computationally more expensive than reading input. The model must perform inference for each output token sequentially, while input tokens can be processed in parallel.
Among paid models with benchmark data, Gemma 3n E4B Instruct offers some of the lowest pricing at $0.02/1M input tokens. There are also many free models available.
For a typical individual workload of 100 requests per day at 2,000 tokens each, monthly costs range from $0 (free models) to $50+ (premium models). Enterprise usage at higher volumes can cost significantly more. Use the cost examples on this page to estimate your specific usage.
| NVIDIA |
| $0.04 |
| $0.16 |
| $1.20/mo |
| LFM2.5-VL-1.6BFree | Liquid AI | Free | Free | N/A | 6.2 |
| LFM2.5-1.2B-InstructFree | Liquid AI | Free | Free | N/A | 8.0 |
| Olmo 3.1 32B ThinkFree | Allen Institute for AI | Free | Free | N/A | 13.9 |
| Molmo2-8BFree | Allen Institute for AI | Free | Free | N/A | 7.3 |
| Molmo 7B-DFree | Allen Institute for AI | Free | Free | N/A | 9.2 |
| Gemma 3 1B InstructFree | Free | Free | N/A | 5.5 |
| SonarFree | Perplexity | Free | Free | 127K | 15.5 |
| Step3 VL 10BFree | StepFun | Free | Free | N/A | 15.4 |
| Kimi Linear 48B A3B InstructFree | Kimi | Free | Free | N/A | 14.4 |
| Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)Free | NVIDIA | Free | Free | N/A | 14.4 |
| NVIDIA Nemotron 3 Nano 4BFree | NVIDIA | Free | Free | N/A | 14.7 |
| Llama 3.3 Nemotron Super 49B v1 (Reasoning)Free | NVIDIA | Free | Free | N/A | 18.5 |
| Solar Pro 2 (Reasoning)Free | Upstage | Free | Free | N/A | 14.9 |
| Devstral 2Free | Mistral | Free | Free | N/A | 22.0 |
| Solar Open 100B (Reasoning)Free | Upstage | Free | Free | N/A | 21.7 |
| LFM2 8B A1BFree | Liquid AI | Free | Free | N/A | 7.0 |
| Sonar ProFree | Perplexity | Free | Free | 200K | 15.2 |
| DeepSeek V3.2 SpecialeFree | DeepSeek | Free | Free | 164K | 29.4 |
| DeepSeek R1 0528 Qwen3 8BFree | DeepSeek | Free | Free | 128K | 16.4 |
| R1 1776Free | Perplexity | Free | Free | N/A | 12.0 |
| Falcon-H1R-7BFree | TII UAE | Free | Free | N/A | 15.8 |
| Phi-4 Multimodal InstructFree | Microsoft Azure | Free | Free | N/A | 10.0 |
| Phi-4 Mini InstructFree | Microsoft Azure | Free | Free | N/A | 8.4 |
| Qwen3 4B 2507 (Reasoning)Free | Alibaba | Free | Free | N/A | 18.2 |
| Qwen2.5 Instruct 32BFree | Alibaba | Free | Free | N/A | 13.2 |
| Qwen3 4B 2507 InstructFree | Alibaba | Free | Free | N/A | 12.9 |
| Qwen3 VL 4B (Reasoning)Free | Alibaba | Free | Free | N/A | 13.7 |
| GPT-4o (ChatGPT)Free | OpenAI | Free | Free | N/A | 14.1 |
| Qwen Chat 72BFree | Alibaba | Free | Free | N/A | 8.8 |
| DBRX InstructFree | Databricks | Free | Free | N/A | 8.3 |
| Qwen1.5 Chat 110BFree | Alibaba | Free | Free | N/A | 9.5 |
| Llama 2 Chat 70BFree | Meta | Free | Free | N/A | 8.4 |
| Llama 2 Chat 13BFree | Meta | Free | Free | N/A | 8.4 |
| Gemini 2.0 Pro Experimental (Feb '25)Free | Free | Free | N/A | 18.1 |
| Gemini 2.0 Flash (experimental)Free | Free | Free | N/A | 16.8 |
| Gemini 1.5 Pro (Sep '24)Free | Free | Free | N/A | 16.0 |
| Gemini 2.0 Flash-Lite (Preview)Free | Free | Free | N/A | 14.5 |
| Gemini 1.0 ProFree | Free | Free | 33K | 8.5 |
| Solar Pro 2 (Preview) (Reasoning)Free | Upstage | Free | Free | N/A | 18.8 |
| Gemini 1.5 Flash (Sep '24)Free | Free | Free | N/A | 13.8 |
| Gemini 1.5 Flash-8BFree | Free | Free | N/A | 11.1 |
| LFM2 1.2BFree | Liquid AI | Free | Free | N/A | 6.3 |
| Gemini 2.5 Flash Preview (Sep '25) (Reasoning)Free | Free | Free | N/A | 31.1 |
| LFM 40BFree | Liquid AI | Free | Free | N/A | 8.8 |
| Gemini 1.0 UltraFree | Free | Free | N/A | 10.1 |
| Gemini 2.0 Flash-Lite (Feb '25)Free | Free | Free | N/A | 14.7 |
| Qwen2.5 Coder Instruct 7B Free | Alibaba | Free | Free | N/A | 10.0 |
| Gemini 2.0 Flash Thinking Experimental (Jan '25)Free | Free | Free | N/A | 19.6 |
| Gemma 3n E4B Instruct Preview (May '25)Free | Free | Free | N/A | 10.1 |
| PALM-2Free | Free | Free | N/A | 8.6 |
| Gemini 2.5 Pro Preview (Mar' 25)Free | Free | Free | N/A | 30.3 |
| Phi-3 Mini Instruct 3.8BFree | Microsoft Azure | Free | Free | N/A | 10.1 |
| OpenChat 3.5 (1210)Free | OpenChat | Free | Free | N/A | 8.3 |
| Grok 2 (Dec '24)Free | xAI | Free | Free | N/A | 13.9 |
| Grok 3 Reasoning BetaFree | xAI | Free | Free | N/A | 21.6 |
| Grok BetaFree | xAI | Free | Free | N/A | 13.3 |
| Claude InstantFree | Anthropic | Free | Free | N/A | 7.4 |
| DeepSeek-V2-ChatFree | DeepSeek | Free | Free | N/A | 9.1 |
| DeepSeek Coder V2 Lite InstructFree | DeepSeek | Free | Free | N/A | 8.5 |
| DeepSeek R1 Distill Qwen 1.5BFree | DeepSeek | Free | Free | N/A | 9.1 |
| DeepSeek LLM 67B Chat (V1)Free | DeepSeek | Free | Free | N/A | 8.4 |
| DeepSeek R1 Distill Llama 8BFree | DeepSeek | Free | Free | N/A | 12.1 |
| GPT-4o (March 2025, chatgpt-4o-latest)Free | OpenAI | Free | Free | N/A | 18.6 |
| Claude 2.1Free | Anthropic | Free | Free | N/A | 9.3 |
| Claude 2.0Free | Anthropic | Free | Free | N/A | 9.1 |
| GPT-4.5 (Preview)Free | OpenAI | Free | Free | N/A | 20.0 |
| Sonar Reasoning ProFree | Perplexity | Free | Free | 128K | 24.6 |
| Mixtral 8x22B InstructFree | Mistral | Free | Free | N/A | 9.8 |
| o1-miniFree | OpenAI | Free | Free | N/A | 20.4 |
| Doubao Seed CodeFree | ByteDance Seed | Free | Free | N/A | 33.5 |
| Ring-1TFree | Inclusionai | Free | Free | N/A | 22.8 |
| Qwen3 VL 4B InstructFree | Alibaba | Free | Free | N/A | 9.6 |
| Ling-1TFree | Inclusionai | Free | Free | N/A | 19.0 |
| Mistral SabaFree | Mistral | Free | Free | N/A | 12.1 |
| Sonar ReasoningFree | Perplexity | Free | Free | 127K | 17.9 |
| Magistral Small 1Free | Mistral | Free | Free | N/A | 16.8 |
| Ling-mini-2.0Free | Inclusionai | Free | Free | N/A | 9.2 |
| Magistral Medium 1Free | Mistral | Free | Free | N/A | 18.8 |
| Qwen3.5 4B (Reasoning)Free | Alibaba | Free | Free | N/A | 27.1 |
| MiniMax M1 40kFree | MiniMax | Free | Free | N/A | 20.9 |
| Qwen3.5 2B (Reasoning)Free | Alibaba | Free | Free | N/A | 16.3 |
| Qwen3.5 0.8B (Reasoning)Free | Alibaba | Free | Free | N/A | 10.5 |
| Jamba Reasoning 3BFree | AI21 Labs | Free | Free | N/A | 9.6 |
| Llama 3.1 Tulu3 405BFree | Allen Institute for AI | Free | Free | N/A | 14.1 |
| OLMo 2 32BFree | Allen Institute for AI | Free | Free | N/A | 10.6 |
| Jamba 1.7 MiniFree | AI21 Labs | Free | Free | N/A | 8.1 |
| Apriel-v1.6-15B-ThinkerFree | ServiceNow | Free | Free | N/A | 27.6 |
| DeepSeek R1 Distill Qwen 14BFree | DeepSeek | Free | Free | N/A | 15.8 |
| DeepSeek-V2.5 (Dec '24)Free | DeepSeek | Free | Free | N/A | 12.5 |
| DeepSeek-Coder-V2Free | DeepSeek | Free | Free | N/A | 10.6 |
| Tiny Aya GlobalFree | Cohere | Free | Free | N/A | 4.7 |
| Qwen Chat 14BFree | Alibaba | Free | Free | N/A | 7.4 |
| GLM-5-TurboFree | Z AI | Free | Free | N/A | 46.8 |
| Nanbeige4.1-3BFree | Nanbeige | Free | Free | N/A | 16.1 |
| Grok-1Free | xAI | Free | Free | N/A | 11.7 |
| Tri-21B-ThinkFree | Trillion Labs | Free | Free | N/A | 18.6 |
| Tri-21B-think PreviewFree | Trillion Labs | Free | Free | N/A | 20.0 |
| LongCat Flash LiteFree | LongCat | Free | Free | N/A | 23.9 |
| HyperCLOVA X SEED Think (32B)Free | Naver | Free | Free | N/A | 23.7 |
| OLMo 2 7BFree | Allen Institute for AI | Free | Free | N/A | 9.3 |
| Olmo 3 32B ThinkFree | Allen Institute for AI | Free | Free | N/A | 12.1 |
| Mi:dm K 2.5 ProFree | Korea Telecom | Free | Free | N/A | 23.1 |
| K2 Think V2Free | MBZUAI Institute of Foundation Models | Free | Free | N/A | 24.1 |
| Sarvam M (Reasoning)Free | Sarvam | Free | Free | N/A | 8.4 |
| K2-V2 (high)Free | MBZUAI Institute of Foundation Models | Free | Free | N/A | 20.6 |
| Motif-2-12.7B-ReasoningFree | Motif Technologies | Free | Free | N/A | 19.1 |
| INTELLECT-3Free | Prime Intellect | Free | Free | N/A | 22.2 |
| Sarvam 30B (high)Free | Sarvam | Free | Free | N/A | 12.3 |
| Sarvam 105B (high)Free | Sarvam | Free | Free | N/A | 18.2 |
| ERNIE 5.0 Thinking PreviewFree | Baidu | Free | Free | N/A | 29.1 |
| MiMo-V2-OmniFree | Xiaomi | Free | Free | N/A | 43.4 |
| K-EXAONE (Reasoning)Free | LG AI Research | Free | Free | N/A | 32.1 |
| Llama 65BFree | Meta | Free | Free | N/A | 7.4 |
| EXAONE 4.0 32B (Reasoning)Free | LG AI Research | Free | Free | N/A | 16.7 |
| Apriel-v1.5-15B-ThinkerFree | ServiceNow | Free | Free | N/A | 28.3 |
| Exaone 4.0 1.2B (Reasoning)Free | LG AI Research | Free | Free | N/A | 8.3 |
| DeepHermes 3 - Mistral 24B Preview (Non-reasoning)Free | Nous Research | Free | Free | N/A | 10.9 |
| DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)Free | Nous Research | Free | Free | N/A | 7.6 |
| Granite 4.0 350MFree | IBM | Free | Free | N/A | 6.1 |
| Arctic InstructFree | Snowflake | Free | Free | N/A | 8.8 |
| Granite 4.0 H 1BFree | IBM | Free | Free | N/A | 8.0 |
| Qwen2.5 Instruct 72BFree | Alibaba | Free | Free | N/A | 15.6 |
| Qwen2.5 Coder Instruct 32BFree | Alibaba | Free | Free | N/A | 12.9 |
| Qwen2 Instruct 72BFree | Alibaba | Free | Free | N/A | 11.7 |
| Granite 4.0 H 350MFree | IBM | Free | Free | N/A | 5.4 |
| Granite 4.0 1BFree | IBM | Free | Free | N/A | 7.3 |
| Gemma 3 270MFree | Free | Free | N/A | 7.7 |
| Granite 4.0 MicroFree | IBM | Free | Free | N/A | 7.7 |
| Gemma 3 4B InstructFree | Free | Free | N/A | 6.3 |
| Gemma 3n E2B InstructFree | Free | Free | N/A | 4.8 |
| Gemma 3 27B InstructFree | Free | Free | N/A | 10.3 |
| Gemma 3 12B InstructFree | Free | Free | N/A | 8.8 |
| Olmo 3 7B ThinkFree | Allen Institute for AI | Free | Free | N/A | 9.4 |
| Gemma 3n E4B Instruct | $0.02 | $0.04 | N/A | 6.4 |
| Granite 3.3 8B (Non-reasoning) | IBM | $0.03 | $0.25 | N/A | 7.0 |
| LFM2 24B A2B | Liquid AI | $0.03 | $0.12 | N/A | 10.5 |
| Nova Micro | Amazon | $0.04 | $0.14 | 128K | 10.3 |
| NVIDIA Nemotron Nano 9B V2 (Reasoning) | NVIDIA | $0.04 | $0.16 | N/A | 14.8 |
| Llama 3 Instruct 8B | Meta | $0.04 | $0.14 | N/A | 6.4 |
| Llama 2 Chat 7B | Meta | $0.05 | $0.25 | N/A | 9.7 |
| GPT-5 nano (minimal) | OpenAI | $0.05 | $0.40 | N/A | 13.8 |
| Qwen2.5 Turbo | Alibaba | $0.05 | $0.20 | N/A | 12.0 |
| GPT-5 nano (high) | OpenAI | $0.05 | $0.40 | N/A | 26.8 |
| Granite 4.0 H Small | IBM | $0.06 | $0.25 | N/A | 10.8 |
| NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | NVIDIA | $0.06 | $0.24 | N/A | 24.3 |
| Nova Lite | Amazon | $0.06 | $0.24 | 300K | 12.7 |
| Devstral Small (May '25) | Mistral | $0.06 | $0.12 | N/A | 18.0 |
| gpt-oss-20B (high) | OpenAI | $0.06 | $0.20 | N/A | 24.5 |
| GLM-4.7-Flash (Reasoning) | Z AI | $0.07 | $0.40 | N/A | 30.1 |
| Llama 3.2 Instruct 3B | Meta | $0.08 | $0.08 | N/A | 9.7 |
| MiMo-V2-Flash (Feb 2026) | Xiaomi | $0.10 | $0.30 | N/A | 41.5 |
| GPT-4.1 nano | OpenAI | $0.10 | $0.40 | 1.0M | 13.0 |
| Llama 3.1 Instruct 8B | Meta | $0.10 | $0.10 | N/A | 11.8 |
| Llama 3.2 Instruct 1B | Meta | $0.10 | $0.10 | N/A | 6.3 |
| Gemini 2.5 Flash-Lite (Reasoning) | $0.10 | $0.40 | N/A | 17.6 |
| Mistral Small 3 | Mistral | $0.10 | $0.30 | N/A | 12.7 |
| Mistral Small 3.1 | Mistral | $0.10 | $0.30 | N/A | 14.5 |
| Mistral Small 3.2 | Mistral | $0.10 | $0.30 | N/A | 15.1 |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | $0.10 | $0.40 | N/A | 21.6 |
| Ministral 3 3B | Mistral | $0.10 | $0.10 | N/A | 11.2 |
| Llama Nemotron Super 49B v1.5 (Reasoning) | NVIDIA | $0.10 | $0.40 | N/A | 18.7 |
| Step 3.5 Flash | StepFun | $0.10 | $0.30 | N/A | 37.8 |
| Olmo 3 7B Instruct | Allen Institute for AI | $0.10 | $0.20 | N/A | 8.2 |
| Apertus 8B Instruct | Swiss AI Initiative | $0.10 | $0.20 | N/A | 5.9 |
| Qwen3.5 9B (Reasoning) | Alibaba | $0.10 | $0.15 | N/A | 32.4 |
| Qwen3 0.6B (Reasoning) | Alibaba | $0.11 | $1.26 | N/A | 6.5 |
| Qwen3 4B (Reasoning) | Alibaba | $0.11 | $1.26 | N/A | 14.2 |
| Qwen3 1.7B (Reasoning) | Alibaba | $0.11 | $1.26 | N/A | 8.0 |
| QwQ 32B-Preview | Alibaba | $0.12 | $0.18 | N/A | 15.2 |
| Phi-4 | Microsoft Azure | $0.13 | $0.50 | N/A | 10.4 |
| Hermes 4 - Llama-3.1 70B (Reasoning) | Nous Research | $0.13 | $0.40 | N/A | 16.0 |
| Ling-flash-2.0 | Inclusionai | $0.14 | $0.57 | N/A | 15.7 |
| Ring-flash-2.0 | Inclusionai | $0.14 | $0.57 | N/A | 14.0 |
| Gemini 2.0 Flash (Feb '25) | $0.15 | $0.60 | N/A | 18.5 |
| Solar Mini | Upstage | $0.15 | $0.15 | N/A | 11.9 |
| GPT-4o mini | OpenAI | $0.15 | $0.60 | N/A | 12.6 |
| gpt-oss-120B (high) | OpenAI | $0.15 | $0.60 | N/A | 33.3 |
| Mistral Small 4 (Reasoning) | Mistral | $0.15 | $0.60 | N/A | 26.9 |
| Ministral 3 8B | Mistral | $0.15 | $0.15 | N/A | 14.8 |
| Llama 3.2 Instruct 11B (Vision) | Meta | $0.16 | $0.16 | N/A | 8.7 |
| Llama 4 Scout | Meta | $0.17 | $0.66 | N/A | 13.5 |
| Qwen3 VL 8B (Reasoning) | Alibaba | $0.18 | $2.10 | N/A | 16.7 |
| Qwen3 8B (Reasoning) | Alibaba | $0.18 | $2.10 | N/A | 13.2 |
| Qwen3 VL 8B Instruct | Alibaba | $0.18 | $0.70 | N/A | 14.3 |
| Qwen3 30B A3B 2507 (Reasoning) | Alibaba | $0.20 | $2.40 | N/A | 22.4 |
| GPT-5.4 nano (xhigh) | OpenAI | $0.20 | $1.25 | N/A | 44.4 |
| Grok Code Fast 1 | xAI | $0.20 | $1.50 | 256K | 28.7 |
| Ministral 3 14B | Mistral | $0.20 | $0.20 | N/A | 16.0 |
| Qwen3 VL 30B A3B (Reasoning) | Alibaba | $0.20 | $2.40 | N/A | 19.7 |
| NVIDIA Nemotron Nano 12B v2 VL (Reasoning) | NVIDIA | $0.20 | $0.60 | N/A | 14.9 |
| Qwen3 30B A3B 2507 Instruct | Alibaba | $0.20 | $0.80 | N/A | 15.0 |
| Qwen3 VL 30B A3B Instruct | Alibaba | $0.20 | $0.80 | N/A | 16.1 |
| Olmo 3.1 32B Instruct | Allen Institute for AI | $0.20 | $0.60 | N/A | 12.2 |
| Qwen3 30B A3B (Reasoning) | Alibaba | $0.20 | $2.40 | N/A | 15.3 |
| Grok 4.1 Fast (Reasoning) | xAI | $0.20 | $0.50 | N/A | 38.6 |
| Reka Flash 3 | Reka AI | $0.20 | $0.80 | N/A | 9.5 |
| Jamba 1.6 Mini | AI21 Labs | $0.20 | $0.40 | N/A | 7.9 |
| Grok 4 Fast (Reasoning) | xAI | $0.20 | $0.50 | N/A | 35.1 |
| Jamba 1.5 Mini | AI21 Labs | $0.20 | $0.40 | N/A | 8.0 |
| Mistral Small (Sep '24) | Mistral | $0.20 | $0.60 | N/A | 10.2 |
| GLM-4.5-Air | Z AI | $0.20 | $1.10 | N/A | 23.2 |
| Reka Flash (Sep '24) | Reka AI | $0.20 | $0.80 | N/A | 12.0 |
| Seed-OSS-36B-Instruct | ByteDance Seed | $0.21 | $0.57 | N/A | 25.2 |
| Qwen3.5 35B A3B (Reasoning) | Alibaba | $0.25 | $2.00 | N/A | 37.1 |
| Mistral 7B Instruct | Mistral | $0.25 | $0.25 | N/A | 7.4 |
| Qwen3 Omni 30B A3B Instruct | Alibaba | $0.25 | $0.97 | N/A | 10.7 |
| Claude 3 Haiku | Anthropic | $0.25 | $1.25 | 200K | 12.3 |
| GPT-5 mini (high) | OpenAI | $0.25 | $2.00 | N/A | 41.2 |
| GPT-5.1 Codex mini (high) | OpenAI | $0.25 | $2.00 | N/A | 38.6 |
| GPT-5 mini (minimal) | OpenAI | $0.25 | $2.00 | N/A | 20.7 |
| Gemini 3.1 Flash-Lite Preview | $0.25 | $1.50 | N/A | 33.5 |
| Qwen3 Omni 30B A3B (Reasoning) | Alibaba | $0.25 | $0.97 | N/A | 15.6 |
| Mercury 2 | Inception | $0.25 | $0.75 | 128K | 32.8 |
| DeepSeek R1 Distill Qwen 32B | DeepSeek | $0.27 | $0.27 | 33K | 17.2 |
| DeepSeek V3.2 Exp (Reasoning) | DeepSeek | $0.28 | $0.42 | N/A | 32.9 |
| ERNIE 4.5 300B A47B | Baidu | $0.28 | $1.10 | 123K | 15.0 |
| DeepSeek V3.2 (Reasoning) | DeepSeek | $0.28 | $0.42 | N/A | 41.7 |
| Hermes 3 - Llama-3.1 70B | Nous Research | $0.30 | $0.30 | N/A | 10.6 |
| Grok 3 mini Reasoning (high) | xAI | $0.30 | $0.50 | N/A | 32.1 |
| MiniMax-M2.7 | MiniMax | $0.30 | $1.20 | N/A | 49.6 |
| GLM-4.6V (Reasoning) | Z AI | $0.30 | $0.90 | N/A | 23.4 |
| KAT-Coder-Pro V1 | KwaiKAT | $0.30 | $1.20 | N/A | 36.0 |
| Nova 2.0 Lite (medium) | Amazon | $0.30 | $2.50 | N/A | 29.7 |
| Nova 2.0 Omni (medium) | Amazon | $0.30 | $2.50 | N/A | 28.0 |
| MiniMax-M2 | MiniMax | $0.30 | $1.20 | N/A | 36.1 |
| MiniMax-M2.1 | MiniMax | $0.30 | $1.20 | N/A | 39.4 |
| MiniMax-M2.5 | MiniMax | $0.30 | $1.20 | N/A | 41.9 |
| Qwen3.5 27B (Reasoning) | Alibaba | $0.30 | $2.40 | N/A | 42.1 |
| Gemini 2.5 Flash (Reasoning) | $0.30 | $2.50 | N/A | 27.0 |
| NVIDIA Nemotron 3 Super 120B A12B (Reasoning) | NVIDIA | $0.30 | $0.75 | N/A | 36.0 |
| Llama 4 Maverick | Meta | $0.31 | $0.91 | N/A | 18.4 |
| Qwen3 Coder Next | Alibaba | $0.35 | $1.20 | N/A | 28.3 |
| Qwen3 14B (Reasoning) | Alibaba | $0.35 | $4.20 | N/A | 16.2 |
| Devstral Medium | Mistral | $0.40 | $2.00 | N/A | 18.7 |
| Mistral Medium 3.1 | Mistral | $0.40 | $2.00 | N/A | 21.3 |
| DeepSeek V3 (Dec '24) | DeepSeek | $0.40 | $0.89 | N/A | 16.5 |
| Qwen3.5 122B A10B (Reasoning) | Alibaba | $0.40 | $3.20 | N/A | 41.6 |
| Mistral Medium 3 | Mistral | $0.40 | $2.00 | N/A | 18.8 |
| DeepSeek V3.1 Terminus (Reasoning) | DeepSeek | $0.40 | $2.00 | N/A | 33.9 |
| GPT-4.1 mini | OpenAI | $0.40 | $1.60 | 1.0M | 22.9 |
| Qwen3 Coder 30B A3B Instruct | Alibaba | $0.45 | $2.25 | N/A | 20.0 |
| GLM-4.5 (Reasoning) | Z AI | $0.49 | $1.90 | N/A | 26.4 |
| Gemini 3 Flash Preview (Reasoning) | $0.50 | $3.00 | N/A | 46.4 |
| Mistral Large 3 | Mistral | $0.50 | $1.50 | N/A | 22.8 |
| GPT-3.5 Turbo | OpenAI | $0.50 | $1.50 | 16K | 9.0 |
| Magistral Small 1.2 | Mistral | $0.50 | $1.50 | N/A | 18.2 |
| Qwen3 Next 80B A3B (Reasoning) | Alibaba | $0.50 | $6.00 | N/A | 26.7 |
| Qwen3 Next 80B A3B Instruct | Alibaba | $0.50 | $2.00 | N/A | 20.1 |
| Command-R (Mar '24) | Cohere | $0.50 | $1.50 | N/A | 7.4 |
| Mixtral 8x7B Instruct | Mistral | $0.54 | $0.60 | N/A | 7.7 |
| MiniMax M1 80k | MiniMax | $0.55 | $2.20 | N/A | 24.4 |
| Llama 3.1 Instruct 70B | Meta | $0.56 | $0.56 | N/A | 12.5 |
| DeepSeek V3.1 (Non-reasoning) | DeepSeek | $0.56 | $1.68 | N/A | 28.1 |
| Kimi K2 | Kimi | $0.57 | $2.40 | N/A | 26.3 |
| GLM-4.6 (Reasoning) | Z AI | $0.57 | $2.20 | N/A | 32.5 |
| Llama 3 Instruct 70B | Meta | $0.58 | $1.75 | N/A | 8.9 |
| Llama 3.3 Instruct 70B | Meta | $0.58 | $0.71 | N/A | 14.5 |
| GLM-4.5V (Reasoning) | Z AI | $0.60 | $1.80 | N/A | 15.1 |
| Qwen3.5 397B A17B (Reasoning) | Alibaba | $0.60 | $3.60 | N/A | 45.0 |
| Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | NVIDIA | $0.60 | $1.80 | N/A | 15.0 |
| Kimi K2 Thinking | Kimi | $0.60 | $2.50 | N/A | 40.9 |
| GLM-4.7 (Reasoning) | Z AI | $0.60 | $2.20 | N/A | 42.1 |
| Kimi K2.5 (Reasoning) | Kimi | $0.60 | $3.00 | N/A | 46.8 |
| QwQ 32B | Alibaba | $0.66 | $1.00 | N/A | 19.7 |
| Qwen3 32B (Reasoning) | Alibaba | $0.70 | $8.40 | N/A | 16.5 |
| Qwen3 VL 235B A22B (Reasoning) | Alibaba | $0.70 | $8.40 | N/A | 27.6 |
| Qwen3 235B A22B 2507 Instruct | Alibaba | $0.70 | $2.80 | N/A | 25.0 |
| Qwen3 VL 32B (Reasoning) | Alibaba | $0.70 | $8.40 | N/A | 24.7 |
| Qwen3 VL 32B Instruct | Alibaba | $0.70 | $2.80 | N/A | 17.2 |
| Qwen3 VL 235B A22B Instruct | Alibaba | $0.70 | $2.80 | N/A | 20.8 |
| Qwen3 235B A22B (Reasoning) | Alibaba | $0.70 | $8.40 | N/A | 19.8 |
| DeepSeek R1 Distill Llama 70B | DeepSeek | $0.70 | $1.05 | 131K | 16.0 |
| Qwen3 235B A22B 2507 (Reasoning) | Alibaba | $0.70 | $8.40 | N/A | 29.5 |
| Llama 3.2 Instruct 90B (Vision) | Meta | $0.72 | $0.72 | N/A | 11.9 |
| GPT-5.4 mini (xhigh) | OpenAI | $0.75 | $4.50 | N/A | 48.1 |
| Claude 3.5 Haiku | Anthropic | $0.80 | $4.00 | 200K | 18.7 |
| Nova Pro | Amazon | $0.80 | $3.20 | 300K | 13.5 |
| Kimi K2 0905 | Kimi | $0.80 | $2.25 | N/A | 30.9 |
| Apertus 70B Instruct | Swiss AI Initiative | $0.82 | $2.92 | N/A | 7.7 |
| Claude 4.5 Haiku (Reasoning) | Anthropic | $1.00 | $5.00 | N/A | 37.1 |
| GLM-5 (Reasoning) | Z AI | $1.00 | $3.20 | N/A | 49.8 |
| Hermes 4 - Llama-3.1 405B (Reasoning) | Nous Research | $1.00 | $3.00 | N/A | 18.6 |
| MiMo-V2-Pro | Xiaomi | $1.00 | $3.00 | N/A | 49.2 |
| o3-mini | OpenAI | $1.10 | $4.40 | N/A | 25.9 |
| o4-mini (high) | OpenAI | $1.10 | $4.40 | N/A | 33.1 |
| Llama 3.1 Nemotron Instruct 70B | NVIDIA | $1.20 | $1.20 | N/A | 13.4 |
| Qwen3 Max (Preview) | Alibaba | $1.20 | $6.00 | N/A | 26.1 |
| Qwen3 Max | Alibaba | $1.20 | $6.00 | N/A | 31.4 |
| Qwen3 Max Thinking | Alibaba | $1.20 | $6.00 | N/A | 39.9 |
| Qwen3 Max Thinking (Preview) | Alibaba | $1.20 | $6.00 | N/A | 32.5 |
| Nova 2.0 Pro Preview (medium) | Amazon | $1.25 | $10 | N/A | 35.7 |
| GPT-5 (minimal) | OpenAI | $1.25 | $10 | N/A | 23.9 |
| GPT-5.1 (high) | OpenAI | $1.25 | $10 | N/A | 47.7 |
| GPT-5 (high) | OpenAI | $1.25 | $10 | N/A | 44.6 |
| GPT-5.1 Codex (high) | OpenAI | $1.25 | $10 | N/A | 43.1 |
| GPT-5 (ChatGPT) | OpenAI | $1.25 | $10 | N/A | 21.8 |
| GPT-5 Codex (high) | OpenAI | $1.25 | $10 | N/A | 44.6 |
| Gemini 2.5 Pro Preview (May' 25) | $1.25 | $10 | N/A | 29.5 |
| Gemini 2.5 Pro | $1.25 | $10 | 1.0M | 34.6 |
| DeepSeek V3 0324 | DeepSeek | $1.25 | $1.45 | 164K | 22.3 |
| DeepSeek R1 (Jan '25) | DeepSeek | $1.35 | $4.00 | N/A | 18.8 |
| DeepSeek R1 0528 (May '25) | DeepSeek | $1.35 | $5.40 | N/A | 27.1 |
| Qwen3 Coder 480B A35B Instruct | Alibaba | $1.50 | $7.50 | N/A | 24.8 |
| Qwen2.5 Max | Alibaba | $1.60 | $6.40 | N/A | 16.3 |
| GPT-5.2 (xhigh) | OpenAI | $1.75 | $14 | N/A | 51.3 |
| GPT-5.2 Codex (xhigh) | OpenAI | $1.75 | $14 | N/A | 49.0 |
| GPT-5.3 Codex (xhigh) | OpenAI | $1.75 | $14 | N/A | 54.0 |
| Jamba 1.5 Large | AI21 Labs | $2.00 | $8.00 | N/A | 10.7 |
| Gemini 3 Pro Preview (high) | $2.00 | $12 | N/A | 48.4 |
| Gemini 3.1 Pro Preview | $2.00 | $12 | 1.0M | 57.2 |
| Jamba 1.7 Large | AI21 Labs | $2.00 | $8.00 | N/A | 10.9 |
| Mistral Large 2 (Nov '24) | Mistral | $2.00 | $6.00 | N/A | 15.1 |
| Pixtral Large | Mistral | $2.00 | $6.00 | N/A | 14.0 |
| Grok 4.20 Beta 0309 (Reasoning) | xAI | $2.00 | $6.00 | N/A | 48.5 |
| Magistral Medium 1.2 | Mistral | $2.00 | $5.00 | N/A | 27.1 |
| GPT-4.1 | OpenAI | $2.00 | $8.00 | 1.0M | 26.3 |
| o3 | OpenAI | $2.00 | $8.00 | 200K | 38.4 |
| Jamba 1.6 Large | AI21 Labs | $2.00 | $8.00 | N/A | 10.6 |
| GPT-5.4 (xhigh) | OpenAI | $2.50 | $15 | N/A | 57.2 |
| Nova Premier | Amazon | $2.50 | $13 | 1M | 19.0 |
| Command A | Cohere | $2.50 | $10 | 256K | 13.5 |
| GPT-4o (Aug '24) | OpenAI | $2.50 | $10 | N/A | 18.6 |
| Llama 3.1 Instruct 405B | Meta | $2.75 | $6.50 | N/A | 17.4 |
| Mistral Medium | Mistral | $2.75 | $8.10 | N/A | 9.0 |
| Claude 4 Sonnet (Reasoning) | Anthropic | $3.00 | $15 | N/A | 38.7 |
| Command-R+ (Apr '24) | Cohere | $3.00 | $15 | N/A | 8.3 |
| Grok 4 | xAI | $3.00 | $15 | 256K | 41.5 |
| Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | $3.00 | $15 | N/A | 51.7 |
| Claude 3.7 Sonnet (Reasoning) | Anthropic | $3.00 | $15 | N/A | 34.7 |
| Grok 3 | xAI | $3.00 | $15 | 131K | 25.2 |
| Claude 3.5 Sonnet (Oct '24) | Anthropic | $3.00 | $15 | N/A | 15.9 |
| Claude 4.5 Sonnet (Reasoning) | Anthropic | $3.00 | $15 | N/A | 43.0 |
| Claude 3 Sonnet | Anthropic | $3.00 | $15 | N/A | 10.3 |
| Mistral Large (Feb '24) | Mistral | $4.00 | $12 | N/A | 9.9 |
| Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | $5.00 | $25 | N/A | 53.0 |
| Claude Opus 4.5 (Reasoning) | Anthropic | $5.00 | $25 | N/A | 49.7 |
| GPT-4 Turbo | OpenAI | $10 | $30 | 128K | 13.7 |
| Claude 3 Opus | Anthropic | $15 | $75 | 200K | 18.0 |
| Claude 4.1 Opus (Reasoning) | Anthropic | $15 | $75 | N/A | 42.0 |
| o1 | OpenAI | $15 | $60 | 200K | 30.8 |
| Claude 4 Opus (Reasoning) | Anthropic | $15 | $75 | N/A | 39.0 |
| o1-preview | OpenAI | $17 | $66 | N/A | 23.7 |
| o3-pro | OpenAI | $20 | $80 | N/A | 40.7 |
| GPT-4 | OpenAI | $30 | $60 | 8K | 12.8 |
| o1-pro | OpenAI | $150 | $600 | 200K | 25.8 |
Yes, there are 136 free models available, and some rank well on intelligence benchmarks. Free models are a great way to start, though premium models generally offer better quality for demanding tasks.