AIToolRank

AI model specs, pricing & comparisons.

© 2026 AIToolRank


AI Model Pricing Comparison 2026

Compare pricing for 348+ AI models side by side. From free open-source models to premium enterprise options, find the right price-performance balance for your use case.

Price Tiers

Tier | Price per 1M tokens | Models | Top by intelligence
Free | $0 | 136 | GLM-5-Turbo: 46.8; MiMo-V2-Omni: 43.4; Doubao Seed Code: 33.5
Budget | <$1 | 150 | GLM-5 (Reasoning): 49.8; MiniMax-M2.7: 49.6; MiMo-V2-Pro: 49.2
Mid-Range | $1-$5 | 53 | Gemini 3.1 Pro Preview: 57.2; GPT-5.4 (xhigh): 57.2; GPT-5.3 Codex (xhigh): 54.0
Premium | $5-$15 | 5 | Claude 4.1 Opus (Reasoning): 42.0; Claude 4 Opus (Reasoning): 39.0; o1: 30.8
Enterprise | $15+ | 4 | o3-pro: 40.7; o1-pro: 25.8; o1-preview: 23.7
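
The tier boundaries can be sketched as a small lookup. This is a rough illustration rather than the site's actual logic: it assumes tiers are keyed on the input price per 1M tokens and that each upper bound is inclusive. That reading is inferred from the listings on this page (GLM-5 (Reasoning) at $1.00 input appears under Budget, and o1 at $15 input appears under Premium). The function name `price_tier` is hypothetical.

```python
def price_tier(input_price_per_1m: float) -> str:
    """Map an input price (USD per 1M tokens) to a tier label.

    ASSUMPTION: tiers are keyed on input price with inclusive upper
    bounds, inferred from where models are listed on this page.
    """
    if input_price_per_1m == 0:
        return "Free"
    if input_price_per_1m <= 1:
        return "Budget"
    if input_price_per_1m <= 5:
        return "Mid-Range"
    if input_price_per_1m <= 15:
        return "Premium"
    return "Enterprise"


# A few input prices taken from the pricing table on this page.
for name, price in [("GLM-5 (Reasoning)", 1.00), ("GPT-5.4 (xhigh)", 2.50), ("o1", 15), ("o3-pro", 20)]:
    print(f"{name}: {price_tier(price)}")
```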

Understanding AI Pricing

AI model pricing is based on tokens, which are the fundamental units of text that language models process. One token is roughly 3/4 of an English word, so 1 million tokens equals approximately 750,000 words or about 1,500 pages of text.
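
As a rough sketch of that rule of thumb, the conversions can be written as follows. The 0.75 words-per-token ratio comes from the paragraph above; the 500 words-per-page figure is an assumption implied by "750,000 words or about 1,500 pages". Real tokenizers vary by model and language, so treat the results as estimates only.

```python
WORDS_PER_TOKEN = 0.75  # rule of thumb for English text
WORDS_PER_PAGE = 500    # ASSUMPTION: implied by 750,000 words ~ 1,500 pages


def tokens_to_words(tokens: float) -> float:
    """Approximate number of English words in a given token count."""
    return tokens * WORDS_PER_TOKEN


def words_to_tokens(words: float) -> float:
    """Approximate number of tokens needed for a given word count."""
    return words / WORDS_PER_TOKEN


def tokens_to_pages(tokens: float) -> float:
    """Approximate number of pages for a given token count."""
    return tokens_to_words(tokens) / WORDS_PER_PAGE


print(tokens_to_words(1_000_000))  # words in 1M tokens
print(tokens_to_pages(1_000_000))  # pages in 1M tokens
print(words_to_tokens(750))        # tokens for a 750-word prompt
```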

Most providers charge separately for input tokens (the text you send to the model) and output tokens (the text the model generates). Output tokens typically cost 2-4x more than input tokens because generating new text requires more computational work than reading existing text. Each output token must be produced sequentially through a full forward pass of the neural network.
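
The 2-4x figure is a rule of thumb, and the ratio for a given model can be checked directly from its listed prices. The sketch below uses four rows from the pricing table on this page; the helper name `output_input_ratio` is hypothetical.

```python
# (input $/1M, output $/1M) as listed in the pricing table on this page.
PRICES = {
    "Gemma 3n E4B Instruct": (0.02, 0.04),
    "GPT-4o mini": (0.15, 0.60),
    "Claude 3 Haiku": (0.25, 1.25),
    "GPT-5 (high)": (1.25, 10.00),
}


def output_input_ratio(input_price: float, output_price: float) -> float:
    """How many times more an output token costs than an input token."""
    return output_price / input_price


for model, (inp, out) in PRICES.items():
    print(f"{model}: output costs {output_input_ratio(inp, out):.1f}x input")
```

In this sample the ratio ranges from 2x to 8x, so it is worth checking both prices for any model you shortlist, especially if your workload is output-heavy.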

To estimate your monthly token volume, multiply your average daily requests by 30, then by your average tokens per request. Multiply that volume, expressed in millions of tokens, by the price per 1M tokens to get the monthly cost. For example, 100 requests per day at 2,000 tokens each equals 6 million tokens per month. At $3/1M tokens, that would be $18/month. Some providers also offer cached token pricing at a discount for repeated prompts, and batch processing at lower rates for non-real-time workloads.
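
The arithmetic in this example can be sketched in a few lines. The function names are hypothetical, and the sketch uses a single flat price per 1M tokens, as the example does; it does not model cached or batch discounts.

```python
def monthly_tokens(requests_per_day: int, tokens_per_request: int, days: int = 30) -> int:
    """Total tokens processed in a month of `days` days."""
    return requests_per_day * days * tokens_per_request


def monthly_cost(tokens: int, price_per_1m: float) -> float:
    """Cost in USD at a flat price per 1M tokens."""
    return tokens / 1_000_000 * price_per_1m


volume = monthly_tokens(requests_per_day=100, tokens_per_request=2_000)
print(volume)                      # 6000000
print(monthly_cost(volume, 3.00))  # 18.0
```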

Free models are a good starting point for experimentation and low-volume use cases. As your needs grow, budget and mid-range models balance quality and cost; on this page, the highest intelligence scores (57.2) sit in the mid-range tier. Premium and enterprise pricing does not by itself guarantee a higher benchmark score, so compare scores as well as prices before paying more for tasks that demand maximum accuracy, such as complex coding or research analysis.

Monthly Cost Example

100 requests/day at 2,000 tokens each (input + output)

Model | Provider | Input/1M | Output/1M | Est. Monthly
Gemma 3n E4B Instruct | Google | $0.02 | $0.04 | $0.36/mo
Granite 3.3 8B (Non-reasoning) | IBM | $0.03 | $0.25 | $1.68/mo
LFM2 24B A2B | Liquid AI | $0.03 | $0.12 | $0.90/mo
Nova Micro | Amazon | $0.04 | $0.14 | $1.05/mo
NVIDIA Nemotron Nano 9B V2 (Reasoning) | NVIDIA | $0.04 | $0.16 | $1.20/mo
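
The monthly estimates in this table can be approximated with the sketch below. It rests on an inference, not on a documented method: the listed figures match what you get by billing the full 6 million tokens per month at both the input rate and the output rate. Under that assumption four of the five rows match exactly, while Nova Micro comes out at $1.08 against the listed $1.05, possibly because the displayed input price is rounded. The name `est_monthly` is hypothetical.

```python
# Monthly volume from the example: 100 requests/day * 30 days * 2,000 tokens.
TOKENS_PER_MONTH = 100 * 30 * 2_000  # 6,000,000

# (model, input $/1M, output $/1M, listed estimate in $/mo), copied from this page.
ROWS = [
    ("Gemma 3n E4B Instruct", 0.02, 0.04, 0.36),
    ("Granite 3.3 8B (Non-reasoning)", 0.03, 0.25, 1.68),
    ("LFM2 24B A2B", 0.03, 0.12, 0.90),
    ("Nova Micro", 0.04, 0.14, 1.05),
    ("NVIDIA Nemotron Nano 9B V2 (Reasoning)", 0.04, 0.16, 1.20),
]


def est_monthly(input_price: float, output_price: float, tokens: int = TOKENS_PER_MONTH) -> float:
    """ASSUMPTION: the full monthly volume is billed at both rates."""
    millions = tokens / 1_000_000
    return round(millions * (input_price + output_price), 2)


for name, inp, out, listed in ROWS:
    print(f"{name}: computed ${est_monthly(inp, out):.2f}/mo, listed ${listed:.2f}/mo")
```

If your requests split 2,000 tokens between prompt and completion instead, pass the input and output volumes separately and expect roughly half these totals.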

All Model Pricing

Model | Provider | Input/1M | Output/1M | Context | Intelligence
LFM2.5-1.2B-Thinking | Liquid AI | Free | Free | N/A | 8.1
Devstral Small 2 | Mistral | Free | Free | N/A | 19.5
LFM2 2.6B | Liquid AI | Free | Free | N/A | 8.0

Frequently Asked Questions

How is AI model pricing calculated?

AI models charge per token, where a token is roughly 3/4 of a word. Prices are quoted per million tokens. Most models charge separately for input (prompt) tokens and output (completion) tokens, with output typically costing 2-4x more than input.

Why do output tokens cost more than input tokens?

Output tokens require the model to generate new text, which is computationally more expensive than reading input. The model must perform inference for each output token sequentially, while input tokens can be processed in parallel.

What is the cheapest good AI model?

Among paid models with benchmark data, Gemma 3n E4B Instruct offers some of the lowest pricing at $0.02/1M input tokens. There are also many free models available.

How much does it cost to use AI models per month?

For a typical individual workload of 100 requests per day at 2,000 tokens each, monthly costs range from $0 (free models) to $50+ (premium models). Enterprise usage at higher volumes can cost significantly more. Use the cost examples on this page to estimate your specific usage.

Model | Provider | Input/1M | Output/1M | Context | Intelligence
LFM2.5-VL-1.6B | Liquid AI | Free | Free | N/A | 6.2
LFM2.5-1.2B-Instruct | Liquid AI | Free | Free | N/A | 8.0
Olmo 3.1 32B Think | Allen Institute for AI | Free | Free | N/A | 13.9
Molmo2-8B | Allen Institute for AI | Free | Free | N/A | 7.3
Molmo 7B-D | Allen Institute for AI | Free | Free | N/A | 9.2
Gemma 3 1B Instruct | Google | Free | Free | N/A | 5.5
Sonar | Perplexity | Free | Free | 127K | 15.5
Step3 VL 10B | StepFun | Free | Free | N/A | 15.4
Kimi Linear 48B A3B Instruct | Kimi | Free | Free | N/A | 14.4
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) | NVIDIA | Free | Free | N/A | 14.4
NVIDIA Nemotron 3 Nano 4B | NVIDIA | Free | Free | N/A | 14.7
Llama 3.3 Nemotron Super 49B v1 (Reasoning) | NVIDIA | Free | Free | N/A | 18.5
Solar Pro 2 (Reasoning) | Upstage | Free | Free | N/A | 14.9
Devstral 2 | Mistral | Free | Free | N/A | 22.0
Solar Open 100B (Reasoning) | Upstage | Free | Free | N/A | 21.7
LFM2 8B A1B | Liquid AI | Free | Free | N/A | 7.0
Sonar Pro | Perplexity | Free | Free | 200K | 15.2
DeepSeek V3.2 Speciale | DeepSeek | Free | Free | 164K | 29.4
DeepSeek R1 0528 Qwen3 8B | DeepSeek | Free | Free | 128K | 16.4
R1 1776 | Perplexity | Free | Free | N/A | 12.0
Falcon-H1R-7B | TII UAE | Free | Free | N/A | 15.8
Phi-4 Multimodal Instruct | Microsoft Azure | Free | Free | N/A | 10.0
Phi-4 Mini Instruct | Microsoft Azure | Free | Free | N/A | 8.4
Qwen3 4B 2507 (Reasoning) | Alibaba | Free | Free | N/A | 18.2
Qwen2.5 Instruct 32B | Alibaba | Free | Free | N/A | 13.2
Qwen3 4B 2507 Instruct | Alibaba | Free | Free | N/A | 12.9
Qwen3 VL 4B (Reasoning) | Alibaba | Free | Free | N/A | 13.7
GPT-4o (ChatGPT) | OpenAI | Free | Free | N/A | 14.1
Qwen Chat 72B | Alibaba | Free | Free | N/A | 8.8
DBRX Instruct | Databricks | Free | Free | N/A | 8.3
Qwen1.5 Chat 110B | Alibaba | Free | Free | N/A | 9.5
Llama 2 Chat 70B | Meta | Free | Free | N/A | 8.4
Llama 2 Chat 13B | Meta | Free | Free | N/A | 8.4
Gemini 2.0 Pro Experimental (Feb '25) | Google | Free | Free | N/A | 18.1
Gemini 2.0 Flash (experimental) | Google | Free | Free | N/A | 16.8
Gemini 1.5 Pro (Sep '24) | Google | Free | Free | N/A | 16.0
Gemini 2.0 Flash-Lite (Preview) | Google | Free | Free | N/A | 14.5
Gemini 1.0 Pro | Google | Free | Free | 33K | 8.5
Solar Pro 2 (Preview) (Reasoning) | Upstage | Free | Free | N/A | 18.8
Gemini 1.5 Flash (Sep '24) | Google | Free | Free | N/A | 13.8
Gemini 1.5 Flash-8B | Google | Free | Free | N/A | 11.1
LFM2 1.2B | Liquid AI | Free | Free | N/A | 6.3
Gemini 2.5 Flash Preview (Sep '25) (Reasoning) | Google | Free | Free | N/A | 31.1
LFM 40B | Liquid AI | Free | Free | N/A | 8.8
Gemini 1.0 Ultra | Google | Free | Free | N/A | 10.1
Gemini 2.0 Flash-Lite (Feb '25) | Google | Free | Free | N/A | 14.7
Qwen2.5 Coder Instruct 7B | Alibaba | Free | Free | N/A | 10.0
Gemini 2.0 Flash Thinking Experimental (Jan '25) | Google | Free | Free | N/A | 19.6
Gemma 3n E4B Instruct Preview (May '25) | Google | Free | Free | N/A | 10.1
PALM-2 | Google | Free | Free | N/A | 8.6
Gemini 2.5 Pro Preview (Mar '25) | Google | Free | Free | N/A | 30.3
Phi-3 Mini Instruct 3.8B | Microsoft Azure | Free | Free | N/A | 10.1
OpenChat 3.5 (1210) | OpenChat | Free | Free | N/A | 8.3
Grok 2 (Dec '24) | xAI | Free | Free | N/A | 13.9
Grok 3 Reasoning Beta | xAI | Free | Free | N/A | 21.6
Grok Beta | xAI | Free | Free | N/A | 13.3
Claude Instant | Anthropic | Free | Free | N/A | 7.4
DeepSeek-V2-Chat | DeepSeek | Free | Free | N/A | 9.1
DeepSeek Coder V2 Lite Instruct | DeepSeek | Free | Free | N/A | 8.5
DeepSeek R1 Distill Qwen 1.5B | DeepSeek | Free | Free | N/A | 9.1
DeepSeek LLM 67B Chat (V1) | DeepSeek | Free | Free | N/A | 8.4
DeepSeek R1 Distill Llama 8B | DeepSeek | Free | Free | N/A | 12.1
GPT-4o (March 2025, chatgpt-4o-latest) | OpenAI | Free | Free | N/A | 18.6
Claude 2.1 | Anthropic | Free | Free | N/A | 9.3
Claude 2.0 | Anthropic | Free | Free | N/A | 9.1
GPT-4.5 (Preview) | OpenAI | Free | Free | N/A | 20.0
Sonar Reasoning Pro | Perplexity | Free | Free | 128K | 24.6
Mixtral 8x22B Instruct | Mistral | Free | Free | N/A | 9.8
o1-mini | OpenAI | Free | Free | N/A | 20.4
Doubao Seed Code | ByteDance Seed | Free | Free | N/A | 33.5
Ring-1T | Inclusionai | Free | Free | N/A | 22.8
Qwen3 VL 4B Instruct | Alibaba | Free | Free | N/A | 9.6
Ling-1T | Inclusionai | Free | Free | N/A | 19.0
Mistral Saba | Mistral | Free | Free | N/A | 12.1
Sonar Reasoning | Perplexity | Free | Free | 127K | 17.9
Magistral Small 1 | Mistral | Free | Free | N/A | 16.8
Ling-mini-2.0 | Inclusionai | Free | Free | N/A | 9.2
Magistral Medium 1 | Mistral | Free | Free | N/A | 18.8
Qwen3.5 4B (Reasoning) | Alibaba | Free | Free | N/A | 27.1
MiniMax M1 40k | MiniMax | Free | Free | N/A | 20.9
Qwen3.5 2B (Reasoning) | Alibaba | Free | Free | N/A | 16.3
Qwen3.5 0.8B (Reasoning) | Alibaba | Free | Free | N/A | 10.5
Jamba Reasoning 3B | AI21 Labs | Free | Free | N/A | 9.6
Llama 3.1 Tulu3 405B | Allen Institute for AI | Free | Free | N/A | 14.1
OLMo 2 32B | Allen Institute for AI | Free | Free | N/A | 10.6
Jamba 1.7 Mini | AI21 Labs | Free | Free | N/A | 8.1
Apriel-v1.6-15B-Thinker | ServiceNow | Free | Free | N/A | 27.6
DeepSeek R1 Distill Qwen 14B | DeepSeek | Free | Free | N/A | 15.8
DeepSeek-V2.5 (Dec '24) | DeepSeek | Free | Free | N/A | 12.5
DeepSeek-Coder-V2 | DeepSeek | Free | Free | N/A | 10.6
Tiny Aya Global | Cohere | Free | Free | N/A | 4.7
Qwen Chat 14B | Alibaba | Free | Free | N/A | 7.4
GLM-5-Turbo | Z AI | Free | Free | N/A | 46.8
Nanbeige4.1-3B | Nanbeige | Free | Free | N/A | 16.1
Grok-1 | xAI | Free | Free | N/A | 11.7
Tri-21B-Think | Trillion Labs | Free | Free | N/A | 18.6
Tri-21B-think Preview | Trillion Labs | Free | Free | N/A | 20.0
LongCat Flash Lite | LongCat | Free | Free | N/A | 23.9
HyperCLOVA X SEED Think (32B) | Naver | Free | Free | N/A | 23.7
OLMo 2 7B | Allen Institute for AI | Free | Free | N/A | 9.3
Olmo 3 32B Think | Allen Institute for AI | Free | Free | N/A | 12.1
Mi:dm K 2.5 Pro | Korea Telecom | Free | Free | N/A | 23.1
K2 Think V2 | MBZUAI Institute of Foundation Models | Free | Free | N/A | 24.1
Sarvam M (Reasoning) | Sarvam | Free | Free | N/A | 8.4
K2-V2 (high) | MBZUAI Institute of Foundation Models | Free | Free | N/A | 20.6
Motif-2-12.7B-Reasoning | Motif Technologies | Free | Free | N/A | 19.1
INTELLECT-3 | Prime Intellect | Free | Free | N/A | 22.2
Sarvam 30B (high) | Sarvam | Free | Free | N/A | 12.3
Sarvam 105B (high) | Sarvam | Free | Free | N/A | 18.2
ERNIE 5.0 Thinking Preview | Baidu | Free | Free | N/A | 29.1
MiMo-V2-Omni | Xiaomi | Free | Free | N/A | 43.4
K-EXAONE (Reasoning) | LG AI Research | Free | Free | N/A | 32.1
Llama 65B | Meta | Free | Free | N/A | 7.4
EXAONE 4.0 32B (Reasoning) | LG AI Research | Free | Free | N/A | 16.7
Apriel-v1.5-15B-Thinker | ServiceNow | Free | Free | N/A | 28.3
Exaone 4.0 1.2B (Reasoning) | LG AI Research | Free | Free | N/A | 8.3
DeepHermes 3 - Mistral 24B Preview (Non-reasoning) | Nous Research | Free | Free | N/A | 10.9
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) | Nous Research | Free | Free | N/A | 7.6
Granite 4.0 350M | IBM | Free | Free | N/A | 6.1
Arctic Instruct | Snowflake | Free | Free | N/A | 8.8
Granite 4.0 H 1B | IBM | Free | Free | N/A | 8.0
Qwen2.5 Instruct 72B | Alibaba | Free | Free | N/A | 15.6
Qwen2.5 Coder Instruct 32B | Alibaba | Free | Free | N/A | 12.9
Qwen2 Instruct 72B | Alibaba | Free | Free | N/A | 11.7
Granite 4.0 H 350M | IBM | Free | Free | N/A | 5.4
Granite 4.0 1B | IBM | Free | Free | N/A | 7.3
Gemma 3 270M | Google | Free | Free | N/A | 7.7
Granite 4.0 Micro | IBM | Free | Free | N/A | 7.7
Gemma 3 4B Instruct | Google | Free | Free | N/A | 6.3
Gemma 3n E2B Instruct | Google | Free | Free | N/A | 4.8
Gemma 3 27B Instruct | Google | Free | Free | N/A | 10.3
Gemma 3 12B Instruct | Google | Free | Free | N/A | 8.8
Olmo 3 7B Think | Allen Institute for AI | Free | Free | N/A | 9.4
Gemma 3n E4B Instruct | Google | $0.02 | $0.04 | N/A | 6.4
Granite 3.3 8B (Non-reasoning) | IBM | $0.03 | $0.25 | N/A | 7.0
LFM2 24B A2B | Liquid AI | $0.03 | $0.12 | N/A | 10.5
Nova Micro | Amazon | $0.04 | $0.14 | 128K | 10.3
NVIDIA Nemotron Nano 9B V2 (Reasoning) | NVIDIA | $0.04 | $0.16 | N/A | 14.8
Llama 3 Instruct 8B | Meta | $0.04 | $0.14 | N/A | 6.4
Llama 2 Chat 7B | Meta | $0.05 | $0.25 | N/A | 9.7
GPT-5 nano (minimal) | OpenAI | $0.05 | $0.40 | N/A | 13.8
Qwen2.5 Turbo | Alibaba | $0.05 | $0.20 | N/A | 12.0
GPT-5 nano (high) | OpenAI | $0.05 | $0.40 | N/A | 26.8
Granite 4.0 H Small | IBM | $0.06 | $0.25 | N/A | 10.8
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | NVIDIA | $0.06 | $0.24 | N/A | 24.3
Nova Lite | Amazon | $0.06 | $0.24 | 300K | 12.7
Devstral Small (May '25) | Mistral | $0.06 | $0.12 | N/A | 18.0
gpt-oss-20B (high) | OpenAI | $0.06 | $0.20 | N/A | 24.5
GLM-4.7-Flash (Reasoning) | Z AI | $0.07 | $0.40 | N/A | 30.1
Llama 3.2 Instruct 3B | Meta | $0.08 | $0.08 | N/A | 9.7
MiMo-V2-Flash (Feb 2026) | Xiaomi | $0.10 | $0.30 | N/A | 41.5
GPT-4.1 nano | OpenAI | $0.10 | $0.40 | 1.0M | 13.0
Llama 3.1 Instruct 8B | Meta | $0.10 | $0.10 | N/A | 11.8
Llama 3.2 Instruct 1B | Meta | $0.10 | $0.10 | N/A | 6.3
Gemini 2.5 Flash-Lite (Reasoning) | Google | $0.10 | $0.40 | N/A | 17.6
Mistral Small 3 | Mistral | $0.10 | $0.30 | N/A | 12.7
Mistral Small 3.1 | Mistral | $0.10 | $0.30 | N/A | 14.5
Mistral Small 3.2 | Mistral | $0.10 | $0.30 | N/A | 15.1
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | Google | $0.10 | $0.40 | N/A | 21.6
Ministral 3 3B | Mistral | $0.10 | $0.10 | N/A | 11.2
Llama Nemotron Super 49B v1.5 (Reasoning) | NVIDIA | $0.10 | $0.40 | N/A | 18.7
Step 3.5 Flash | StepFun | $0.10 | $0.30 | N/A | 37.8
Olmo 3 7B Instruct | Allen Institute for AI | $0.10 | $0.20 | N/A | 8.2
Apertus 8B Instruct | Swiss AI Initiative | $0.10 | $0.20 | N/A | 5.9
Qwen3.5 9B (Reasoning) | Alibaba | $0.10 | $0.15 | N/A | 32.4
Qwen3 0.6B (Reasoning) | Alibaba | $0.11 | $1.26 | N/A | 6.5
Qwen3 4B (Reasoning) | Alibaba | $0.11 | $1.26 | N/A | 14.2
Qwen3 1.7B (Reasoning) | Alibaba | $0.11 | $1.26 | N/A | 8.0
QwQ 32B-Preview | Alibaba | $0.12 | $0.18 | N/A | 15.2
Phi-4 | Microsoft Azure | $0.13 | $0.50 | N/A | 10.4
Hermes 4 - Llama-3.1 70B (Reasoning) | Nous Research | $0.13 | $0.40 | N/A | 16.0
Ling-flash-2.0 | Inclusionai | $0.14 | $0.57 | N/A | 15.7
Ring-flash-2.0 | Inclusionai | $0.14 | $0.57 | N/A | 14.0
Gemini 2.0 Flash (Feb '25) | Google | $0.15 | $0.60 | N/A | 18.5
Solar Mini | Upstage | $0.15 | $0.15 | N/A | 11.9
GPT-4o mini | OpenAI | $0.15 | $0.60 | N/A | 12.6
gpt-oss-120B (high) | OpenAI | $0.15 | $0.60 | N/A | 33.3
Mistral Small 4 (Reasoning) | Mistral | $0.15 | $0.60 | N/A | 26.9
Ministral 3 8B | Mistral | $0.15 | $0.15 | N/A | 14.8
Llama 3.2 Instruct 11B (Vision) | Meta | $0.16 | $0.16 | N/A | 8.7
Llama 4 Scout | Meta | $0.17 | $0.66 | N/A | 13.5
Qwen3 VL 8B (Reasoning) | Alibaba | $0.18 | $2.10 | N/A | 16.7
Qwen3 8B (Reasoning) | Alibaba | $0.18 | $2.10 | N/A | 13.2
Qwen3 VL 8B Instruct | Alibaba | $0.18 | $0.70 | N/A | 14.3
Qwen3 30B A3B 2507 (Reasoning) | Alibaba | $0.20 | $2.40 | N/A | 22.4
GPT-5.4 nano (xhigh) | OpenAI | $0.20 | $1.25 | N/A | 44.4
Grok Code Fast 1 | xAI | $0.20 | $1.50 | 256K | 28.7
Ministral 3 14B | Mistral | $0.20 | $0.20 | N/A | 16.0
Qwen3 VL 30B A3B (Reasoning) | Alibaba | $0.20 | $2.40 | N/A | 19.7
NVIDIA Nemotron Nano 12B v2 VL (Reasoning) | NVIDIA | $0.20 | $0.60 | N/A | 14.9
Qwen3 30B A3B 2507 Instruct | Alibaba | $0.20 | $0.80 | N/A | 15.0
Qwen3 VL 30B A3B Instruct | Alibaba | $0.20 | $0.80 | N/A | 16.1
Olmo 3.1 32B Instruct | Allen Institute for AI | $0.20 | $0.60 | N/A | 12.2
Qwen3 30B A3B (Reasoning) | Alibaba | $0.20 | $2.40 | N/A | 15.3
Grok 4.1 Fast (Reasoning) | xAI | $0.20 | $0.50 | N/A | 38.6
Reka Flash 3 | Reka AI | $0.20 | $0.80 | N/A | 9.5
Jamba 1.6 Mini | AI21 Labs | $0.20 | $0.40 | N/A | 7.9
Grok 4 Fast (Reasoning) | xAI | $0.20 | $0.50 | N/A | 35.1
Jamba 1.5 Mini | AI21 Labs | $0.20 | $0.40 | N/A | 8.0
Mistral Small (Sep '24) | Mistral | $0.20 | $0.60 | N/A | 10.2
GLM-4.5-Air | Z AI | $0.20 | $1.10 | N/A | 23.2
Reka Flash (Sep '24) | Reka AI | $0.20 | $0.80 | N/A | 12.0
Seed-OSS-36B-Instruct | ByteDance Seed | $0.21 | $0.57 | N/A | 25.2
Qwen3.5 35B A3B (Reasoning) | Alibaba | $0.25 | $2.00 | N/A | 37.1
Mistral 7B Instruct | Mistral | $0.25 | $0.25 | N/A | 7.4
Qwen3 Omni 30B A3B Instruct | Alibaba | $0.25 | $0.97 | N/A | 10.7
Claude 3 Haiku | Anthropic | $0.25 | $1.25 | 200K | 12.3
GPT-5 mini (high) | OpenAI | $0.25 | $2.00 | N/A | 41.2
GPT-5.1 Codex mini (high) | OpenAI | $0.25 | $2.00 | N/A | 38.6
GPT-5 mini (minimal) | OpenAI | $0.25 | $2.00 | N/A | 20.7
Gemini 3.1 Flash-Lite Preview | Google | $0.25 | $1.50 | N/A | 33.5
Qwen3 Omni 30B A3B (Reasoning) | Alibaba | $0.25 | $0.97 | N/A | 15.6
Mercury 2 | Inception | $0.25 | $0.75 | 128K | 32.8
DeepSeek R1 Distill Qwen 32B | DeepSeek | $0.27 | $0.27 | 33K | 17.2
DeepSeek V3.2 Exp (Reasoning) | DeepSeek | $0.28 | $0.42 | N/A | 32.9
ERNIE 4.5 300B A47B | Baidu | $0.28 | $1.10 | 123K | 15.0
DeepSeek V3.2 (Reasoning) | DeepSeek | $0.28 | $0.42 | N/A | 41.7
Hermes 3 - Llama-3.1 70B | Nous Research | $0.30 | $0.30 | N/A | 10.6
Grok 3 mini Reasoning (high) | xAI | $0.30 | $0.50 | N/A | 32.1
MiniMax-M2.7 | MiniMax | $0.30 | $1.20 | N/A | 49.6
GLM-4.6V (Reasoning) | Z AI | $0.30 | $0.90 | N/A | 23.4
KAT-Coder-Pro V1 | KwaiKAT | $0.30 | $1.20 | N/A | 36.0
Nova 2.0 Lite (medium) | Amazon | $0.30 | $2.50 | N/A | 29.7
Nova 2.0 Omni (medium) | Amazon | $0.30 | $2.50 | N/A | 28.0
MiniMax-M2 | MiniMax | $0.30 | $1.20 | N/A | 36.1
MiniMax-M2.1 | MiniMax | $0.30 | $1.20 | N/A | 39.4
MiniMax-M2.5 | MiniMax | $0.30 | $1.20 | N/A | 41.9
Qwen3.5 27B (Reasoning) | Alibaba | $0.30 | $2.40 | N/A | 42.1
Gemini 2.5 Flash (Reasoning) | Google | $0.30 | $2.50 | N/A | 27.0
NVIDIA Nemotron 3 Super 120B A12B (Reasoning) | NVIDIA | $0.30 | $0.75 | N/A | 36.0
Llama 4 Maverick | Meta | $0.31 | $0.91 | N/A | 18.4
Qwen3 Coder Next | Alibaba | $0.35 | $1.20 | N/A | 28.3
Qwen3 14B (Reasoning) | Alibaba | $0.35 | $4.20 | N/A | 16.2
Devstral Medium | Mistral | $0.40 | $2.00 | N/A | 18.7
Mistral Medium 3.1 | Mistral | $0.40 | $2.00 | N/A | 21.3
DeepSeek V3 (Dec '24) | DeepSeek | $0.40 | $0.89 | N/A | 16.5
Qwen3.5 122B A10B (Reasoning) | Alibaba | $0.40 | $3.20 | N/A | 41.6
Mistral Medium 3 | Mistral | $0.40 | $2.00 | N/A | 18.8
DeepSeek V3.1 Terminus (Reasoning) | DeepSeek | $0.40 | $2.00 | N/A | 33.9
GPT-4.1 mini | OpenAI | $0.40 | $1.60 | 1.0M | 22.9
Qwen3 Coder 30B A3B Instruct | Alibaba | $0.45 | $2.25 | N/A | 20.0
GLM-4.5 (Reasoning) | Z AI | $0.49 | $1.90 | N/A | 26.4
Gemini 3 Flash Preview (Reasoning) | Google | $0.50 | $3.00 | N/A | 46.4
Mistral Large 3 | Mistral | $0.50 | $1.50 | N/A | 22.8
GPT-3.5 Turbo | OpenAI | $0.50 | $1.50 | 16K | 9.0
Magistral Small 1.2 | Mistral | $0.50 | $1.50 | N/A | 18.2
Qwen3 Next 80B A3B (Reasoning) | Alibaba | $0.50 | $6.00 | N/A | 26.7
Qwen3 Next 80B A3B Instruct | Alibaba | $0.50 | $2.00 | N/A | 20.1
Command-R (Mar '24) | Cohere | $0.50 | $1.50 | N/A | 7.4
Mixtral 8x7B Instruct | Mistral | $0.54 | $0.60 | N/A | 7.7
MiniMax M1 80k | MiniMax | $0.55 | $2.20 | N/A | 24.4
Llama 3.1 Instruct 70B | Meta | $0.56 | $0.56 | N/A | 12.5
DeepSeek V3.1 (Non-reasoning) | DeepSeek | $0.56 | $1.68 | N/A | 28.1
Kimi K2 | Kimi | $0.57 | $2.40 | N/A | 26.3
GLM-4.6 (Reasoning) | Z AI | $0.57 | $2.20 | N/A | 32.5
Llama 3 Instruct 70B | Meta | $0.58 | $1.75 | N/A | 8.9
Llama 3.3 Instruct 70B | Meta | $0.58 | $0.71 | N/A | 14.5
GLM-4.5V (Reasoning) | Z AI | $0.60 | $1.80 | N/A | 15.1
Qwen3.5 397B A17B (Reasoning) | Alibaba | $0.60 | $3.60 | N/A | 45.0
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | NVIDIA | $0.60 | $1.80 | N/A | 15.0
Kimi K2 Thinking | Kimi | $0.60 | $2.50 | N/A | 40.9
GLM-4.7 (Reasoning) | Z AI | $0.60 | $2.20 | N/A | 42.1
Kimi K2.5 (Reasoning) | Kimi | $0.60 | $3.00 | N/A | 46.8
QwQ 32B | Alibaba | $0.66 | $1.00 | N/A | 19.7
Qwen3 32B (Reasoning) | Alibaba | $0.70 | $8.40 | N/A | 16.5
Qwen3 VL 235B A22B (Reasoning) | Alibaba | $0.70 | $8.40 | N/A | 27.6
Qwen3 235B A22B 2507 Instruct | Alibaba | $0.70 | $2.80 | N/A | 25.0
Qwen3 VL 32B (Reasoning) | Alibaba | $0.70 | $8.40 | N/A | 24.7
Qwen3 VL 32B Instruct | Alibaba | $0.70 | $2.80 | N/A | 17.2
Qwen3 VL 235B A22B Instruct | Alibaba | $0.70 | $2.80 | N/A | 20.8
Qwen3 235B A22B (Reasoning) | Alibaba | $0.70 | $8.40 | N/A | 19.8
DeepSeek R1 Distill Llama 70B | DeepSeek | $0.70 | $1.05 | 131K | 16.0
Qwen3 235B A22B 2507 (Reasoning) | Alibaba | $0.70 | $8.40 | N/A | 29.5
Llama 3.2 Instruct 90B (Vision) | Meta | $0.72 | $0.72 | N/A | 11.9
GPT-5.4 mini (xhigh) | OpenAI | $0.75 | $4.50 | N/A | 48.1
Claude 3.5 Haiku | Anthropic | $0.80 | $4.00 | 200K | 18.7
Nova Pro | Amazon | $0.80 | $3.20 | 300K | 13.5
Kimi K2 0905 | Kimi | $0.80 | $2.25 | N/A | 30.9
Apertus 70B Instruct | Swiss AI Initiative | $0.82 | $2.92 | N/A | 7.7
Claude 4.5 Haiku (Reasoning) | Anthropic | $1.00 | $5.00 | N/A | 37.1
GLM-5 (Reasoning) | Z AI | $1.00 | $3.20 | N/A | 49.8
Hermes 4 - Llama-3.1 405B (Reasoning) | Nous Research | $1.00 | $3.00 | N/A | 18.6
MiMo-V2-Pro | Xiaomi | $1.00 | $3.00 | N/A | 49.2
o3-mini | OpenAI | $1.10 | $4.40 | N/A | 25.9
o4-mini (high) | OpenAI | $1.10 | $4.40 | N/A | 33.1
Llama 3.1 Nemotron Instruct 70B | NVIDIA | $1.20 | $1.20 | N/A | 13.4
Qwen3 Max (Preview) | Alibaba | $1.20 | $6.00 | N/A | 26.1
Qwen3 Max | Alibaba | $1.20 | $6.00 | N/A | 31.4
Qwen3 Max Thinking | Alibaba | $1.20 | $6.00 | N/A | 39.9
Qwen3 Max Thinking (Preview) | Alibaba | $1.20 | $6.00 | N/A | 32.5
Nova 2.0 Pro Preview (medium) | Amazon | $1.25 | $10 | N/A | 35.7
GPT-5 (minimal) | OpenAI | $1.25 | $10 | N/A | 23.9
GPT-5.1 (high) | OpenAI | $1.25 | $10 | N/A | 47.7
GPT-5 (high) | OpenAI | $1.25 | $10 | N/A | 44.6
GPT-5.1 Codex (high) | OpenAI | $1.25 | $10 | N/A | 43.1
GPT-5 (ChatGPT) | OpenAI | $1.25 | $10 | N/A | 21.8
GPT-5 Codex (high) | OpenAI | $1.25 | $10 | N/A | 44.6
Gemini 2.5 Pro Preview (May '25) | Google | $1.25 | $10 | N/A | 29.5
Gemini 2.5 Pro | Google | $1.25 | $10 | 1.0M | 34.6
DeepSeek V3 0324 | DeepSeek | $1.25 | $1.45 | 164K | 22.3
DeepSeek R1 (Jan '25) | DeepSeek | $1.35 | $4.00 | N/A | 18.8
DeepSeek R1 0528 (May '25) | DeepSeek | $1.35 | $5.40 | N/A | 27.1
Qwen3 Coder 480B A35B Instruct | Alibaba | $1.50 | $7.50 | N/A | 24.8
Qwen2.5 Max | Alibaba | $1.60 | $6.40 | N/A | 16.3
GPT-5.2 (xhigh) | OpenAI | $1.75 | $14 | N/A | 51.3
GPT-5.2 Codex (xhigh) | OpenAI | $1.75 | $14 | N/A | 49.0
GPT-5.3 Codex (xhigh) | OpenAI | $1.75 | $14 | N/A | 54.0
Jamba 1.5 Large | AI21 Labs | $2.00 | $8.00 | N/A | 10.7
Gemini 3 Pro Preview (high) | Google | $2.00 | $12 | N/A | 48.4
Gemini 3.1 Pro Preview | Google | $2.00 | $12 | 1.0M | 57.2
Jamba 1.7 Large | AI21 Labs | $2.00 | $8.00 | N/A | 10.9
Mistral Large 2 (Nov '24) | Mistral | $2.00 | $6.00 | N/A | 15.1
Pixtral Large | Mistral | $2.00 | $6.00 | N/A | 14.0
Grok 4.20 Beta 0309 (Reasoning) | xAI | $2.00 | $6.00 | N/A | 48.5
Magistral Medium 1.2 | Mistral | $2.00 | $5.00 | N/A | 27.1
GPT-4.1 | OpenAI | $2.00 | $8.00 | 1.0M | 26.3
o3 | OpenAI | $2.00 | $8.00 | 200K | 38.4
Jamba 1.6 Large | AI21 Labs | $2.00 | $8.00 | N/A | 10.6
GPT-5.4 (xhigh) | OpenAI | $2.50 | $15 | N/A | 57.2
Nova Premier | Amazon | $2.50 | $13 | 1M | 19.0
Command A | Cohere | $2.50 | $10 | 256K | 13.5
GPT-4o (Aug '24) | OpenAI | $2.50 | $10 | N/A | 18.6
Llama 3.1 Instruct 405B | Meta | $2.75 | $6.50 | N/A | 17.4
Mistral Medium | Mistral | $2.75 | $8.10 | N/A | 9.0
Claude 4 Sonnet (Reasoning) | Anthropic | $3.00 | $15 | N/A | 38.7
Command-R+ (Apr '24) | Cohere | $3.00 | $15 | N/A | 8.3
Grok 4 | xAI | $3.00 | $15 | 256K | 41.5
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | $3.00 | $15 | N/A | 51.7
Claude 3.7 Sonnet (Reasoning) | Anthropic | $3.00 | $15 | N/A | 34.7
Grok 3 | xAI | $3.00 | $15 | 131K | 25.2
Claude 3.5 Sonnet (Oct '24) | Anthropic | $3.00 | $15 | N/A | 15.9
Claude 4.5 Sonnet (Reasoning) | Anthropic | $3.00 | $15 | N/A | 43.0
Claude 3 Sonnet | Anthropic | $3.00 | $15 | N/A | 10.3
Mistral Large (Feb '24) | Mistral | $4.00 | $12 | N/A | 9.9
Claude Opus 4.6 (Adaptive Reasoning, Max Effort) | Anthropic | $5.00 | $25 | N/A | 53.0
Claude Opus 4.5 (Reasoning) | Anthropic | $5.00 | $25 | N/A | 49.7
GPT-4 Turbo | OpenAI | $10 | $30 | 128K | 13.7
Claude 3 Opus | Anthropic | $15 | $75 | 200K | 18.0
Claude 4.1 Opus (Reasoning) | Anthropic | $15 | $75 | N/A | 42.0
o1 | OpenAI | $15 | $60 | 200K | 30.8
Claude 4 Opus (Reasoning) | Anthropic | $15 | $75 | N/A | 39.0
o1-preview | OpenAI | $17 | $66 | N/A | 23.7
o3-pro | OpenAI | $20 | $80 | N/A | 40.7
GPT-4 | OpenAI | $30 | $60 | 8K | 12.8
o1-pro | OpenAI | $150 | $600 | 200K | 25.8
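
One way to compare price and performance across rows is to divide the intelligence score by a combined price. The sketch below is an illustration only: the metric (score divided by the sum of input and output price per 1M tokens) and the names `Model` and `score_per_dollar` are assumptions made for this example, not something the page defines. The five rows are copied from the table above.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens
    intelligence: float  # benchmark score as listed on this page


MODELS = [
    Model("Gemini 3.1 Pro Preview", 2.00, 12.00, 57.2),
    Model("GPT-5.4 (xhigh)", 2.50, 15.00, 57.2),
    Model("GLM-5 (Reasoning)", 1.00, 3.20, 49.8),
    Model("MiniMax-M2.7", 0.30, 1.20, 49.6),
    Model("o1-pro", 150.00, 600.00, 25.8),
]


def score_per_dollar(m: Model) -> float:
    """HYPOTHETICAL metric: intelligence points per combined dollar of price."""
    return m.intelligence / (m.input_price + m.output_price)


# Highest value first.
ranked = sorted(MODELS, key=score_per_dollar, reverse=True)
for m in ranked:
    print(f"{m.name}: {score_per_dollar(m):.2f} points per $")
```

A ratio like this favors cheap models heavily, so it works best as a tiebreaker among models that already meet your quality bar. Free models would need separate handling, since their combined price is zero.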
Are free AI models any good?

Yes, there are 136 free models available, and some rank well on intelligence benchmarks. Free models are a great way to start, though premium models generally offer better quality for demanding tasks.