LIVE Cheapest: GLM-4.7-Flash $0.000/Mtok in 153 models tracked Updated Invalid Date
Invalid Date
LLMPrice$/Mtok
Pricing / Cheapest

Cheapest LLM APIs

153 models ranked by blended cost — the average of input and output per million tokens. Lower is cheaper.

Top 10 by blended cost

Lowest token prices right now

$0/M
$0.02/M
$0.02/M
$0.02/M
$0.025/M
$0.03/M
$0.05/M
$0.06/M
$0.065/M
153 models · sorted by blended cost (cheapest first)
Model Provider Input $/Mtok Output $/Mtok Blended Context Category
GLM-4.7-Flash Zhipu $0.000 $0.000
$0.000
128K Open
Rerank 3.5 Cohere $0.020 $0.020
$0.020
text-embedding-3-small OpenAI $0.020 $0.020
$0.020
8K
rerank-2.5-lite Voyage AI $0.020 $0.020
$0.020
voyage-4-lite Voyage AI $0.020 $0.020
$0.020
text-embedding-004 Google $0.025 $0.025
$0.025
2K
GLM-OCR Zhipu $0.030 $0.030
$0.030
128K Open
rerank-2.5 Voyage AI $0.050 $0.050
$0.050
voyage-4 Voyage AI $0.060 $0.060
$0.060
Granite 4.0 Micro IBM $0.017 $0.112
$0.065
128K Open
Llama 3.1 8B Meta $0.050 $0.080
$0.065
128K Open
Baichuan M2-32B Baichuan $0.070 $0.070
$0.070
33K Open
LFM2 24B A2B Together $0.030 $0.120
$0.075
128K Open
Nova Micro Amazon $0.035 $0.140
$0.088
128K
Ministral 3 3B Mistral $0.100 $0.100
$0.100
128K Open
Pixtral 12B Mistral $0.100 $0.100
$0.100
128K Open
Nemotron 70B Instruct NVIDIA $0.100 $0.100
$0.100
128K Open
Reka Edge Reka $0.100 $0.100
$0.100
66K
GLM-4-32B-0414 Zhipu $0.100 $0.100
$0.100
128K Open
Granite Embedding 278M Multilingual IBM $0.106 $0.106
$0.106
Open
Embed 4 Cohere $0.120 $0.120
$0.120
voyage-4-large Voyage AI $0.120 $0.120
$0.120
voyage-multimodal-3.5 Voyage AI $0.120 $0.120
$0.120
Qwen-Turbo Alibaba $0.050 $0.200
$0.125
1M
text-embedding-3-large OpenAI $0.130 $0.130
$0.130
8K
Mistral Small 3.2 24B Mistral $0.080 $0.200
$0.140
128K Open
Nova Lite Amazon $0.060 $0.240
$0.150
300K
Granite 4 H Small IBM $0.060 $0.250
$0.155
128K Open
voyage-code-3 Voyage AI $0.180 $0.180
$0.180
32K
voyage-context-3 Voyage AI $0.180 $0.180
$0.180
32K
GPT OSS 20B Fireworks $0.070 $0.300
$0.185
128K Open
Gemini 2.5 Flash Google $0.075 $0.300
$0.188
1M
Voxtral Small 24B Mistral $0.100 $0.300
$0.200
128K Open
DeepSeek V4 Flash DeepSeek $0.140 $0.280
$0.210
1M
DeepSeek V4 Flash Fireworks $0.140 $0.280
$0.210
1M
Llama 4 Scout Meta $0.110 $0.340
$0.225
10M Open
GLM-4.7-FlashX Zhipu $0.070 $0.400
$0.235
128K Open
Gemini 2.5 Flash-Lite Google $0.100 $0.400
$0.250
1M
GPT-4.1 nano OpenAI $0.100 $0.400
$0.250
1M
Qwen-Flash Alibaba $0.115 $0.460
$0.288
1M
Jamba Mini AI21 Labs $0.200 $0.400
$0.300
256K Open
Grok 4.1 Fast xAI $0.200 $0.500
$0.350
2M
Command R 08-2024 Cohere $0.150 $0.600
$0.375
128K
GPT OSS 120B Fireworks $0.150 $0.600
$0.375
128K Open
Granite 4 H Medium IBM $0.150 $0.600
$0.375
128K Open
Mistral Small 4 Mistral $0.150 $0.600
$0.375
128K
gpt-oss-120B Together $0.150 $0.600
$0.375
128K Open
Llama 3.1 8B Instant Groq $0.050 $1.00
$0.525
128K Open 840 TPS
GPT-OSS 20B Groq $0.075 $1.00
$0.537
128K Open 1000 TPS
Mixtral 8x7B Instruct Mistral $0.540 $0.540
$0.540
32K Open
Llama 4 Scout Groq $0.110 $1.00
$0.555
10M Open 594 TPS
Gemma-4-31B-it-Pearl Together $0.280 $0.860
$0.570
128K Open
GPT-OSS 120B Groq $0.150 $1.00
$0.575
128K Open 500 TPS
Codestral Mistral $0.300 $0.900
$0.600
256K
Codestral 2508 Mistral $0.300 $0.900
$0.600
256K
GLM-4.6V Zhipu $0.300 $0.900
$0.600
128K Open
Qwen3 32B Groq $0.290 $1.00
$0.645
128K Open 662 TPS
GLM-4.5-Air Zhipu $0.200 $1.10
$0.650
128K Open
DeepSeek V4 Pro DeepSeek $0.435 $0.870
$0.652
1M
Llama 3.3 70B Meta $0.590 $0.790
$0.690
128K Open
MiniMax 2.5 Fireworks $0.300 $1.20
$0.750
128K Open
MiniMax 2.7 Fireworks $0.300 $1.20
$0.750
128K Open
MiniMax M3 Fireworks $0.300 $1.20
$0.750
1M Open
Granite 4 H Large IBM $0.300 $1.20
$0.750
128K Open
MiniMax-M2 MiniMax $0.300 $1.20
$0.750
205K Open
MiniMax-M2.1 MiniMax $0.300 $1.20
$0.750
205K Open
MiniMax-M2.5 MiniMax $0.300 $1.20
$0.750
205K Open
MiniMax-M2.7 MiniMax $0.300 $1.20
$0.750
205K Open
MiniMax-M3 MiniMax $0.300 $1.20
$0.750
1M Open
MiniMax M2.5 Together $0.300 $1.20
$0.750
128K Open
MiniMax M3 Together $0.300 $1.20
$0.750
1M Open
Llama 3.3 70B Versatile Groq $0.590 $1.00
$0.795
128K Open 394 TPS
Qwen-Plus Alibaba $0.400 $1.20
$0.800
131K
Qwen 3.6 27B Groq $0.600 $1.00
$0.800
128K Open 500 TPS
Qwen3.7-Plus Together $0.320 $1.28
$0.800
128K Open
Gemini 3.1 Flash-Lite Google $0.250 $1.50
$0.875
1M
Command R 03-2024 Cohere $0.500 $1.50
$1.00
128K
Qwen 3.7 Plus Fireworks $0.400 $1.60
$1.00
128K Open
Mistral Large 3 Mistral $0.500 $1.50
$1.00
128K
GPT-4.1 mini OpenAI $0.400 $1.60
$1.00
1M
Sonar Perplexity $1.00 $1.00
$1.00
200K
Llama 3.3 70B Together $1.04 $1.04
$1.04
128K Open
Devstral 2 2512 Mistral $0.400 $2.00
$1.20
256K Open
Mistral Medium 3 Mistral $0.400 $2.00
$1.20
128K
Gemini 3.1 Flash Google $0.300 $2.50
$1.40
1M
Reka Flash Reka $0.800 $2.00
$1.40
128K
GLM-4.5 Zhipu $0.600 $2.20
$1.40
128K Open
GLM-4.6 Zhipu $0.600 $2.20
$1.40
128K Open
GLM-4.7 Zhipu $0.600 $2.20
$1.40
128K Open
Command A Cohere $1.00 $2.00
$1.50
256K
NVIDIA Nemotron 3 Ultra Fireworks $0.600 $2.40
$1.50
128K Open
Grok Build 0.1 xAI $1.00 $2.00
$1.50
256K
QwQ-Plus Alibaba $0.800 $2.40
$1.60
131K
Qwen 3.6 Plus Fireworks $0.500 $3.00
$1.75
128K Open
Kimi K2.5 Fireworks $0.600 $3.00
$1.80
256K Open
Kimi K2.5 Moonshot $0.600 $3.00
$1.80
262K Open
Grok 4.3 xAI $1.25 $2.50
$1.88
1M
Nova Pro Amazon $0.800 $3.20
$2.00
300K
Llama Nemotron Ultra 253B NVIDIA $0.600 $3.60
$2.10
128K Open
Nemotron 3 Ultra NVIDIA $0.600 $3.60
$2.10
128K Open
NVIDIA Nemotron 3 Ultra Together $0.600 $3.60
$2.10
128K Open
Qwen3.5-397B-A17B Together $0.600 $3.60
$2.10
128K Open
GLM-5 Zhipu $1.00 $3.20
$2.10
128K Open
Kimi K2.6 Fireworks $0.950 $4.00
$2.48
256K Open
Kimi K2.7 Code Fireworks $0.950 $4.00
$2.48
256K Open
Kimi K2.6 Moonshot $0.950 $4.00
$2.48
262K Open
Kimi K2.7 Code Moonshot $0.950 $4.00
$2.48
262K Open
Kimi K2.7 Code Together $0.950 $4.00
$2.48
256K Open
Qwen3.7-Max Together $1.25 $3.75
$2.50
128K Open
GLM-5-Turbo Zhipu $1.20 $4.00
$2.60
128K Open
GLM-5V-Turbo Zhipu $1.20 $4.00
$2.60
128K Open
DeepSeek V4 Pro Fireworks $1.74 $3.48
$2.61
1M
DeepSeek V4 Pro Together $1.74 $3.48
$2.61
1M
GPT-5.4 mini OpenAI $0.750 $4.50
$2.63
1M
o3-mini OpenAI $1.10 $4.40
$2.75
200K
o4-mini OpenAI $1.10 $4.40
$2.75
200K
GLM 5.1 Fireworks $1.40 $4.40
$2.90
128K Open
GLM 5.2 Fireworks $1.40 $4.40
$2.90
128K Open
GLM-5.2 Together $1.40 $4.40
$2.90
128K Open
GLM-5.1 Zhipu $1.40 $4.40
$2.90
128K Open
GLM-5.2 Zhipu $1.40 $4.40
$2.90
128K Open
Qwen3-Max Alibaba $1.20 $4.80
$3.00
262K
Claude Haiku 4.5 Anthropic $1.00 $5.00
$3.00
200K
Magistral Medium Mistral $2.00 $5.00
$3.50
128K
Mixtral 8x22B Instruct Mistral $2.00 $6.00
$4.00
64K Open
Pixtral Large 2411 Mistral $2.00 $6.00
$4.00
128K Open
Reka Core Reka $2.00 $6.00
$4.00
128K
Grok 4.20 xAI $2.00 $6.00
$4.00
256K
Mistral Medium 3.5 Mistral $1.50 $7.50
$4.50
128K
Kimi K2.7 Code HighSpeed Moonshot $1.90 $8.00
$4.95
262K Open
Jamba Large AI21 Labs $2.00 $8.00
$5.00
256K Open
GPT-4.1 OpenAI $2.00 $8.00
$5.00
1M
Sonar Deep Research Perplexity $2.00 $8.00
$5.00
200K
Sonar Reasoning Pro Perplexity $2.00 $8.00
$5.00
200K
Gemini 3.5 Flash Google $1.50 $9.00
$5.25
1M
Gemini 2.5 Pro Google $1.25 $10.00
$5.63
2M
Yi Large 01.AI $3.00 $9.00
$6.00
32K
Command R+ 08-2024 Cohere $2.50 $10.00
$6.25
128K
Gemini 3.1 Pro Google $2.00 $12.00
$7.00
2M
Nova Premier Amazon $2.50 $12.50
$7.50
1M
GPT-5.4 OpenAI $2.50 $15.00
$8.75
1M
Claude Sonnet 4.6 Anthropic $3.00 $15.00
$9.00
200K
Sonar Pro Perplexity $3.00 $15.00
$9.00
200K
Grok 4 xAI $3.00 $15.00
$9.00
256K
GPT-Realtime-2 OpenAI $4.00 $24.00
$14.00
128K
Claude Opus 4.5 Anthropic $5.00 $25.00
$15.00
200K
Claude Opus 4.6 Anthropic $5.00 $25.00
$15.00
200K
Claude Opus 4.7 Anthropic $5.00 $25.00
$15.00
200K
Claude Opus 4.8 Anthropic $5.00 $25.00
$15.00
200K
Claude Mythos 5 Anthropic $10.00 $20.00
$15.00
200K
GPT-5.5 OpenAI $5.00 $30.00
$17.50
270K
GPT-Image-2 OpenAI $8.00 $30.00
$19.00
128K
Claude Fable 5 Anthropic $10.00 $50.00
$30.00
200K

Blended cost = (input + output) / 2 per 1M tokens. It's the simplest way to compare models when your usage mixes both. The cheapest models are often small open-weight models hosted on Groq or Together, but they may lack the reasoning power of frontier models. Use the Cost Calculator to estimate real monthly spending based on your actual input/output ratio.