Cheapest: GLM-4.7-Flash at $0.000/Mtok, out of 153 models tracked
LLMPrice$/Mtok

LLM API Providers

24 providers tracked — commercial labs, open-source model makers, and inference hosting platforms.

commercial
Maker of GPT and the OpenAI API. Offers frontier models (GPT-5.5, GPT-5.4), reasoning (o4-mini), and the open-source gpt-oss famil…
12
Models
$0.020
Cheapest blended
View all OpenAI models →
commercial
Maker of the Claude model family. Flagship Fable 5 and Opus 4.8, plus Sonnet and Haiku tiers. Known for safety research.
8
Models
$3.00
Cheapest blended
View all Anthropic models →
commercial
Maker of Gemini multimodal models with up to 2M context. Current flagship Gemini 3.5 Flash; also open-weights Gemma.
8
Models
$0.025
Cheapest blended
View all Google models →
commercial
Maker of Grok models. Current flagship Grok 4.3 offers 1M context at $1.25/$2.50 per 1M tokens — strong value.
5
Models
$0.350
Cheapest blended
View all xAI models →
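
To turn a listed input/output rate into a per-request cost, the arithmetic can be sketched as follows. It uses the Grok 4.3 rates quoted above ($1.25 input, $2.50 output per 1M tokens). The function name and the token counts are hypothetical, and real bills may include caching discounts or other adjustments that this sketch ignores.

```python
from decimal import Decimal

TOKENS_PER_MTOK = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int,
                 input_per_mtok: str, output_per_mtok: str) -> Decimal:
    """Estimate the dollar cost of one request from per-1M-token rates.

    Rates are passed as strings so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) / TOKENS_PER_MTOK * Decimal(input_per_mtok)
    output_cost = Decimal(output_tokens) / TOKENS_PER_MTOK * Decimal(output_per_mtok)
    return input_cost + output_cost


# Hypothetical request: 200,000 input tokens and 2,000 output tokens
# at the Grok 4.3 rates listed on this page.
cost = request_cost(200_000, 2_000, "1.25", "2.50")
print(f"Estimated cost: ${cost:.3f}")
```

With long prompts and short answers, as in this hypothetical request, the input rate dominates the total, so the output rate matters less than it first appears.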
commercial
Chinese AI lab offering extremely cheap APIs with aggressive prompt caching. V4 Flash at $0.14/$0.28 per 1M tokens.
2
Models
$0.210
Cheapest blended
View all DeepSeek models →
commercial
European AI lab. Mistral Large 3 at $0.50/$1.50 per 1M tokens, plus Codestral for code and open-weight models.
15
Models
$0.100
Cheapest blended
View all Mistral models →
commercial
Maker of the Nova model family on AWS. Nova Micro at $0.035/$0.14 per 1M tokens is among the cheapest hosted APIs.
4
Models
$0.088
Cheapest blended
View all Amazon models →
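
The page does not say how "Cheapest blended" is computed. Two cards are consistent with a plain average of the input and output price: DeepSeek V4 Flash ($0.14/$0.28, listed at $0.210) and Amazon Nova Micro ($0.035/$0.14, listed at $0.088). The sketch below assumes that 1:1 weighting and half-up rounding to three decimals. Both are inferences from those two cards, not a documented formula, and the function name is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP


def blended_price(input_per_mtok: str, output_per_mtok: str) -> Decimal:
    """Assumed blend: unweighted mean of input and output price, in $/Mtok."""
    mean = (Decimal(input_per_mtok) + Decimal(output_per_mtok)) / 2
    # Round to three decimals, matching the precision shown on the cards.
    return mean.quantize(Decimal("0.001"), rounding=ROUND_HALF_UP)


print(blended_price("0.14", "0.28"))    # DeepSeek V4 Flash rates
print(blended_price("0.035", "0.14"))   # Amazon Nova Micro rates
```

A different input-to-output weighting, such as 3:1, would give different figures, so the blended column is best read as a rough ranking aid and not as a bill estimate.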
commercial
Enterprise-focused lab. Command A flagship at $1/$2 per 1M tokens — a major price drop vs. the older Command R+.
6
Models
$0.020
Cheapest blended
View all Cohere models →
open
Maker of the open-weight Llama family. Llama 4 Scout (17B, 16 experts, 10M context) and Llama 3.3 70B are widely hosted.
3
Models
$0.065
Cheapest blended
View all Meta models →
hosting
Inference platform running open models on custom LPU chips for extreme speed. Up to 1000 tokens/sec, cheapest hosted prices for ma…
7
Models
$0.525
Cheapest blended
View all Groq models →
hosting
Inference platform hosting 200+ open-source models with simple per-token pricing. Strong coverage of Qwen, GLM, Kimi, MiniMax, and…
13
Models
$0.075
Cheapest blended
View all Together models →
hosting
Fireworks AI models.
0
Models
Cheapest
View all Fireworks AI models →
commercial
Perplexity AI models.
4
Models
$1.00
Cheapest blended
View all Perplexity models →
hosting
NVIDIA AI models.
3
Models
$0.100
Cheapest blended
View all NVIDIA models →
commercial
IBM AI models.
5
Models
$0.065
Cheapest blended
View all IBM models →
commercial
AI21 Labs AI models.
2
Models
$0.300
Cheapest blended
View all AI21 Labs models →
commercial
Reka AI models.
3
Models
$0.100
Cheapest blended
View all Reka models →
commercial
Voyage AI models.
8
Models
$0.020
Cheapest blended
View all Voyage AI models →
commercial
Alibaba AI models.
5
Models
$0.125
Cheapest blended
View all Alibaba models →
commercial
Zhipu AI models.
14
Models
$0.000
Cheapest blended
View all Zhipu models →
commercial
Moonshot AI models.
4
Models
$1.80
Cheapest blended
View all Moonshot models →
commercial
MiniMax AI models.
5
Models
$0.750
Cheapest blended
View all MiniMax models →
commercial
01.AI models.
1
Models
$6.00
Cheapest blended
View all 01.AI models →
commercial
Baichuan AI models.
1
Models
$0.070
Cheapest blended
View all Baichuan models →