Compare LLM API pricing across 25 models: GPT‑5.5, Claude 3.7, Gemini 2.0, DeepSeek V4 & more. Know your AI costs before you send.
Current pricing from official API providers. All prices per 1M tokens.
| Model | Input $/1M | Output $/1M | Context |
|---|---|---|---|
OpenAIGPT-5.5 | $5.00 | $30.00 | 270K |
GPT-5.4 | $2.50 | $15.00 | 270K |
GPT-5.4 Mini | $0.750 | $4.50 | 270K |
GPT-4o | $2.50 | $10.00 | 128K |
GPT-4o MiniCheapest | $0.150 | $0.600 | 128K |
o3 | $2.00 | $8.00 | 200K |
o4-mini | $1.10 | $4.40 | 200K |
AnthropicClaude 3.7 Sonnet | $3.00 | $15.00 | 200K |
Claude 3.5 Haiku | $0.800 | $4.00 | 200K |
Claude 3 Opus | $15.00 | $75.00 | 200K |
Claude 3 HaikuCheapest | $0.250 | $1.25 | 200K |
Claude Opus 4.8 | $5.00 | $25.00 | 200K |
Claude Sonnet 4.6 | $3.00 | $15.00 | 200K |
Claude Haiku 4.5 | $1.00 | $5.00 | 200K |
GoogleGemini 2.0 Flash | $0.100 | $0.400 | 1M |
Gemini 2.0 Flash-Lite | $0.075 | $0.300 | 1M |
Gemini 1.5 Pro | $1.25 | $5.00 | 2M |
Gemini 1.5 Flash | $0.075 | $0.300 | 1M |
Gemini 1.5 Flash-8BCheapest | $0.037 | $0.150 | 1M |
Gemini 3.5 Flash | $1.50 | $9.00 | 1M |
Gemini 2.5 Flash | $0.300 | $2.50 | 1M |
Gemini 2.5 Flash-Lite | $0.100 | $0.400 | 1M |
DeepSeekDeepSeek V4 FlashCheapest | $0.140 | $0.280 | 1M |
DeepSeek V4 Pro | $0.435 | $0.870 | 1M |
GroqLlama 4 MaverickCheapest | $0.200 | $0.600 | 128K |
Prices last verified: May 30, 2026. Source: official provider pricing pages.
25 models across 5 providers
LLM API pricing varies dramatically across providers. As of June 2026, the cheapest text model (Gemini 2.5 Flash-Lite) costs $0.10 per 1M input tokens, while the most expensive (Claude 3 Opus) costs $15.00 — a 150× difference for the same text.
OpenAI's GPT-5.5 leads at $5/$30 per 1M tokens for complex reasoning tasks. For budget-conscious developers, DeepSeek V4 Flash offers strong performance at $0.14/$0.28, and Google's Gemini 2.5 Flash provides production-ready quality at $0.30/$2.50. The table above shows current pricing for all major providers.
LLM providers charge per token, not per word or character. One token is roughly 4 characters of English text, or about ¾ of a word. A typical 1,000-word article uses about 1,300 tokens. Most providers charge separately for input tokens (your prompt) and output tokens (the model's response), with output tokens typically costing 3–6× more than input.
For example, sending a 500-token prompt to GPT-5.4 costs $0.00125 in input fees. If the model generates 1,000 tokens in response, that adds $0.015 in output fees. At 1,000 calls per month, this works out to roughly $16.25/month. Use the calculator above to estimate costs for your actual usage patterns.
Pick your model based on task complexity and budget. For simple tasks (classification, extraction, short answers), start with GPT-5.4 Mini ($0.75/$4.50) or Gemini 2.5 Flash-Lite ($0.10/$0.40). These models handle 90% of production workloads at a fraction of the cost.
For complex reasoning, coding, or professional work, step up to GPT-5.4 ($2.50/$15) or Claude 3.7 Sonnet ($3/$15). Reserve the most expensive models — GPT-5.5 ($5/$30) and Claude Opus 4.8 ($5/$25) — for tasks that genuinely need frontier intelligence. A practical approach: prototype with cheap models, then upgrade only where quality demands it.
Drop in the prompt you plan to send. Any length works.
Adjust monthly call volume and expected output length.
See costs for all major AI models. Pick the cheapest.
Hand-picked free tools, source code included. No spam. Unsubscribe anytime.
By subscribing, you agree to receive emails from aicalc.cloud. Unsubscribe anytime.