Question 1

What is LLM API pricing?

Accepted Answer

LLM API pricing is how AI providers charge for using their large language models. Most providers charge per token (roughly 4 characters of English text), with separate rates for input tokens (your prompt) and output tokens (the model's response). Prices range from $0.0375 per 1M tokens for the cheapest models to $75 per 1M tokens for the most capable ones.

Question 2

How much does GPT-5.5 cost per 1M tokens?

Accepted Answer

As of May 2026, GPT-5.5 costs $5.00 per 1M input tokens and $30.00 per 1M output tokens (standard processing). The more affordable GPT-5.4 costs $2.50/$15.00, and GPT-5.4 Mini costs $0.75/$4.50. Cached input tokens are available at 90% discount.

Question 3

Which LLM has the cheapest API?

Accepted Answer

Google's Gemini 1.5 Flash-8B has the cheapest LLM API at $0.0375/$0.15 per 1M tokens. For production-quality models, Gemini 2.0 Flash ($0.10/$0.40) and DeepSeek V4 Flash ($0.14/$0.28) offer the best value. Groq's Llama 4 Maverick provides ultra-fast inference at $0.20/$0.60.

Question 4

How do I compare LLM costs?

Accepted Answer

Use this calculator to compare LLM costs: paste your prompt, set your monthly call volume, and see side-by-side pricing for all major providers. The static pricing table above shows current rates for all 19 models. Key factors: input vs output pricing, context window size, and your actual usage patterns.

Question 5

What is the cost per token for Claude?

Accepted Answer

Anthropic's Claude models are priced per 1M tokens: Claude 3.7 Sonnet costs $3.00 input / $15.00 output, Claude 3.5 Haiku costs $0.80 / $4.00, Claude 3 Opus costs $15.00 / $75.00, and Claude 3 Haiku costs $0.25 / $1.25. Per individual token, Claude 3.7 Sonnet costs $0.000003 per input token.

Question 6

How accurate is the token count?

Accepted Answer

For OpenAI models (GPT-4o, o3, o4-mini), we use tiktoken for exact counts. For Claude, Gemini, and other models, we estimate based on approximately 4 characters per token for English text and 1.5 characters for Chinese text.

Question 7

How often is pricing updated?

Accepted Answer

Pricing data is sourced directly from official provider pages and verified regularly. AI providers change pricing frequently — check the 'Prices last verified' date in the comparison table above.

Question 8

Is this tool free?

Accepted Answer

Yes, completely free. No login required. No API keys needed. All calculations happen in your browser.

Question 9

What is a token?

Accepted Answer

A token is the basic unit that AI models use to process text. Roughly, 1 token equals 4 characters in English or about 0.75 words. AI providers charge based on the number of tokens processed.

Question 10

How do I reduce my AI costs?

Accepted Answer

Three strategies: Use cheaper models for simple tasks (GPT-4o Mini at $0.15/$0.60 or Gemini 2.0 Flash at $0.10/$0.40), shorten prompts to reduce input tokens, and use cached input where available (OpenAI offers 90% discount on cached tokens).

Model	Input $/1M	Output $/1M	Context	Best For
OpenAIGPT-5.5	$5.00	$30.00	270K	Complex reasoning, coding, professional work
GPT-5.4	$2.50	$15.00	270K	Balanced performance and cost
GPT-5.4 Mini	$0.750	$4.50	270K	Fast coding, subagents, computer use
GPT-4o	$2.50	$10.00	128K	General-purpose, multimodal tasks
GPT-4o MiniCheapest	$0.150	$0.600	128K	High-volume, low-cost tasks
o3	$2.00	$8.00	200K	Deep reasoning, STEM problems
o4-mini	$1.10	$4.40	200K	Affordable reasoning tasks
AnthropicClaude 3.7 Sonnet	$3.00	$15.00	200K	Complex analysis, coding, extended thinking
Claude 3.5 Haiku	$0.800	$4.00	200K	Fast responses, high-throughput tasks
Claude 3 Opus	$15.00	$75.00	200K	Most demanding tasks, top-tier intelligence
Claude 3 HaikuCheapest	$0.250	$1.25	200K	Ultra-fast, cost-sensitive applications
Claude Opus 4.8	$5.00	$25.00	200K	Most capable Claude model, complex reasoning
Claude Sonnet 4.6	$3.00	$15.00	200K	Balanced performance and cost, coding
Claude Haiku 4.5	$1.00	$5.00	200K	Fast responses, cost-effective Claude
GoogleGemini 2.0 Flash	$0.100	$0.400	1M	Production-ready, high rate limits
Gemini 2.0 Flash-Lite	$0.075	$0.300	1M	Large-scale text output, lowest cost
Gemini 1.5 Pro	$1.25	$5.00	2M	2M context, complex multimodal tasks
Gemini 1.5 Flash	$0.075	$0.300	1M	Fast multimodal, diverse repetitive tasks
Gemini 1.5 Flash-8BCheapest	$0.037	$0.150	1M	Smallest model, lowest intelligence use cases
Gemini 3.5 Flash	$1.50	$9.00	1M	Most capable Gemini, advanced reasoning
Gemini 2.5 Flash	$0.300	$2.50	1M	Balanced Gemini model, good value
Gemini 2.5 Flash-Lite	$0.100	$0.400	1M	Lowest cost Gemini, high-volume tasks
DeepSeekDeepSeek V4 FlashCheapest	$0.140	$0.280	1M	Best value, strong reasoning at low cost
DeepSeek V4 Pro	$0.435	$0.870	1M	DeepSeek's most capable model
GroqLlama 4 MaverickCheapest	$0.200	$0.600	128K	Ultra-fast inference on Groq LPU

AI Prompt Cost
Calculator

LLM API Pricing Comparison

LLM API Pricing Comparison (2026)

LLM Cost per Token Explained

How to Choose the Right LLM API

How the AI cost calculator works.

Paste your prompt

Set your usage

Compare & save

Frequently asked questions.

Get one new dev tool every Friday.