AI Prompt Cost Calculator

Compare LLM API pricing across 25 models: GPT‑5.5, Claude 3.7, Gemini 2.0, DeepSeek V4 & more. Know your AI costs before you send.

LLM API Pricing Comparison

Current pricing from official API providers. All prices per 1M tokens.

Provider   Model                   Input $/1M   Output $/1M   Context   Note
OpenAI     GPT-5.5                 $5.00        $30.00        270K
OpenAI     GPT-5.4                 $2.50        $15.00        270K
OpenAI     GPT-5.4 Mini            $0.750       $4.50         270K
OpenAI     GPT-4o                  $2.50        $10.00        128K
OpenAI     GPT-4o Mini             $0.150       $0.600        128K      Cheapest
OpenAI     o3                      $2.00        $8.00         200K
OpenAI     o4-mini                 $1.10        $4.40         200K
Anthropic  Claude 3.7 Sonnet       $3.00        $15.00        200K
Anthropic  Claude 3.5 Haiku        $0.800       $4.00         200K
Anthropic  Claude 3 Opus           $15.00       $75.00        200K
Anthropic  Claude 3 Haiku          $0.250       $1.25         200K      Cheapest
Anthropic  Claude Opus 4.8         $5.00        $25.00        200K
Anthropic  Claude Sonnet 4.6       $3.00        $15.00        200K
Anthropic  Claude Haiku 4.5        $1.00        $5.00         200K
Google     Gemini 2.0 Flash        $0.100       $0.400        1M
Google     Gemini 2.0 Flash-Lite   $0.075       $0.300        1M
Google     Gemini 1.5 Pro          $1.25        $5.00         2M
Google     Gemini 1.5 Flash        $0.075       $0.300        1M
Google     Gemini 1.5 Flash-8B     $0.037       $0.150        1M        Cheapest
Google     Gemini 3.5 Flash        $1.50        $9.00         1M
Google     Gemini 2.5 Flash        $0.300       $2.50         1M
Google     Gemini 2.5 Flash-Lite   $0.100       $0.400        1M
DeepSeek   DeepSeek V4 Flash       $0.140       $0.280        1M        Cheapest
DeepSeek   DeepSeek V4 Pro         $0.435       $0.870        1M
Groq       Llama 4 Maverick        $0.200       $0.600        128K      Cheapest

Prices last verified: May 30, 2026. Source: official provider pricing pages.

25 models across 5 providers

LLM API Pricing Comparison (2026)

LLM API pricing varies dramatically across providers. As of June 2026, the cheapest model in the table above (Gemini 1.5 Flash-8B) costs $0.037 per 1M input tokens, while the most expensive (Claude 3 Opus) costs $15.00, roughly a 400× difference for the same text.

OpenAI's GPT-5.5 is priced at $5/$30 per 1M tokens (input/output) and targets complex reasoning tasks. For budget-conscious developers, DeepSeek V4 Flash offers strong performance at $0.14/$0.28, and Google's Gemini 2.5 Flash provides production-ready quality at $0.30/$2.50. The table above shows current pricing for all major providers.

LLM Cost per Token Explained

LLM providers charge per token, not per word or character. One token is roughly 4 characters of English text, or about ¾ of a word. A typical 1,000-word article uses about 1,300 tokens. Most providers charge separately for input tokens (your prompt) and output tokens (the model's response). Output tokens cost more: for most models in the table above, 4–6× the input price, with a full range of 2× to about 8×.
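As a rough illustration of the rule of thumb above, the following Python sketch estimates a token count from character counts. The function name estimate_tokens and the Unicode range used to detect Chinese characters are assumptions made for this example; the ~1.5 characters per token rate for Chinese comes from the FAQ below. This is only a heuristic, and exact counts require the provider's own tokenizer.

```python
import math

# Heuristic rates quoted on this page (not exact tokenizer behavior).
CHARS_PER_TOKEN_ENGLISH = 4.0
CHARS_PER_TOKEN_CHINESE = 1.5


def estimate_tokens(text: str) -> int:
    """Estimate tokens: ~4 chars/token for most text, ~1.5 for Chinese."""
    # Assumption: treat the CJK Unified Ideographs block as "Chinese text".
    chinese_chars = sum(1 for ch in text if "\u4e00" <= ch <= "\u9fff")
    other_chars = len(text) - chinese_chars
    estimate = (other_chars / CHARS_PER_TOKEN_ENGLISH
                + chinese_chars / CHARS_PER_TOKEN_CHINESE)
    # Round up so a short non-empty prompt never counts as zero tokens.
    return math.ceil(estimate)


print(estimate_tokens("Hello, world!"))  # 13 characters -> 4
print(estimate_tokens("你好世界"))        # 4 Chinese characters -> 3
```

Rounding up is a deliberate choice here: for budgeting, a slight overestimate is usually safer than an underestimate.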

For example, sending a 500-token prompt to GPT-5.4 costs $0.00125 in input fees. If the model generates 1,000 tokens in response, that adds $0.015 in output fees. At 1,000 calls per month, this works out to roughly $16.25/month. Use the calculator above to estimate costs for your actual usage patterns.
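The arithmetic in this example can be reproduced with a few lines of Python. This is a minimal sketch, not the calculator's actual code; the function names call_cost and monthly_cost are made up for illustration, and the prices are the GPT-5.4 figures from the table above.

```python
def call_cost(input_tokens: int, output_tokens: int,
              input_price_per_m: float, output_price_per_m: float) -> float:
    """Cost in USD of one API call, given prices per 1M tokens."""
    input_cost = input_tokens * input_price_per_m / 1_000_000
    output_cost = output_tokens * output_price_per_m / 1_000_000
    return input_cost + output_cost


def monthly_cost(calls_per_month: int, input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Monthly cost in USD, assuming every call has the same token counts."""
    return calls_per_month * call_cost(
        input_tokens, output_tokens, input_price_per_m, output_price_per_m
    )


# GPT-5.4: $2.50 input and $15.00 output per 1M tokens (from the table).
per_call = call_cost(500, 1_000, 2.50, 15.00)
per_month = monthly_cost(1_000, 500, 1_000, 2.50, 15.00)

print(f"per call:  ${per_call:.5f}")   # per call:  $0.01625
print(f"per month: ${per_month:.2f}")  # per month: $16.25
```

Real traffic rarely has identical prompt and response lengths on every call, so average token counts are a reasonable stand-in when budgeting.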

How to Choose the Right LLM API

Pick your model based on task complexity and budget. For simple tasks (classification, extraction, short answers), start with GPT-5.4 Mini ($0.75/$4.50) or Gemini 2.5 Flash-Lite ($0.10/$0.40). These models can handle most routine production workloads at a fraction of the cost.

For complex reasoning, coding, or professional work, step up to GPT-5.4 ($2.50/$15) or Claude 3.7 Sonnet ($3/$15). Reserve the most expensive models — GPT-5.5 ($5/$30) and Claude Opus 4.8 ($5/$25) — for tasks that genuinely need frontier intelligence. A practical approach: prototype with cheap models, then upgrade only where quality demands it.
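To see what "upgrade only where quality demands it" can mean in dollars, here is a hedged Python sketch comparing an all-frontier setup with a mixed one. The call volume (10,000 per month), the token counts, and the 20% share routed to the frontier model are hypothetical numbers chosen for illustration; only the prices come from the table above. The helper names are also invented for this example.

```python
# Prices per 1M tokens as (input, output), copied from the table above.
PRICES = {
    "GPT-5.4 Mini": (0.75, 4.50),
    "GPT-5.5": (5.00, 30.00),
}


def monthly_cost(model: str, calls: float,
                 input_tokens: int, output_tokens: int) -> float:
    """Monthly cost in USD for `calls` identical calls to `model`."""
    input_price, output_price = PRICES[model]
    per_call = (input_tokens * input_price
                + output_tokens * output_price) / 1_000_000
    return calls * per_call


def routed_cost(cheap: str, frontier: str, calls: int, frontier_share: float,
                input_tokens: int, output_tokens: int) -> float:
    """Monthly cost when `frontier_share` of calls go to the frontier model."""
    frontier_calls = calls * frontier_share
    cheap_calls = calls - frontier_calls
    return (monthly_cost(cheap, cheap_calls, input_tokens, output_tokens)
            + monthly_cost(frontier, frontier_calls, input_tokens, output_tokens))


# Hypothetical workload: 10,000 calls/month, 500 tokens in, 1,000 tokens out.
all_frontier = monthly_cost("GPT-5.5", 10_000, 500, 1_000)
mixed = routed_cost("GPT-5.4 Mini", "GPT-5.5", 10_000, 0.2, 500, 1_000)

print(f"all GPT-5.5:           ${all_frontier:.2f}")  # $325.00
print(f"80% Mini, 20% GPT-5.5: ${mixed:.2f}")         # $104.00
```

Under these assumptions the mixed setup costs about a third of the all-frontier one; the actual saving depends entirely on how many calls truly need the larger model.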

How the AI cost calculator works.

01 Paste your prompt

Drop in the prompt you plan to send. Any length works.

02 Set your usage

Adjust monthly call volume and expected output length.

03 Compare & save

See costs for all major AI models. Pick the cheapest.
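The three steps amount to one calculation repeated per model and then sorted. The Python sketch below shows that idea under stated assumptions: it is not the site's implementation, the function name rank_by_monthly_cost is hypothetical, and only five models from the pricing table are included to keep the example short.

```python
# Prices per 1M tokens as (input, output), a subset of the table above.
PRICES = {
    "GPT-5.4": (2.50, 15.00),
    "Claude Sonnet 4.6": (3.00, 15.00),
    "Gemini 2.5 Flash": (0.30, 2.50),
    "DeepSeek V4 Flash": (0.14, 0.28),
    "Llama 4 Maverick": (0.20, 0.60),
}


def rank_by_monthly_cost(prompt_tokens: int, output_tokens: int,
                         calls_per_month: int,
                         prices: dict = PRICES) -> list:
    """Return (model, monthly cost in USD) pairs, cheapest first."""
    costs = {
        model: calls_per_month
        * (prompt_tokens * input_price + output_tokens * output_price)
        / 1_000_000
        for model, (input_price, output_price) in prices.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


# Step 1: prompt size. Step 2: usage. Step 3: compare.
ranking = rank_by_monthly_cost(prompt_tokens=500, output_tokens=1_000,
                               calls_per_month=1_000)
for model, cost in ranking:
    print(f"{model:<20} ${cost:.2f}")
```

For this workload the ranking runs from DeepSeek V4 Flash (about $0.35 per month) up to Claude Sonnet 4.6 (about $16.50 per month). Price alone does not account for quality differences between models.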

Frequently asked questions.

How accurate is the token count?
For OpenAI models (GPT-5.5, GPT-5.4, o3, o4-mini), we use tiktoken for exact counts. For Claude, Gemini, and other models, we estimate based on ~4 characters per token for English text and ~1.5 characters per token for Chinese text. These estimates are approximate but accurate enough for budgeting purposes.
Which AI models are supported?
We support 25 models across 5 providers: OpenAI (GPT-5.5, GPT-5.4, GPT-5.4 Mini, GPT-4o, GPT-4o Mini, o3, o4-mini), Anthropic (Claude Opus 4.8, Sonnet 4.6, Haiku 4.5, 3.7 Sonnet, 3.5 Haiku, 3 Opus, 3 Haiku), Google (Gemini 3.5 Flash, 2.5 Flash, 2.5 Flash-Lite, 2.0 Flash, 2.0 Flash-Lite, 1.5 Pro, 1.5 Flash, 1.5 Flash-8B), DeepSeek (V4 Flash, V4 Pro), and Groq (Llama 4 Maverick).
How often is pricing updated?
Pricing data is updated weekly from official provider pricing pages. AI providers change their pricing frequently, so we strive to keep the data current.
Is this tool free?
Yes, completely free. No login required. No API keys needed. All calculations happen in your browser.
What is a token?
A token is the basic unit that AI models use to process text. Roughly, 1 token equals 4 characters in English or about 0.75 words. A typical sentence is 10–20 tokens. AI providers charge based on the number of tokens processed.
How do I reduce my AI costs?
Three strategies: (1) Use cheaper models for simple tasks (e.g., GPT-5.4 Mini instead of GPT-5.5). (2) Shorten your prompts to reduce input tokens. (3) Use models with free tiers for testing (Gemini Flash, Groq).
Can I use this for batch calculations?
Not yet. Batch calculation (uploading a CSV of prompts) is planned for a future version.
Does this include batch API pricing?
Currently we show standard API pricing. Batch API pricing (typically 50% cheaper) is not yet included but is coming soon.
