AI cost calculator.
Project monthly API spend across Claude, GPT-5, Gemini, and Llama. Set request volume, prompt size, output length, and cache hit rate. See the cheapest model, the cheapest flagship, and your annual bill in one table.
Default scenario:

- Total monthly requests: 10,000
- Total monthly tokens: 25.0M
- Cheapest fit: Gemini 3 Flash at $3.00/month
| Model | Vendor · tier | Input | Output | Monthly | Annual | Per 1K req |
|---|---|---|---|---|---|---|
| Gemini 3 Flash (cheapest) | Google · fast | $1.50 | $1.50 | $3.00 | $36.00 | $0.300 |
| Llama 4 Scout | Meta · open | $1.60 | $1.50 | $3.10 | $37.20 | $0.310 |
| Llama 4 Maverick (cheapest flagship) | Meta · flagship | $5.40 | $4.25 | $9.65 | $116 | $0.965 |
| Claude Haiku 4.5 | Anthropic · fast | $5.00 | $6.25 | $11.25 | $135 | $1.13 |
| GPT-5 mini | OpenAI · fast | $5.00 | $10.00 | $15.00 | $180 | $1.50 |
| Gemini 2.5 Pro | Google · flagship | $25.00 | $25.00 | $50.00 | $600 | $5.00 |
| Gemini 3.1 Pro | Google · flagship | $40.00 | $60.00 | $100 | $1,200 | $10.00 |
| GPT-5.4 | OpenAI · mid | $50.00 | $75.00 | $125 | $1,500 | $12.50 |
| Claude Sonnet 4.6 | Anthropic · mid | $60.00 | $75.00 | $135 | $1,620 | $13.50 |
| Claude Opus 4.7 | Anthropic · flagship | $100 | $125 | $225 | $2,700 | $22.50 |
| GPT-5.5 | OpenAI · flagship | $100 | $150 | $250 | $3,000 | $25.00 |
Take-aways

- Cheapest model overall: Gemini 3 Flash at $3.00/month ($36.00/yr).
- Cheapest flagship-tier: Llama 4 Maverick at $9.65/month, a 3.2× premium for higher quality.
- Calculated on standard list pricing with no batch or volume discounts. Anthropic and OpenAI offer roughly 50% off for batch jobs; hosted Llama varies by provider.
How to read the table
Eleven models, sorted from cheapest to priciest at the volume you set. The row marked "cheapest" is the cheapest overall; the row marked "cheapest flagship" is the cheapest flagship-tier model (use that one when quality matters, the cheapest overall when it doesn't). Annual is monthly times twelve, no compounding.
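Each row's math can be sketched in a few lines. The 20M-input / 5M-output split below is an assumption chosen to reproduce the Gemini 3 Flash row from its $0.075/M input, $0.30/M output list price; the page only states 25.0M total tokens.

```python
def monthly_cost(input_tokens, output_tokens, in_price_per_m, out_price_per_m):
    """Token cost at list price: (tokens / 1M) * price per million."""
    return (input_tokens / 1e6) * in_price_per_m + (output_tokens / 1e6) * out_price_per_m

# Gemini 3 Flash at $0.075/M input, $0.30/M output, assuming the 25.0M
# monthly tokens split 20M input / 5M output (an assumed split)
monthly = monthly_cost(20e6, 5e6, 0.075, 0.30)
annual = monthly * 12                          # monthly times twelve, no compounding
per_1k_req = monthly / 10_000 * 1_000          # at 10,000 requests/month
print(monthly, annual, round(per_1k_req, 3))   # 3.0 36.0 0.3
```

The same formula, with different per-million prices, generates every other row in the table.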
How much does ChatGPT cost per month?
Consumer ChatGPT Plus is a flat $20/month with no per-token billing. GPT-5.5 via the API depends entirely on your volume; the calculator above shows the math. The break-even between Plus and the API comes down to daily request volume: for occasional users Plus is cheaper, for bulk usage the API wins.
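A quick sketch of that break-even, using the $5/M input, $30/M output GPT-5.5 list price quoted elsewhere on this page and an assumed per-request token shape:

```python
def breakeven_requests(flat_monthly, in_tokens, out_tokens, in_price_m, out_price_m):
    """Monthly API requests at which per-token billing equals a flat subscription."""
    per_request = (in_tokens / 1e6) * in_price_m + (out_tokens / 1e6) * out_price_m
    return flat_monthly / per_request

# $20/mo flat vs GPT-5.5 API, assuming a 1,500-in / 300-out token request
# (the request shape is illustrative, not from the page)
print(round(breakeven_requests(20, 1_500, 300, 5, 30)))   # 1212 requests/month
```

Under those assumptions, anything past roughly 1,200 requests a month favors the flat subscription; heavier prompts pull the break-even lower.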
How much does Claude cost per month?
Claude Pro is $20/month for consumers. Via the API, Opus 4.7 is the priciest at $5/$25 per million tokens; Sonnet 4.6 at $3/$15 is the value pick for most production workloads. See the full Claude pricing breakdown for the per-workload math.
How to calculate AI API cost for a chatbot
Three numbers: daily-active users, requests per user per day, average tokens per request. Multiply user count by request count by 30 to get monthly requests. Plug into the calculator, pick the workload preset closest to your shape (probably "customer support chatbot"), then read the cheapest row.
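The three steps above can be sketched as follows; every input here is an illustrative assumption, not a recommendation:

```python
# Three-number chatbot estimate (all inputs are assumptions)
daily_active_users = 500
requests_per_user_per_day = 4
monthly_requests = daily_active_users * requests_per_user_per_day * 30

avg_input_tokens, avg_output_tokens = 1_200, 250
monthly_input_tokens = monthly_requests * avg_input_tokens
monthly_output_tokens = monthly_requests * avg_output_tokens

print(monthly_requests)        # 60000 requests/month
print(monthly_input_tokens)    # 72000000 input tokens
print(monthly_output_tokens)   # 15000000 output tokens
```

Those token totals are what you plug into the calculator's volume fields.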
When prompt caching changes the math
Prompt caching is most useful when the same input prefix (system prompt, document context, codebase) repeats across many requests. RAG, agentic coding, and chatbots with long system prompts often see 60-80% input cost cuts. One-shot chat and stateless API calls see no benefit. Read the prompt caching guide for which providers cache what.
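A minimal sketch of the blended input rate, assuming a 90%-off cached-read price and an illustrative 70% hit rate:

```python
def effective_input_price(base_per_m, cache_hit_rate, cached_discount=0.90):
    """Blend fresh and cached input: cache hits pay (1 - discount) * base."""
    cached_per_m = base_per_m * (1 - cached_discount)
    return (1 - cache_hit_rate) * base_per_m + cache_hit_rate * cached_per_m

# 70% hit rate with 90%-off cached reads cuts input cost by 63%,
# inside the 60-80% range above (rates here are illustrative)
base = 3.00   # $/M fresh input (assumed)
eff = effective_input_price(base, 0.70)
print(round(eff, 2), round(1 - eff / base, 2))   # 1.11 0.63
```

At a 0% hit rate the formula collapses to the list price, which is why one-shot chat sees no benefit.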
Use it from inside Claude or Cursor
Same calculator also ships as an MCP server, so Claude Desktop, Cursor, or any MCP-compatible client can call estimate_ai_cost and cheapest_ai_model programmatically. Useful for AI agents that need to budget their own token spend or pick a cost-optimized model.
- npm package: briskly-mcp-ai-cost-calculator
- three tools: estimate_ai_cost, cheapest_ai_model, list_cost_models
- same math as this page, deterministic outputs
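For the curious, MCP tool calls travel as JSON-RPC `tools/call` requests. Here is a hedged sketch of what a client might send; the argument names are assumptions for illustration, not the package's documented schema, so check the npm README for the real parameters:

```python
import json

# JSON-RPC "tools/call" request an MCP client sends under the hood;
# the "arguments" keys below are illustrative assumptions
request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {
        "name": "estimate_ai_cost",
        "arguments": {
            "monthly_requests": 10_000,
            "input_tokens_per_request": 2_000,
            "output_tokens_per_request": 500,
        },
    },
}
print(json.dumps(request, indent=2))
```

Clients like Claude Desktop and Cursor build this envelope for you; you only ever see the tool name and its arguments.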
When this is the wrong tool
For a single prompt rather than monthly volume, use the LLM token counter instead; it shows per-prompt cost across the same models. If you don't know which model fits your use case, run the AI model picker first, then come back here for the cost projection.
FAQ
How much does the OpenAI API cost per month?
Depends on volume and model. GPT-5.5 (the new flagship) is $5/M input, $30/M output. GPT-5.4 (mid-tier) is $2.50/$15. GPT-5 mini is $0.25/$2. For a chatbot doing 50K requests a month at 1.5K input tokens and 300 output tokens, GPT-5.4 lands around $410/month, GPT-5 mini around $50. Run the calculator above with your actual numbers.
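Recomputing that chatbot example directly from the per-million list prices quoted above:

```python
def monthly_bill(requests, in_tokens, out_tokens, in_price_m, out_price_m):
    """Requests/month times per-request token cost at list price."""
    return requests * ((in_tokens / 1e6) * in_price_m + (out_tokens / 1e6) * out_price_m)

# 50K requests/month at 1.5K input / 300 output tokens
gpt_5_4  = monthly_bill(50_000, 1_500, 300, 2.50, 15)   # mid-tier at $2.50/$15
gpt_mini = monthly_bill(50_000, 1_500, 300, 0.25, 2)    # fast tier at $0.25/$2
print(round(gpt_5_4, 2), round(gpt_mini, 2))            # 412.5 48.75
```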
How much does the Claude API cost per month?
Claude Opus 4.7 is $5/$25 per million tokens. Sonnet 4.6 is $3/$15. Haiku 4.5 is $0.25/$1.25. With prompt caching enabled (which Anthropic offers at 90% off cached reads), high-cache-rate workloads can drop input cost by 60-80%. The calculator above includes a cache-hit-rate slider so you can see the impact.
Which AI API is cheapest?
Gemini 3 Flash at $0.075 input / $0.30 output per million tokens, then Llama 4 Scout at roughly the same on hosted providers. Among flagship-tier models, Gemini 2.5 Pro ($1.25/$5) is cheapest, then Gemini 3.1 Pro ($2/$12). Claude Opus 4.7 and GPT-5.5 are the most expensive flagships at $5 input each.
How do I estimate AI cost for my use case?
Three numbers: monthly request count, average input tokens per request, average output tokens per request. Use one of the workload presets in the calculator as a starting point if you're not sure (chatbot, RAG, agentic coding, content writing, summarizer, high-volume API). Adjust from there.
Should I include prompt caching in my estimate?
Only if you're going to use it. Prompt caching gives you 80-90% off the input price for tokens you've sent before, useful when system prompts or long stable prefixes repeat across requests. RAG and agentic-coding workloads benefit most. Set the cache-hit-rate slider to your realistic rate (60% is common for RAG, 70% for agentic coding, 0% for one-shot chat).
What's a typical AI API budget for a small SaaS?
Wildly variable. A typical pattern: a small SaaS with 1K daily-active users, each making 5 LLM-backed actions per day, ends up at 150K monthly requests. At 2K input / 500 output average, that's around $1,900/month on GPT-5.4 and $225/month on GPT-5 mini. Caching aggressively or moving high-volume calls to a fast tier can cut the bill 70-90%.
How does cost scale with users?
Roughly linearly with request volume, since pricing is per-token. Doubling your users typically doubles your bill (sometimes a bit less, since system-prompt cache-hit rates go up at scale). Use the calculator to model 2x, 5x, 10x scenarios by changing the monthly-requests field.
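A minimal scaling sketch, with assumed baseline numbers:

```python
# Per-token billing scales linearly: k times the requests is roughly
# k times the bill (caching at scale can shave this slightly)
base_requests = 150_000   # current monthly volume (assumed)
base_bill = 1_000.00      # current monthly bill in dollars (assumed)

for k in (2, 5, 10):
    print(f"{k}x: {k * base_requests:,} requests, about ${k * base_bill:,.0f}/month")
```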
Are there hidden fees beyond per-token pricing?
Not on the major providers' standard tiers: Anthropic, OpenAI, and Google charge per token only. Batch jobs and committed-volume tiers change the rate, but as discounts, not surcharges. Hosted Llama (Together, Fireworks, Groq, etc.) varies by provider; the calculator uses median hosted prices. Cache writes do cost extra on Claude (roughly 1.25× the fresh input rate, e.g. ~$6.25/M vs $5/M for Opus 4.7); the calculator factors this into the cached input rate.