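Per-message cost = (input tokens × input price per token) + (output tokens × output price per token). We multiply that by your messages per user per day and your daily active users (DAU) to project daily, monthly, and annual spend. If you turn on prompt caching, the cached portion of the input tokens is billed at the provider's cache-hit rate instead of the standard input price. That rate varies by provider and model, and can be as low as about one-tenth of the standard input price.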
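The arithmetic above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the calculator's actual code: the `Scenario` fields, the function names, the per-1M-token price units, the 30-day month, the 365-day year, and the default cache multiplier of 0.1 are all hypothetical choices made for the example. It also ignores any surcharge a provider may apply when writing to the cache.

```python
from dataclasses import dataclass


@dataclass
class Scenario:
    # All field names and defaults are illustrative assumptions.
    input_tokens: int              # input tokens per message
    output_tokens: int             # output tokens per message
    input_price_per_mtok: float    # USD per 1M input tokens
    output_price_per_mtok: float   # USD per 1M output tokens
    messages_per_user_per_day: float
    dau: int                       # daily active users
    cached_fraction: float = 0.0         # share of input tokens served from cache
    cache_price_multiplier: float = 0.1  # cached tokens billed at this fraction of input price


def per_message_cost(s: Scenario) -> float:
    """Cost in USD of one message, with the cache discount applied to cached input."""
    cached = s.input_tokens * s.cached_fraction
    uncached = s.input_tokens - cached
    billable_input = uncached + cached * s.cache_price_multiplier
    input_cost = billable_input * s.input_price_per_mtok / 1_000_000
    output_cost = s.output_tokens * s.output_price_per_mtok / 1_000_000
    return input_cost + output_cost


def projected_spend(s: Scenario) -> dict[str, float]:
    """Scale per-message cost to daily, monthly (30 days), and annual (365 days) spend."""
    daily = per_message_cost(s) * s.messages_per_user_per_day * s.dau
    return {"daily": daily, "monthly": daily * 30, "annual": daily * 365}


if __name__ == "__main__":
    # Made-up prices and traffic, chosen only to make the arithmetic easy to follow.
    example = Scenario(
        input_tokens=1000,
        output_tokens=500,
        input_price_per_mtok=3.0,
        output_price_per_mtok=15.0,
        messages_per_user_per_day=10,
        dau=1000,
    )
    print(per_message_cost(example))
    print(projected_spend(example))
```

Setting `cached_fraction=0.5` on the same scenario halves the number of input tokens billed at full price and bills the other half at the multiplier, which is how the cache-hit discount feeds into the projection.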
What this calculator does NOT include
Image, audio, or video token costs (those are priced separately by every provider)
Fine-tuning training costs
Vector database / embedding costs (embedding models are billed per token, roughly $0.02 per 1M tokens for low-cost models)
Your hosting bill (typically <5% of inference cost for serverless deployments)
Pricing sources
Numbers approximate public list prices as of 2026. For authoritative figures, check the official pricing pages of Anthropic, OpenAI, and Google.