Loading…
Loading…
Written by Max Zeshut
Founder at Agentmelt · Last updated May 26, 2026
The price charged by LLM API providers for processing one token of input or generating one token of output. Input tokens (the prompt and context) are typically cheaper than output tokens (the model's response). As of 2026, pricing ranges from $0.08/M tokens for small models to $15/M tokens for frontier reasoning models. Understanding cost-per-token is essential for estimating AI agent operating costs, choosing the right model tier, and implementing cost optimization strategies like caching, routing, and batching.
A support agent handles 5,000 conversations/day, averaging 2,000 input tokens and 500 output tokens per conversation. At $3/M input and $15/M output tokens: daily cost = $30 + $37.50 = $67.50/day. Implementing semantic caching for common questions reduces this by 40% to ~$40/day.