Compare inference costs across GPT-5, Claude 4.5, Gemini 3, and more.
Now with agentic loops, caching, and reasoning tokens
In 2026, most AI applications use multi-step "agentic loops" rather than single prompts. Calculate costs for workflows like: "5 research steps + 1 summary step = 6 total steps."
Modern APIs (Anthropic, Gemini, OpenAI) offer context caching that reduces input costs by up to 90% for repeated data. Set your cache hit rate to see potential savings.
Reasoning models (like OpenAI's o-series) "think" before responding. These "thinking tokens" cost extra money and are often missed by older calculators. Enable reasoning to see the true cost.
Enable context caching to save up to 90% on repeated inputs
Visual comparison showing which model performs best for different use cases
| Model | Input Price | Output Price | Context | Total Cost | vs. Lowest |
|---|---|---|---|---|---|
Gemini 3 ProGoogle | $1.25/1M | $10.00/1M | 2.0M | $0.0825 | Lowest |
GPT-5OpenAI | $1.75/1M | $14.00/1M | 256.0K | $0.1155 | +$0.0330 (40%) |
Claude 4.5 SonnetAnthropic | $3.00/1M | $15.00/1M | 200.0K | $0.1800 | +$0.0975 (118%) |
Copy-paste code to integrate Gemini 3 Pro into your application
import google.generativeai as genai
genai.configure(api_key="your-api-key")
model = genai.GenerativeModel('gemini-3-pro')
response = model.generate_content(
"Your prompt here",
generation_config={
"max_output_tokens": 2000
}
)
print(response.text)💡 Replace "your-api-key" with your actual API key. Adjust tokens and parameters as needed.
Make informed decisions about which AI API to use by comparing real-time pricing across GPT-5, Claude 4.5, Gemini 3, and more. Our comprehensive calculator helps you estimate costs for text, image, video, and audio processing, including agentic loops, context caching, and reasoning tokens.
Compare costs across 9+ AI models with up-to-date pricing for input, output, and cached tokens. Calculate total expenses including agentic workflows and reasoning overhead.
Estimate costs for text, image, video, and audio processing. Compare pricing across different input modalities to find the best solution for your use case.
Factor in context caching savings, agentic loop costs, and reasoning token overhead to get accurate total cost estimates for your AI applications.
Choosing the right AI API can significantly impact your application's costs. With pricing differences of up to 20x between models, a thorough cost comparison is essential. Our calculator helps you:
Compare costs across major AI providers including OpenAI (GPT-5, GPT-5 Mini, o3), Anthropic (Claude 4.5, Claude 4 Opus), Google (Gemini 3 Pro, Gemini 3 Flash), xAI (Grok-4), and Meta (Llama 4 Maverick). Each model's pricing is calculated based on publicly available API rates.