Claude Pricing in 2026: API Costs for Opus, Sonnet, and Haiku

EvoLink Team
Product Team
April 9, 2026
9 min read

Last updated: April 9, 2026 · Official vendor pricing verified against Anthropic pricing documentation. EvoLink route pricing reflects current site-supported channel pricing and may differ from direct Anthropic billing.

TL;DR

Claude API pricing depends on which model you pick and which pricing scope you mean. This guide separates:

  • official Anthropic API pricing
  • long-context / caching / batch pricing rules
  • current EvoLink route pricing, where available

Here are the current official base rates per million tokens (MTok):

| Model | Input | Output | Best for |
| --- | --- | --- | --- |
| Claude Opus 4.6 | $5.00 | $25.00 | Flagship reasoning, complex coding, agents |
| Claude Opus 4.5 | $5.00 | $25.00 | Previous-gen flagship, stable versioned ID |
| Claude Opus 4.1 | $15.00 | $75.00 | Agentic coding, multi-file refactors |
| Claude Sonnet 4.6 | $3.00 | $15.00 | Best balance of speed/cost/intelligence |
| Claude Sonnet 4.5 | $3.00 | $15.00 | Recommended default, strong coding |
| Claude Sonnet 4.0 | $3.00 | $15.00 | Production workloads, 1M context beta |
| Claude Haiku 4.5 | $1.00 | $5.00 | Fastest, cheapest; high-volume workloads |
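
To make the per-MTok rates concrete, here is a minimal sketch that turns token counts into a dollar estimate for a single request. The rates are copied from the table above; the dictionary keys and the `request_cost` helper are illustrative names, not part of any Anthropic SDK.

```python
# Base rates in USD per million tokens (MTok), copied from the table above.
# Keys are plain labels chosen for this example, not API model IDs.
BASE_RATES = {
    "Claude Opus 4.6": (5.00, 25.00),
    "Claude Sonnet 4.6": (3.00, 15.00),
    "Claude Haiku 4.5": (1.00, 5.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at base (uncached, non-batch) rates."""
    input_rate, output_rate = BASE_RATES[model]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


# Example: a request with 10,000 input tokens and 2,000 output tokens.
for name in BASE_RATES:
    print(f"{name}: ${request_cost(name, 10_000, 2_000):.4f}")
```

Because output tokens cost five times as much as input tokens on every tier, the 2,000 output tokens in this example account for half of the bill.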

If you are comparing direct vendor pricing with gateway pricing, Anthropic's official rates are the baseline. EvoLink can expose lower route-specific pricing for the same Claude family, but that is a separate pricing scope from direct Anthropic billing.


Claude Opus 4.6 Pricing

Claude Opus 4.6 is Anthropic's latest flagship — the most intelligent Claude model with hybrid reasoning, extended thinking, and the effort parameter for cost/quality control.
| Component | Rate (per MTok) |
| --- | --- |
| Input tokens | $5.00 |
| Output tokens | $25.00 |
| Cache write (5 min) | $6.25 |
| Cache write (1 hour) | $10.00 |
| Cache read | $0.50 (0.1x input) |
| Batch input | $2.50 (50% off) |
| Batch output | $12.50 (50% off) |

Context window: 1M tokens at standard Anthropic pricing
Max output: 64K tokens
Model ID: claude-opus-4-6
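
The cache and batch rates in this table are fixed multiples of the base rates. The sketch below derives them from the base input and output price. The multipliers are inferred from the Opus 4.6 figures above (1.25x and 2x for cache writes, 0.1x for cache reads, 0.5x for batch), and `derived_rates` is a hypothetical helper written for this guide.

```python
# Multipliers inferred from the Opus 4.6 table above (an assumption of this
# sketch, not values quoted from Anthropic documentation).
CACHE_WRITE_5MIN = 1.25
CACHE_WRITE_1HOUR = 2.0
CACHE_READ = 0.1
BATCH = 0.5


def derived_rates(base_input: float, base_output: float) -> dict[str, float]:
    """Derive cache and batch rates (USD per MTok) from the base rates."""
    return {
        "cache_write_5min": round(base_input * CACHE_WRITE_5MIN, 2),
        "cache_write_1hour": round(base_input * CACHE_WRITE_1HOUR, 2),
        "cache_read": round(base_input * CACHE_READ, 2),
        "batch_input": round(base_input * BATCH, 2),
        "batch_output": round(base_output * BATCH, 2),
    }


print(derived_rates(5.00, 25.00))  # Opus 4.6 base rates
```

Applying the same multipliers to the Sonnet and Haiku base rates reproduces the cache and batch figures listed for those tiers in this guide.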

When Opus 4.6 is worth the premium

  • Complex multi-step coding (80.8% SWE-bench Verified)
  • Agent orchestration with tool use
  • Research synthesis requiring extended thinking
  • Tasks where quality matters more than speed

Claude Opus 4.5 Pricing

Claude Opus 4.5 shares the same pricing as Opus 4.6 and is a stable previous-generation option with a pinned versioned ID.
| Component | Rate (per MTok) |
| --- | --- |
| Input tokens | $5.00 |
| Output tokens | $25.00 |
| Cache write (5 min) | $6.25 |
| Cache read | $0.50 |

Model ID: claude-opus-4-5-20251101

Claude Opus 4.1 Pricing

Claude Opus 4.1 is priced higher than Opus 4.5/4.6 but delivers 74.5% SWE-bench with specialized multi-file refactoring strength.
| Component | Rate (per MTok) |
| --- | --- |
| Input tokens | $15.00 |
| Output tokens | $75.00 |
| Cache write (5 min) | $18.75 |
| Cache read | $1.50 |

Model ID: claude-opus-4-1-20250805

Is Opus 4.1 still worth using?

Opus 4.1 costs three times as much per token as Opus 4.6 ($15/$75 versus $5/$25). Use it only if your workflow depends on behavior specific to that model or you need to keep a pinned version for compatibility. For most new workloads, Opus 4.6 at $5/$25 is the better value.


Claude Sonnet 4.6 Pricing

Claude Sonnet 4.6 is the best balance of speed, intelligence, and cost — with 128K max output (double Opus) and extended thinking support.
| Component | Rate (per MTok) |
| --- | --- |
| Input tokens | $3.00 |
| Output tokens | $15.00 |
| Cache write (5 min) | $3.75 |
| Cache read | $0.30 |
| Batch input | $1.50 (50% off) |
| Batch output | $7.50 (50% off) |

Context window: 1M tokens at standard Anthropic pricing
Max output: 128K tokens
Model ID: claude-sonnet-4-6

Claude Sonnet 4.5 Pricing

Claude Sonnet 4.5 is Anthropic's recommended default model — best coding model to date with strong agent capabilities.
| Component | Rate (per MTok) |
| --- | --- |
| Input tokens | $3.00 |
| Output tokens | $15.00 |
| Cache write (5 min) | $3.75 |
| Cache read | $0.30 |

Context window: 200K tokens, with 1M long-context pricing still treated separately
Model ID: claude-sonnet-4-5-20250929

Claude Sonnet 4.0 Pricing

Claude Sonnet 4.0 shares Sonnet-tier pricing and still uses separate long-context pricing above 200K tokens.
| Component | Rate (per MTok) |
| --- | --- |
| Input tokens | $3.00 |
| Output tokens | $15.00 |
| Input (>200K, 1M beta) | $6.00 |
| Output (>200K, 1M beta) | $22.50 |
| Cache write (5 min) | $3.75 |
| Cache read | $0.30 |

Model ID: claude-sonnet-4-20250514 (alias: claude-sonnet-4-0)
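
The long-context rates change the arithmetic once a prompt crosses 200K input tokens. The sketch below assumes that the whole request, not only the tokens above the threshold, is billed at the long-context rates once the threshold is crossed. Verify that rule against Anthropic's pricing page before relying on it. `sonnet_4_cost` is a hypothetical helper.

```python
LONG_CONTEXT_THRESHOLD = 200_000  # input tokens

# USD per MTok, from the Sonnet 4.0 table above.
STANDARD_RATES = (3.00, 15.00)
LONG_CONTEXT_RATES = (6.00, 22.50)


def sonnet_4_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one Sonnet 4.0 request.

    Assumption: when input exceeds the threshold, every token in the
    request is billed at the long-context rates.
    """
    if input_tokens > LONG_CONTEXT_THRESHOLD:
        input_rate, output_rate = LONG_CONTEXT_RATES
    else:
        input_rate, output_rate = STANDARD_RATES
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


print(f"150K input: ${sonnet_4_cost(150_000, 4_000):.2f}")
print(f"300K input: ${sonnet_4_cost(300_000, 4_000):.2f}")
```

Under that assumption, doubling the prompt from 150K to 300K tokens more than triples the request cost, which is worth knowing before feeding very large documents to this model.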

Claude Haiku 4.5 Pricing

Claude Haiku 4.5 is the fastest and cheapest Claude 4 model — matches Sonnet 4 on coding at a fraction of the cost.
| Component | Rate (per MTok) |
| --- | --- |
| Input tokens | $1.00 |
| Output tokens | $5.00 |
| Cache write (5 min) | $1.25 |
| Cache read | $0.10 |
| Batch input | $0.50 (50% off) |
| Batch output | $2.50 (50% off) |

Context window: 200K tokens
Model ID: claude-haiku-4-5-20251001

When to choose Haiku over Sonnet

  • Free-tier products where cost per request matters
  • Parallelized sub-agents and high-volume pipelines
  • Latency-sensitive chat and support agents
  • Tasks where Sonnet 4 quality is sufficient (73.3% SWE-bench)

Cost Comparison: Claude vs GPT vs Gemini

How does Claude pricing compare to other frontier models? Here's a side-by-side:

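| Model | Input/MTok | Output/MTok | Context | Max Output |
| --- | --- | --- | --- | --- |
| Claude Opus 4.6 | $5.00 | $25.00 | 1M | 64K |
| Claude Sonnet 4.6 | $3.00 | $15.00 | 1M | 128K |
| Claude Haiku 4.5 | $1.00 | $5.00 | 200K | |
| GPT-5.4 | $2.50 | $20.00 | 1M | 128K |
| GPT-5.2 | $1.75 | $14.00 | 400K | 128K |
| Gemini 3.1 Pro | $2.00 | $12.00 | 1M | 64K |
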
For a detailed model-by-model comparison, see GPT-5.4 vs Claude Opus 4.6 vs Gemini 3.1 Pro.
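
Per-token rates are easier to compare when applied to a concrete workload. The sketch below ranks the models in the comparison by monthly cost for an assumed volume of 100 MTok input and 20 MTok output. The volume is illustrative only, and `monthly_cost` and `rank_by_cost` are hypothetical helpers.

```python
# (input, output) rates in USD per MTok, copied from the comparison table.
RATES = {
    "Claude Opus 4.6": (5.00, 25.00),
    "Claude Sonnet 4.6": (3.00, 15.00),
    "Claude Haiku 4.5": (1.00, 5.00),
    "GPT-5.4": (2.50, 20.00),
    "GPT-5.2": (1.75, 14.00),
    "Gemini 3.1 Pro": (2.00, 12.00),
}


def monthly_cost(model: str, input_mtok: float, output_mtok: float) -> float:
    """Return the USD cost for a volume given in millions of tokens."""
    input_rate, output_rate = RATES[model]
    return input_rate * input_mtok + output_rate * output_mtok


def rank_by_cost(input_mtok: float, output_mtok: float) -> list[tuple[str, float]]:
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = [(name, monthly_cost(name, input_mtok, output_mtok)) for name in RATES]
    return sorted(costs, key=lambda pair: pair[1])


for name, cost in rank_by_cost(100, 20):
    print(f"{name:<18} ${cost:,.2f}")
```

The ranking shifts with the input-to-output ratio, so rerun the comparison with your own traffic mix rather than relying on a single headline rate.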

How to Reduce Claude API Costs

1. Use prompt caching

If you reuse system prompts or long context across requests, caching can cut input costs by up to 90%. Cache reads are billed at 0.1x the base input rate.
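
How close you get to the 90% figure depends on how many times the cached prefix is reused. The sketch below models one cache write followed by cache reads, using the 1.25x write and 0.1x read multipliers quoted in this guide. It assumes every later request arrives while the cache entry is still alive, and both function names are hypothetical.

```python
def cost_without_cache(prefix_tokens: int, requests: int, input_rate: float) -> float:
    """USD cost of resending the same prefix on every request, uncached."""
    return prefix_tokens * input_rate * requests / 1_000_000


def cost_with_cache(
    prefix_tokens: int,
    requests: int,
    input_rate: float,
    write_mult: float = 1.25,
    read_mult: float = 0.1,
) -> float:
    """USD cost of one cache write followed by (requests - 1) cache reads.

    Assumption: every request after the first hits a live cache entry.
    """
    if requests < 1:
        raise ValueError("requests must be at least 1")
    write = prefix_tokens * input_rate * write_mult
    reads = prefix_tokens * input_rate * read_mult * (requests - 1)
    return (write + reads) / 1_000_000


# Example: a 50,000-token system prompt reused 100 times on Sonnet 4.6 ($3/MTok).
plain = cost_without_cache(50_000, 100, 3.00)
cached = cost_with_cache(50_000, 100, 3.00)
print(f"uncached ${plain:.4f}, cached ${cached:.4f}, saving {1 - cached / plain:.1%}")
```

A single request costs more with caching than without, because the write is billed at 1.25x. Under these assumptions the saving starts with the second request and approaches 90% as reuse grows.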

2. Use batch processing

Batch API provides a 50% discount on both input and output tokens for asynchronous jobs. Ideal for nightly automation, data processing, and non-real-time workloads.
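
In practice only part of a workload can wait for asynchronous processing. The sketch below estimates the blended bill when a given fraction of spend moves to batch at the 50% discount. `blended_cost` is a hypothetical helper and the figures are illustrative.

```python
def blended_cost(
    standard_cost: float,
    batchable_fraction: float,
    batch_discount: float = 0.5,
) -> float:
    """Return the USD cost after moving part of a workload to batch.

    standard_cost: what the whole workload costs at real-time rates.
    batchable_fraction: share of that spend (0.0 to 1.0) that can run async.
    """
    if not 0.0 <= batchable_fraction <= 1.0:
        raise ValueError("batchable_fraction must be between 0 and 1")
    return standard_cost * (1 - batchable_fraction * batch_discount)


# Example: $1,000 per month at standard rates, 60% of it batchable.
print(f"${blended_cost(1_000, 0.6):,.2f}")
```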

3. Pick the right model tier

Don't use Opus for tasks Sonnet can handle. Don't use Sonnet for tasks Haiku can handle. Match model capability to task complexity.
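
One way to enforce this is a small routing table that maps a task's complexity to the cheapest tier expected to handle it. The sketch below is illustrative only: the `Complexity` levels and `pick_model` are hypothetical, and how you classify a task is up to your application. The model IDs are the ones listed in this guide.

```python
from enum import Enum


class Complexity(Enum):
    SIMPLE = "simple"      # classification, extraction, short replies
    MODERATE = "moderate"  # everyday coding and analysis
    HARD = "hard"          # multi-step reasoning, agent orchestration


# Cheapest tier assumed adequate for each level (an assumption of this sketch).
MODEL_FOR_COMPLEXITY = {
    Complexity.SIMPLE: "claude-haiku-4-5-20251001",
    Complexity.MODERATE: "claude-sonnet-4-6",
    Complexity.HARD: "claude-opus-4-6",
}


def pick_model(complexity: Complexity) -> str:
    """Return the model ID to use for a task of the given complexity."""
    return MODEL_FOR_COMPLEXITY[complexity]


print(pick_model(Complexity.SIMPLE))
```

Starting every task class on the cheapest tier and promoting it only when output quality falls short keeps the default spend low.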

4. Consider EvoLink route pricing

EvoLink currently lists Claude route pricing below Anthropic's direct rates for some tiers. That is channel-specific pricing, not a replacement for Anthropic's official pricing page:
  • Claude Opus 4.6: from $4.50/MTok input (−10%)
  • Claude Sonnet 4.6: from $2.70/MTok input (−10%)
  • Claude Haiku 4.5: from $0.90/MTok input (−10%)

Use direct Anthropic pricing when budgeting for vendor-native usage. Use EvoLink route pricing when you specifically plan to buy Claude through EvoLink's OpenAI-compatible endpoint.

5. Monitor and set budgets

Use Anthropic's usage dashboard or EvoLink's built-in monitoring to track spend by model, set alerts, and avoid surprise bills.
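
If you also want a guard inside your own application, a client-side tracker can accumulate estimated spend per model and flag when a threshold is crossed. The sketch below is a hypothetical in-memory example. It does not call Anthropic's or EvoLink's dashboards, and the 80% alert level is an arbitrary choice.

```python
from collections import defaultdict


class BudgetTracker:
    """Accumulate estimated spend per model and report budget status."""

    def __init__(self, monthly_budget_usd: float, alert_fraction: float = 0.8):
        self.monthly_budget_usd = monthly_budget_usd
        self.alert_fraction = alert_fraction
        self.spend_by_model: dict[str, float] = defaultdict(float)

    def record(self, model: str, input_tokens: int, output_tokens: int,
               input_rate: float, output_rate: float) -> float:
        """Add one request's estimated cost (rates in USD per MTok)."""
        cost = (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000
        self.spend_by_model[model] += cost
        return cost

    @property
    def total(self) -> float:
        return sum(self.spend_by_model.values())

    def status(self) -> str:
        """Return 'ok', 'alert', or 'over_budget' for the current total."""
        if self.total >= self.monthly_budget_usd:
            return "over_budget"
        if self.total >= self.alert_fraction * self.monthly_budget_usd:
            return "alert"
        return "ok"


tracker = BudgetTracker(monthly_budget_usd=100.0)
tracker.record("claude-sonnet-4-6", 10_000_000, 2_000_000, 3.00, 15.00)
print(tracker.status(), f"${tracker.total:.2f}")
```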


FAQ

How much does Claude API cost?

Claude API pricing varies by model. On Anthropic's official API pricing, Haiku 4.5 starts at $1/$5 per million tokens, Sonnet 4.6 is $3/$15, and Opus 4.6 is $5/$25. Prompt caching and batch processing can reduce effective costs further. All figures here are per million tokens as of April 9, 2026.

What is the cheapest Claude API model?

Claude Haiku 4.5 at $1 input / $5 output per million tokens. It is the lowest-cost current Claude model for high-volume production workloads.

Is Claude API free?

There is no permanent free Claude API tier for production usage. Anthropic's API pricing is usage-based, and any trial or promotional credit availability should be checked against Anthropic's current billing pages. EvoLink's site also currently advertises free trial credits for new accounts.

How does Claude pricing compare to GPT?

Claude Sonnet 4.6 ($3/$15) sits above GPT-5.2 ($1.75/$14) and Gemini 3.1 Pro ($2/$12) on direct vendor pricing, while Claude Opus 4.6 ($5/$25) is the premium Claude tier. Through EvoLink, route pricing can narrow that gap, but those prices should be treated separately from official Anthropic billing.

What is prompt caching and how much does it save?

Prompt caching lets you reuse frequently sent context (system prompts, documents) across requests. Writes to the 5-minute cache cost 1.25x the base input rate (the 1-hour cache write listed for Opus 4.6 works out to 2x), while cache reads cost only 0.1x, saving up to 90% on repeated input tokens. This is especially valuable for agents and chatbots with long system prompts.

Does Claude have batch processing discounts?

Yes. Anthropic's Batch API provides a 50% discount on both input and output tokens. Jobs are processed asynchronously within 24 hours. This is ideal for data pipelines, content generation, and any workload that doesn't need real-time responses.

Where can I access Claude API at the best price?

EvoLink currently advertises Claude route pricing below official Anthropic rates through its OpenAI-compatible endpoint. If you plan to buy Claude through EvoLink rather than directly from Anthropic, compare against EvoLink's route-specific pricing page rather than Anthropic's.

Ready to integrate Claude into your product? EvoLink gives you access to every Claude model through a single OpenAI-compatible endpoint — with discounted route pricing, built-in rate limit handling, and usage monitoring.


Prices sourced from anthropic.com/pricing as of April 2026. Rates may change — always verify current pricing before final budgeting.
