LLM API Pricing Trends Q2 2026: Who Got Cheaper, Who Got Expensive

By Codcompass Team · 54 min read

The LLM market has repriced dramatically since early 2025. Frontier intelligence that cost $10/M input tokens 18 months ago now runs $1–3/M. Budget tiers have hit $0.10/M. But not every direction is down: Anthropic's budget tier got more expensive when Haiku 3 retired. Here's the full picture.

If you haven't re-evaluated your model selection in the past six months, you are almost certainly overpaying. The LLM pricing landscape has moved more in Q1–Q2 2026 than in most full calendar years before it. Multiple flagship models dropped 50–80% in price. New model generations entered with competitive pricing from day one. And a few quiet deprecations pushed some teams onto more expensive tiers without their noticing.

This is a full-provider pricing audit as of May 2026: what changed, by how much, and what it means for production workloads. All pricing reflects published API rates. Use the Benchwright /compare tool to model your specific call volume and token mix.
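For a quick estimate without a tool, the arithmetic behind "call volume and token mix" can be sketched in a few lines of Python. This is a minimal sketch, not the Benchwright calculation: the names `Rate`, `RATES`, and `monthly_cost` are illustrative, the workload numbers are hypothetical, and the four rates are copied from this article's pricing table. It ignores caching, batch discounts, and long-context tiers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rate:
    """Published API price in USD per 1M tokens."""
    input_per_m: float
    output_per_m: float


# Rates copied from this article's Q2 2026 pricing table.
RATES = {
    "GPT-4.1": Rate(2.00, 8.00),
    "GPT-4.1 Nano": Rate(0.10, 0.40),
    "Claude Sonnet 4.6": Rate(3.00, 15.00),
    "Gemini 2.5 Flash": Rate(0.30, 2.50),
}


def monthly_cost(rate: Rate, calls: int, input_tokens: int, output_tokens: int) -> float:
    """Monthly USD cost for `calls` requests with a fixed per-call token mix."""
    per_call = (
        input_tokens * rate.input_per_m + output_tokens * rate.output_per_m
    ) / 1_000_000
    return per_call * calls


if __name__ == "__main__":
    # Hypothetical workload: 1M calls/month, 1,500 input and 300 output tokens per call.
    calls, tokens_in, tokens_out = 1_000_000, 1_500, 300
    costs = {
        name: monthly_cost(rate, calls, tokens_in, tokens_out)
        for name, rate in RATES.items()
    }
    # Print cheapest first.
    for name, cost in sorted(costs.items(), key=lambda kv: kv[1]):
        print(f"{name:<20} ${cost:,.2f}")
```

With these assumed numbers, GPT-4.1 Nano comes to $270 per month and Claude Sonnet 4.6 to $9,000. Because output tokens are priced several times higher than input tokens for every model listed, the ranking can shift for output-heavy workloads, so it is worth rerunning with your own mix.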

The Full Pricing Table: Q2 2026

Every major provider, current rates, with change indicators versus late 2025 prices.

| Provider | Model | Input ($/1M tokens) | Output ($/1M tokens) | vs Late 2025 |
|---|---|---|---|---|
| OpenAI | GPT-4o | $2.50 | $10.00 | −50% input |
| OpenAI | GPT-4o mini | $0.15 | $0.60 | Stable |
| OpenAI | GPT-4.1 | $2.00 | $8.00 | NEW |
| OpenAI | GPT-4.1 Nano | $0.10 | $0.40 | NEW |
| OpenAI | o3 | $2.00 | $8.00 | −80% |
| OpenAI | o4-mini | $1.10 | $4.40 | NEW |
| OpenAI | GPT-5 | $1.25 | $10.00 | NEW |
| Anthropic | Claude Haiku 3 (retired) | $0.25 | $1.25 | EOL Apr 19 |
| Anthropic | Claude Haiku 3.5 | $0.80 | $4.00 | Stable |
| Anthropic | Claude Haiku 4.5 | $1.00 | $5.00 | NEW |
| Anthropic | Claude Sonnet 4.6 | $3.00 | $15.00 | NEW |
| Anthropic | Claude Opus 4.6 | $5.00 | $25.00 | NEW |
| Google | Gemini 2.0 Flash | $0.10 | $0.40 | EOL Jun 1 |
| Google | Gemini 2.5 Flash-Lite | $0.10 | $0.40 | NEW |
| Google | Gemini 2.5 Flash | $0.30 | $2.50 | NEW |
| Google | Gemini 2.5 Pro (≤200K) | $1.25 | $10.00 | −25% vs 1.5 Pro |
| Mistral | Mistral Small 3.1 | $0.10 | $0.30 | −75% |
| Mistral | Mistral Large 3 | $2.00 | $6.00 | −50% |
| xAI | Grok 4.3 | $1.25 | $2.50 | −83% output |
| DeepSeek | DeepSeek V3.2 | $0.28 | $0.42 | Stable |
| DeepSeek | DeepSeek R1 | $0.55 | $2.19 | Stable |
| Meta | Llama 4 Maverick (Together) | $0.15 | $0.60 | NEW |
| Cohere | Command A | $2.50 | $10.00 | NEW flagship |
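The change indicators in the table, and the cost of being moved off a retired tier, come down to simple percentage arithmetic. The sketch below is illustrative only: `pct_change` and `implied_prior` are hypothetical helper names, the Haiku rates are taken from the table, and the $5/M figure is the earlier GPT-4o input price quoted in this article. The o3 calculation assumes the −80% applies to the input rate, which the table does not specify.

```python
def pct_change(old: float, new: float) -> float:
    """Percentage change from old to new; negative means cheaper."""
    return (new - old) / old * 100


def implied_prior(current: float, pct: float) -> float:
    """Price before a change of `pct` percent, given the current price."""
    return current / (1 + pct / 100)


# (input, output) in USD per 1M tokens, from the table above.
HAIKU_3 = (0.25, 1.25)
SUCCESSORS = {
    "Haiku 3.5": (0.80, 4.00),
    "Haiku 4.5": (1.00, 5.00),
}

if __name__ == "__main__":
    # Cost impact of moving off the retired Haiku 3 tier.
    for label, (new_in, new_out) in SUCCESSORS.items():
        d_in = pct_change(HAIKU_3[0], new_in)
        d_out = pct_change(HAIKU_3[1], new_out)
        print(f"Haiku 3 -> {label}: input {d_in:+.0f}%, output {d_out:+.0f}%")

    # GPT-4o input: $5/M earlier, $2.50/M now.
    print(f"GPT-4o input: {pct_change(5.00, 2.50):+.0f}%")

    # Assumption: the table's -80% for o3 refers to the input rate.
    print(f"o3 implied prior input: ${implied_prior(2.00, -80):.2f}/M")
```

On these figures, a workload moved from Haiku 3 to Haiku 3.5 pays 3.2 times as much per token on both input and output, and 4 times as much on Haiku 4.5, regardless of token mix.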

Who Got Cheaper (and by How Much)

OpenAI: Aggressive Repricing Across the Board

OpenAI has made the most dramatic pricing moves of any major provider in 2026. GPT-4o input dropped from $5/M to $2.50/M in a cut that happened quietly in mid-2025 and held into Q2 2026. The bigger story

