The Claude 4.x family

Three model sizes, one philosophy: same family, different speed/cost/capability sliders. Pick by task. Mix in production.

On this page

1. TL;DR

ModelIDBest atCostContext
Opus 4.7claude-opus-4-7Hard reasoning, multi-step plans, ambiguous specs.$$$200k (1M variant available)
Sonnet 4.6claude-sonnet-4-6The daily driver. 90% of pro work.$$200k
Haiku 4.5claude-haiku-4-5-20251001High-volume, latency-critical, bounded tasks.$200k
If you only remember one thing: default to Sonnet. Reach for Opus when the task is open-ended or ambiguous, and Haiku when you'll make a million calls and each one is small.

2. Claude Opus 4.7

The reasoning flagship. Best at multi-step plans, large refactors, design decisions, and anything where the model needs to think before answering. Extended thinking is supported and recommended for hard problems.

Use when

Avoid when

1M context variant

For huge codebases or massive document workloads, claude-opus-4-7[1m] exposes a 1M-token context window. Use sparingly — token cost is the same per-token but you'll easily ship 500k tokens per request. Pairs perfectly with prompt caching.

3. Claude Sonnet 4.6

The default model for almost everything. Strong reasoning, fast enough for interactive UX, ~5× cheaper than Opus. Most production apps should start here.

Use when

Promote to Opus when

4. Claude Haiku 4.5

The cheap, fast model for high-volume tasks. Strong enough for classification, routing, intent detection, autocomplete, simple Q&A. Not the model for complex reasoning.

Use when

Common patterns

No extended thinking on Haiku. It's a speed-first model. If you find yourself needing thinking, upgrade to Sonnet.

5. How to pick (decision flow)

Is the task open-ended / multi-step / ambiguous?
  └─ Yes → Opus 4.7
  └─ No  → Is latency critical OR are you making >100k calls/day?
            └─ Yes → Haiku 4.5
            └─ No  → Sonnet 4.6

Validate the choice with an eval set of ~50 representative inputs. If Sonnet hits your bar, stay there. If it doesn't, escalate.

6. Mixing models in one product

Most pro deployments use multiple models. Common topology:

StageModelWhy
Route the user's intentHaiku 4.5Fast, cheap, accurate enough for "is this support / sales / engineering?"
Filter / guardrailHaiku 4.5Same — cheap precheck before paying for Sonnet.
Execute the user's requestSonnet 4.6Quality + cost sweet spot.
Handle the gnarly edge caseOpus 4.7Escalation path when Sonnet's confidence is low.
Generate the final polished outputSonnet 4.6 or Opus 4.7Quality matters at the end.

Cost-wise, this beats "always Opus" by 5–20×, often with no quality drop.

7. Migration notes (4.5 / 4.6 → 4.7)

Sonnet 4.6 → Opus 4.7 / Sonnet 4.7 is largely a model-string change. The API surface is stable. Things to know:

8. Deprecated / retired models

Anthropic retires older models on a published schedule. If your code still references one of these, plan a migration:

OldReplace with
claude-3-opus-*claude-opus-4-7
claude-3-5-sonnet-*claude-sonnet-4-6
claude-3-haiku-*claude-haiku-4-5-20251001
claude-3-5-haiku-*claude-haiku-4-5-20251001

Check docs.claude.com/about-claude/models for the live retirement schedule.