live
weekly refresh
basedagi.org
▸ compare

Anthropic: Claude Opus 4.6 vs Anthropic: Claude Opus 4.5

Side-by-side benchmark comparison based on independent public data. Scores, pricing, context, and task breakdown.

▸ verdict
Anthropic: Claude Opus 4.6
96.6
higher score
vs
Anthropic: Claude Opus 4.5
86.3
Anthropic: Claude Opus 4.6 leads with a higher composite benchmark score. It's the stronger choice for general-purpose tasks based on public benchmark data.
▸ score breakdown
TaskAnthropic: Claude Opus 4.6Anthropic: Claude Opus 4.5Δ
Overall96.686.3+10.3
Coding71.269.1+2.1
Reasoning71.8
Math
Writing84.8
JSON46.1
▸ specs & pricing
AttributeAnthropic: Claude Opus 4.6Anthropic: Claude Opus 4.5
ProviderAnthropicAnthropic
Context window1M200K
Input $/M tokens$5.00/M$5.00/M
Output $/M tokens$25.00/M$25.00/M
Weightsproprietaryproprietary
▸ frequently asked

Is Anthropic: Claude Opus 4.6 better than Anthropic: Claude Opus 4.5?

Anthropic: Claude Opus 4.6 scores higher overall (96.6 vs 86.3) in the benchmark composite. The best choice depends on the specific use case and budget.

Which is cheaper: Anthropic: Claude Opus 4.6 or Anthropic: Claude Opus 4.5?

Both models are priced equally at $5/M input tokens.

Which model is better for coding?

Anthropic: Claude Opus 4.6 leads on coding with a score of 71.2 vs 69.1 for Anthropic: Claude Opus 4.5. This is based on SWE-Bench, Aider Polyglot, and LiveCodeBench data.