▸ compare
OpenAI: GPT-5.4 vs Anthropic: Claude Sonnet 4.6
Side-by-side benchmark comparison based on independent public data. Scores, pricing, context, and task breakdown.
▸ verdict
OpenAI: GPT-5.4
90.2
vs
Anthropic: Claude Sonnet 4.6
88.7
These two models are effectively tied on overall score within the two-point margin. Choose based on pricing, context window, or task-specific performance below.
▸ score breakdown
| Task | OpenAI: GPT-5.4 | Anthropic: Claude Sonnet 4.6 | Δ |
|---|---|---|---|
| Overall | 90.2 | 88.7 | +1.5 |
| Coding | — | 56.5 | — |
| Reasoning | 74.4 | 68.1 | +6.3 |
| Math | — | — | — |
| Writing | 88.9 | — | — |
| JSON | — | — | — |
▸ specs & pricing
| Attribute | OpenAI: GPT-5.4 | Anthropic: Claude Sonnet 4.6 |
|---|---|---|
| Provider | Openai | Anthropic |
| Context window | 1M | 1M |
| Input $/M tokens | $2.50/M | $3.00/M |
| Output $/M tokens | $15.00/M | $15.00/M |
| Weights | proprietary | proprietary |
▸ frequently asked
Is OpenAI: GPT-5.4 better than Anthropic: Claude Sonnet 4.6?
OpenAI: GPT-5.4 and Anthropic: Claude Sonnet 4.6 are closely matched in overall score. Choose based on pricing, context window, or the task-specific scores in the table above.
Which is cheaper: OpenAI: GPT-5.4 or Anthropic: Claude Sonnet 4.6?
OpenAI: GPT-5.4 is cheaper at $2.5/M input tokens vs $3/M for Anthropic: Claude Sonnet 4.6.
Which model is better for coding?
Both models perform similarly on coding benchmarks (SWE-Bench, Aider Polyglot, LiveCodeBench).