▸ compare
Anthropic: Claude Sonnet 4.5 vs OpenAI: GPT-5.2
Side-by-side benchmark comparison based on independent public data. Scores, pricing, context, and task breakdown.
▸ verdict
Anthropic: Claude Sonnet 4.5
85.7
higher score
vs
OpenAI: GPT-5.2
81.3
Anthropic: Claude Sonnet 4.5 leads with a higher composite benchmark score. It's the stronger choice for general-purpose tasks based on public benchmark data.
▸ score breakdown
| Task | Anthropic: Claude Sonnet 4.5 | OpenAI: GPT-5.2 | Δ |
|---|---|---|---|
| Overall | 85.7 | 81.3 | +4.4 |
| Coding | 59.8 | 69.9 | -10.1 |
| Reasoning | 45.3 | 67.8 | -22.5 |
| Math | — | — | — |
| Writing | 84.3 | 79.9 | +4.4 |
| JSON | 28.6 | 54.2 | -25.6 |
▸ specs & pricing
| Attribute | Anthropic: Claude Sonnet 4.5 | OpenAI: GPT-5.2 |
|---|---|---|
| Provider | Anthropic | Openai |
| Context window | 1M | 400K |
| Input $/M tokens | $3.00/M | $1.75/M |
| Output $/M tokens | $15.00/M | $14.00/M |
| Weights | proprietary | proprietary |
▸ frequently asked
Is Anthropic: Claude Sonnet 4.5 better than OpenAI: GPT-5.2?
Anthropic: Claude Sonnet 4.5 scores higher overall (85.7 vs 81.3) in the benchmark composite. The best choice depends on the specific use case and budget.
Which is cheaper: Anthropic: Claude Sonnet 4.5 or OpenAI: GPT-5.2?
OpenAI: GPT-5.2 is cheaper at $1.75/M input tokens vs $3/M for Anthropic: Claude Sonnet 4.5.
Which model is better for coding?
OpenAI: GPT-5.2 leads on coding with a score of 69.9 vs 59.8 for Anthropic: Claude Sonnet 4.5. This is based on SWE-Bench, Aider Polyglot, and LiveCodeBench data.