basedagi.org
▸ compare

Anthropic: Claude Sonnet 4.6 vs OpenAI: GPT-5.1

A side-by-side comparison based on independent public benchmark data, covering scores, pricing, context window, and a per-task breakdown.

▸ verdict
Anthropic: Claude Sonnet 4.6 (88.7, higher score) vs OpenAI: GPT-5.1 (81.1)
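Anthropic: Claude Sonnet 4.6 leads on the composite benchmark score (88.7 vs 81.1), which makes it the stronger choice for general-purpose tasks according to public benchmark data. OpenAI: GPT-5.1 scores higher on coding (61.1 vs 56.5) and marginally higher on reasoning (68.4 vs 68.0).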
▸ score breakdown
Task | Anthropic: Claude Sonnet 4.6 | OpenAI: GPT-5.1 | Δ
Overall | 88.7 | 81.1 | +7.6
Coding | 56.5 | 61.1 | -4.6
Reasoning | 68.0 | 68.4 | -0.4
Math 84.0
Writing 80.0
JSON
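
The Δ column appears to be the first model's score minus the second's (for example, 88.7 - 81.1 = +7.6). A minimal Python sketch of that arithmetic, using only the rows where both scores are listed; the `scores` dictionary and `delta` helper are illustrative names, not part of the site:

```python
# Scores copied from the score breakdown table:
# (Anthropic: Claude Sonnet 4.6, OpenAI: GPT-5.1).
scores = {
    "Overall": (88.7, 81.1),
    "Coding": (56.5, 61.1),
    "Reasoning": (68.0, 68.4),
}


def delta(first: float, second: float) -> float:
    """Return first minus second, rounded to one decimal like the table."""
    return round(first - second, 1)


# Assumption: a positive delta means the first-listed model scored higher.
deltas = {task: delta(a, b) for task, (a, b) in scores.items()}

for task, d in deltas.items():
    print(f"{task}: {d:+.1f}")
```

Rows with only one listed score (Math, Writing) or none (JSON) are left out because no difference can be computed for them.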
▸ specs & pricing
Attribute | Anthropic: Claude Sonnet 4.6 | OpenAI: GPT-5.1
Provider | Anthropic | OpenAI
Context window | 1M | 400K
Input $/M tokens | $3.00 | $1.25
Output $/M tokens | $15.00 | $10.00
Weights | proprietary | proprietary
▸ frequently asked

Is Anthropic: Claude Sonnet 4.6 better than OpenAI: GPT-5.1?

Anthropic: Claude Sonnet 4.6 scores higher overall (88.7 vs 81.1) in the benchmark composite. The best choice depends on the specific use case and budget.

Which is cheaper: Anthropic: Claude Sonnet 4.6 or OpenAI: GPT-5.1?

OpenAI: GPT-5.1 is cheaper on both input and output: $1.25/M input tokens vs $3.00/M for Anthropic: Claude Sonnet 4.6, and $10.00/M output tokens vs $15.00/M.
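
To see how the per-token prices translate into a bill, here is a rough Python sketch. The prices come from the specs table; the `estimate_cost` helper and the example workload (2M input tokens, 500K output tokens) are hypothetical and chosen only for illustration:

```python
# Listed prices in USD per million tokens: (input, output).
PRICES = {
    "Anthropic: Claude Sonnet 4.6": (3.00, 15.00),
    "OpenAI: GPT-5.1": (1.25, 10.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload at the listed per-million rates."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 2M input tokens and 500K output tokens.
for model in PRICES:
    cost = estimate_cost(model, 2_000_000, 500_000)
    print(f"{model}: ${cost:.2f}")
```

For this assumed workload the estimate is $13.50 for Anthropic: Claude Sonnet 4.6 and $7.50 for OpenAI: GPT-5.1. The sketch ignores any discounts, caching, or tiered pricing, which the page does not describe.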

Which model is better for coding?

OpenAI: GPT-5.1 leads on coding with a score of 61.1 vs 56.5 for Anthropic: Claude Sonnet 4.6. The coding score is based on SWE-Bench, Aider Polyglot, and LiveCodeBench data.