Question 1

Is Anthropic: Claude Sonnet 4.6 better than OpenAI: GPT-5.1?

Accepted Answer

Anthropic: Claude Sonnet 4.6 scores higher overall (88.7 vs 81.1) in the benchmark composite. The best choice depends on the specific use case and budget.

Question 2

Which is cheaper: Anthropic: Claude Sonnet 4.6 or OpenAI: GPT-5.1?

Accepted Answer

OpenAI: GPT-5.1 is cheaper at $1.25/M input tokens vs $3/M for Anthropic: Claude Sonnet 4.6.

Question 3

Which model is better for coding?

Accepted Answer

OpenAI: GPT-5.1 leads on coding with a score of 61.1 vs 56.5 for Anthropic: Claude Sonnet 4.6. This is based on SWE-Bench, Aider Polyglot, and LiveCodeBench data.

Task	Anthropic: Claude Sonnet 4.6	OpenAI: GPT-5.1	Δ
Overall	88.7	81.1	+7.6
Coding	56.5	61.1	-4.6
Reasoning	68.0	68.4	-0.4
Math	—	84.0	—
Writing	—	80.0	—
JSON	—	—	—

Attribute	Anthropic: Claude Sonnet 4.6	OpenAI: GPT-5.1
Provider	Anthropic	Openai
Context window	1M	400K
Input $/M tokens	$3.00/M	$1.25/M
Output $/M tokens	$15.00/M	$10.00/M
Weights	proprietary	proprietary

Anthropic: Claude Sonnet 4.6 vs OpenAI: GPT-5.1

Is Anthropic: Claude Sonnet 4.6 better than OpenAI: GPT-5.1?

Which is cheaper: Anthropic: Claude Sonnet 4.6 or OpenAI: GPT-5.1?

Which model is better for coding?