Question 1

Is Anthropic: Claude Opus 4.5 better than OpenAI: GPT-5.1?

Accepted Answer

Anthropic: Claude Opus 4.5 scores higher on the overall benchmark composite (86.2 vs 81.1, a 5.1-point lead). OpenAI: GPT-5.1 costs less and has a larger context window (400K vs 200K), so the best choice depends on the use case and budget.

Question 2

Which is cheaper: Anthropic: Claude Opus 4.5 or OpenAI: GPT-5.1?

Accepted Answer

OpenAI: GPT-5.1 is cheaper on both input and output: $1.25/M input tokens vs $5/M for Anthropic: Claude Opus 4.5, and $10/M output tokens vs $25/M.

Question 3

Which model is better for coding?

Accepted Answer

Anthropic: Claude Opus 4.5 leads on coding with a score of 69.1 vs 61.1 for OpenAI: GPT-5.1, an 8.0-point gap. The coding score is based on SWE-Bench, Aider Polyglot, and LiveCodeBench data.

Task	Anthropic: Claude Opus 4.5	OpenAI: GPT-5.1	Δ
Overall	86.2	81.1	+5.1
Coding	69.1	61.1	+8.0
Reasoning	71.7	68.4	+3.3
Math	—	84.0	—
Writing	84.7	80.0	+4.7
JSON	46.1	—	—

Attribute	Anthropic: Claude Opus 4.5	OpenAI: GPT-5.1
Provider	Anthropic	OpenAI
Context window	200K	400K
Input $/M tokens	$5.00	$1.25
Output $/M tokens	$25.00	$10.00
Weights	proprietary	proprietary
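
The per-million-token prices in the table above can be turned into a rough per-request estimate. The sketch below is illustrative only: the `request_cost` helper and the 10,000-input / 2,000-output workload are assumptions made up for the example, and it uses list prices without caching, batch discounts, or other pricing tiers.

```python
# Prices in USD per million tokens, copied from the attribute table.
PRICES = {
    "Anthropic: Claude Opus 4.5": {"input": 5.00, "output": 25.00},
    "OpenAI: GPT-5.1": {"input": 1.25, "output": 10.00},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at list prices (hypothetical helper)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Hypothetical workload: 10,000 input tokens and 2,000 output tokens per request.
for model in PRICES:
    print(f"{model}: ${request_cost(model, 10_000, 2_000):.4f}")
```

For this assumed workload the estimate comes to about $0.10 per request for Anthropic: Claude Opus 4.5 and about $0.0325 for OpenAI: GPT-5.1. The ratio shifts with the input/output mix, since the input price gap (4x) is wider than the output price gap (2.5x).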

Anthropic: Claude Opus 4.5 vs OpenAI: GPT-5.1
