live
weekly refresh
basedagi.org
▸ compare

OpenAI: GPT-5.5 vs Anthropic: Claude Sonnet 4.5

Side-by-side benchmark comparison based on independent public data. Scores, pricing, context, and task breakdown.

▸ verdict
OpenAI: GPT-5.5
91.1
higher score
vs
Anthropic: Claude Sonnet 4.5
85.7
OpenAI: GPT-5.5 leads with a higher composite benchmark score. It's the stronger choice for general-purpose tasks based on public benchmark data.
▸ score breakdown
TaskOpenAI: GPT-5.5Anthropic: Claude Sonnet 4.5Δ
Overall91.185.7+5.4
Coding59.8
Reasoning78.445.3+33.1
Math
Writing88.784.3+4.4
JSON28.6
▸ specs & pricing
AttributeOpenAI: GPT-5.5Anthropic: Claude Sonnet 4.5
ProviderOpenaiAnthropic
Context window1M1M
Input $/M tokens$5.00/M$3.00/M
Output $/M tokens$30.00/M$15.00/M
Weightsproprietaryproprietary
▸ frequently asked

Is OpenAI: GPT-5.5 better than Anthropic: Claude Sonnet 4.5?

OpenAI: GPT-5.5 scores higher overall (91.1 vs 85.7) in the benchmark composite. The best choice depends on the specific use case and budget.

Which is cheaper: OpenAI: GPT-5.5 or Anthropic: Claude Sonnet 4.5?

Anthropic: Claude Sonnet 4.5 is cheaper at $3/M input tokens vs $5/M for OpenAI: GPT-5.5.

Which model is better for coding?

Both models perform similarly on coding benchmarks (SWE-Bench, Aider Polyglot, LiveCodeBench).