live
weekly refresh
basedagi.org
▸ compare

Anthropic: Claude Opus 4.6 vs OpenAI: GPT-5.4 Mini

Side-by-side benchmark comparison based on independent public data. Scores, pricing, context, and task breakdown.

▸ verdict
Anthropic: Claude Opus 4.6
96.6
higher score
vs
OpenAI: GPT-5.4 Mini
84.9
Anthropic: Claude Opus 4.6 leads with a higher composite benchmark score. It's the stronger choice for general-purpose tasks based on public benchmark data.
▸ score breakdown
TaskAnthropic: Claude Opus 4.6OpenAI: GPT-5.4 MiniΔ
Overall96.684.9+11.7
Coding71.2
Reasoning
Math
Writing
JSON
▸ specs & pricing
AttributeAnthropic: Claude Opus 4.6OpenAI: GPT-5.4 Mini
ProviderAnthropicOpenai
Context window1M400K
Input $/M tokens$5.00/M$0.75/M
Output $/M tokens$25.00/M$4.50/M
Weightsproprietaryproprietary
▸ frequently asked

Is Anthropic: Claude Opus 4.6 better than OpenAI: GPT-5.4 Mini?

Anthropic: Claude Opus 4.6 scores higher overall (96.6 vs 84.9) in the benchmark composite. The best choice depends on the specific use case and budget.

Which is cheaper: Anthropic: Claude Opus 4.6 or OpenAI: GPT-5.4 Mini?

OpenAI: GPT-5.4 Mini is cheaper at $0.75/M input tokens vs $5/M for Anthropic: Claude Opus 4.6.

Which model is better for coding?

Both models perform similarly on coding benchmarks (SWE-Bench, Aider Polyglot, LiveCodeBench).