BasedAGIBasedAGI

Model Profile

anthropic/claude-opus-4

External Benchmark Shadowexternal_benchmark_shadowpublic
4,096 ctx

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: external/anthropic/claude-opus-4

Author: anthropic

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 21.4%

Evidence points: 135

Raw rows: 361

Weighted rows: 26

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Price / 1M tokens: $10.00 (blended 3:1)

Intelligence Profile

IQ71%EQ *91%Accuracy *75%Creativity *71%Based *12%

Dimension Breakdown

IQ12 benchmarks
71.3%
EQ1 benchmark
91.2%*
Accuracy2 benchmarks
74.7%*
Creativity2 benchmarks
70.6%*
Based1 benchmark
12.0%*

* Low confidence — limited benchmark evidence for this dimension

5/5 dimensions scored · Last updated Apr 14, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

Some fit rows have limited benchmark evidence.

12 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

103

Total Measurements

361

Weighted Measurements

26

Weighted Sources

12

Raw Source Coverage

vals_mmlu_pro 60ugi_main 57vals_mgsm 48vals_medqa 28vectara_hhem_leaderboard 21vals_legal_bench 18

Weighted Source Coverage

vectara_hhem_leaderboard 12ugi_main 3aider_polyglot 2eq_bench 1hle_leaderboard 1swebench_verified_official 1

Best Use Cases for This Model

Use CaseScore
Verilog/VHDL generation

use_case.eda.verilog_generation

20.3%
Simulation setup assistant

use_case.eng.simulation_setup_assistant

18.2%
Social post generation

use_case.mkt.social_post_generation

18.0%
Product positioning and messaging

use_case.mkt.product_positioning

18.0%
Campaign brief

use_case.mkt.campaign_brief

18.0%
Integration test generation

use_case.dev.integration_tests

17.5%
Ad copy variants

use_case.mkt.ad_copy_variants

17.0%
Personalized sales outreach

use_case.mkt.sales_outreach_personalized

17.0%
Refactoring assistant

use_case.dev.refactoring

16.5%
Terraform generation

use_case.sre.iac_terraform

16.3%
Kubernetes manifest generation

use_case.sre.iac_k8s

16.3%
Config debugging

use_case.sre.config_debugging

16.3%