BasedAGIBasedAGI

Model Profile

anthropic/claude-sonnet-4

External Benchmark Shadowexternal_benchmark_shadowpublic
4,096 ctx

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: external/anthropic/claude-sonnet-4

Author: anthropic

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 44.9%

Evidence points: 294

Raw rows: 526

Weighted rows: 63

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Price / 1M tokens: $6.00 (blended 3:1)

Intelligence Profile

IQ58%EQ *90%Accuracy *68%Creativity *73%Based *6%

Dimension Breakdown

IQ18 benchmarks
57.6%
EQ1 benchmark
90.4%*
Accuracy3 benchmarks
68.2%*
Creativity2 benchmarks
72.8%*
Based1 benchmark
6.0%*

* Low confidence — limited benchmark evidence for this dimension

5/5 dimensions scored · Last updated Apr 14, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

Coverage Diagnostics

actively scored

Use-Case Scores

151

Total Measurements

526

Weighted Measurements

63

Weighted Sources

28

Raw Source Coverage

vals_mmlu_pro 60ugi_main 57vals_mgsm 48galileo_agent_v2 34corpfin_taxeval_public 28vals_medqa 28

Weighted Source Coverage

vectara_hhem_leaderboard 12galileo_agent_v2 10sonar_java_quality 4facts_benchmark_suite 3languagebench 3languagebench_translation_official 3

Best Use Cases for This Model

Use CaseScore
Terraform generation

use_case.sre.iac_terraform

36.7%
Kubernetes manifest generation

use_case.sre.iac_k8s

36.7%
Config debugging

use_case.sre.config_debugging

36.7%
Campaign brief

use_case.mkt.campaign_brief

34.3%
Social post generation

use_case.mkt.social_post_generation

34.3%
Product positioning and messaging

use_case.mkt.product_positioning

34.3%
Archaic and historical translation

use_case.history.archaic_translation

34.1%
Legal translation

use_case.legal.legal_translation

33.3%
Verilog/VHDL generation

use_case.eda.verilog_generation

32.7%
Personalized sales outreach

use_case.mkt.sales_outreach_personalized

32.4%
Ad copy variants

use_case.mkt.ad_copy_variants

32.4%
Brand voice localization

use_case.mkt.brand_voice_localization

32.1%