BasedAGIBasedAGI

Model Profile

Claude-3.5-Sonnet

External Benchmark Shadowexternal_benchmark_shadowpublic
4,096 ctx

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: external/anthropic/claude-3-5-sonnet

Author: anthropic

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 29.4%

Evidence points: 175

Raw rows: 140

Weighted rows: 28

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Price / 1M tokens: $6.00 (blended 3:1)

Intelligence Profile

IQ *49%EQ *48%Accuracy *66%CreativityBased

Dimension Breakdown

IQ5 benchmarks
49.5%*
EQ3 benchmarks
48.4%*
Accuracy3 benchmarks
66.5%*
Creativity0 benchmarks

No creativity benchmarks found

Insufficient data
Based0 benchmarks

No based benchmarks found

Insufficient data

* Low confidence — limited benchmark evidence for this dimension

3/5 dimensions scored · Last updated Apr 14, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

Some fit rows have limited benchmark evidence.

1 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

100

Total Measurements

140

Weighted Measurements

28

Weighted Sources

15

Raw Source Coverage

multilingual_mmlu_leaderboard 17mmlu_pro_leaderboard 15duckdb_nsql_leaderboard 12llm_aggrefact_leaderboard 12medhelm_leaderboard 12browsergym_leaderboard 10

Weighted Source Coverage

crmarena_leaderboard 4medhelm_leaderboard 4languagebench 3languagebench_translation_official 3duckdb_nsql_leaderboard 2llm_aggrefact_leaderboard 2

Best Use Cases for This Model

Use CaseScore
Archaic and historical translation

use_case.history.archaic_translation

27.2%
Brand voice localization

use_case.mkt.brand_voice_localization

25.1%
Legal translation

use_case.legal.legal_translation

21.8%
Patient-friendly explanations

use_case.health.patient_friendly_summaries

21.0%
Grammar and writing coach

use_case.lang.grammar_coach

20.6%
Translation and localization

use_case.business.translation_localization

20.1%
Multilingual Customer Support

use_case.cx.multilingual_support

19.8%
Historical document summarization

use_case.history.historical_doc_summarization

19.4%
Language conversation partner

use_case.lang.conversation_partner

18.9%
Executive brief from metrics

use_case.data.exec_brief_from_metrics

18.6%
Patient education bot (RAG grounded)

use_case.health.patient_education_bot

17.6%
Tail spend categorization

use_case.proc.tail_spend_categorization

17.3%