BasedAGI

Model Profile

mistralai/Mistral-Large-Instruct-2411

External Benchmark Shadow (external_benchmark_shadow) · public · 4,096 ctx

Use this page to decide where this model is a strong fit. The rankings below are grouped by use case and backed by benchmark results, with explicit confidence and contributor metrics.

Identity

ID: external/mistralai/mistral-large-instruct-2411

Author: mistralai

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 27.5%

Evidence points: 88

Raw rows: 94

Weighted rows: 11

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Intelligence Profile

IQ 75% · EQ 79%* · Accuracy 93%* · Creativity 60% · Based 78%*

Dimension Breakdown

IQ (7 benchmarks): 74.9%
EQ (5 benchmarks): 78.5%*
Accuracy (2 benchmarks): 93.4%*
Creativity (7 benchmarks): 60.3%
Based (3 benchmarks): 77.6%*

* Low confidence — limited benchmark evidence for this dimension

5/5 dimensions scored · Last updated Apr 30, 2026

Benchmark Signals

Some fit rows have limited benchmark evidence.

4 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores: 118

Total Measurements: 94

Weighted Measurements: 11

Weighted Sources: 8

Raw Source Coverage

ugi_main 60 · mmlu_pro_leaderboard 15 · bridge_medical_leaderboard 9 · open_llm_leaderboard_results 5 · eq_bench 1 · openllm_bbh_official 1

Weighted Source Coverage

ugi_main 3 · bridge_medical_leaderboard 2 · eq_bench 1 · open_llm_leaderboard_results 1 · openllm_bbh_official 1 · openllm_gpqa_official 1

Best Use Cases for This Model

Use Case · ID · Score
Product positioning and messaging · use_case.mkt.product_positioning · 22.9%
Campaign brief · use_case.mkt.campaign_brief · 22.9%
Social post generation · use_case.mkt.social_post_generation · 22.9%
Ad copy variants · use_case.mkt.ad_copy_variants · 21.9%
Personalized sales outreach · use_case.mkt.sales_outreach_personalized · 21.9%
Screenplay scene writing · use_case.creative.screenplay_scene · 21.2%
Poetry and lyrics · use_case.creative.poetry_lyrics · 21.2%
Brand voice localization · use_case.mkt.brand_voice_localization · 18.6%
Long-form story co-author · use_case.creative.longform_story · 18.4%
Crisis escalation protocol (eval) · use_case.safety.crisis_escalation_protocol · 18.4%
Jailbreak resistance (eval) · use_case.security.jailbreak_resistance_eval · 18.4%
Scam and social engineering resistance (eval) · use_case.security.scam_social_engineering_resistance_eval · 18.4%