BasedAGIBasedAGI

Model Profile

microsoft/wizardlm-2-8x22b

External Benchmark Shadowexternal_benchmark_shadowpublic
4,096 ctx

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: external/microsoft/wizardlm-2-8x22b

Author: microsoft

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 27.6%

Evidence points: 87

Raw rows: 90

Weighted rows: 11

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Intelligence Profile

IQ57%EQ *72%Accuracy *59%Creativity45%Based *53%

Dimension Breakdown

IQ7 benchmarks
56.6%
EQ5 benchmarks
71.8%*
Accuracy2 benchmarks
58.6%*
Creativity7 benchmarks
45.2%
Based3 benchmarks
53.1%*

* Low confidence — limited benchmark evidence for this dimension

5/5 dimensions scored · Last updated Apr 30, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

Some fit rows have limited benchmark evidence.

5 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

124

Total Measurements

90

Weighted Measurements

11

Weighted Sources

8

Raw Source Coverage

ugi_main 60mmlu_pro_leaderboard 15open_llm_leaderboard_results 5openrouter_models 3aider_code_editing 2eq_bench 1

Weighted Source Coverage

ugi_main 3aider_code_editing 2eq_bench 1open_llm_leaderboard_results 1openllm_bbh_official 1openllm_gpqa_official 1

Best Use Cases for This Model

Use CaseScore
Product positioning and messaging

use_case.mkt.product_positioning

18.6%
Campaign brief

use_case.mkt.campaign_brief

18.6%
Social post generation

use_case.mkt.social_post_generation

18.6%
Ad copy variants

use_case.mkt.ad_copy_variants

17.9%
Personalized sales outreach

use_case.mkt.sales_outreach_personalized

17.9%
Screenplay scene writing

use_case.creative.screenplay_scene

15.9%
Poetry and lyrics

use_case.creative.poetry_lyrics

15.9%
Brand voice localization

use_case.mkt.brand_voice_localization

14.8%
Overrefusal (eval)

use_case.security.overrefusal_eval

13.8%
Scam and social engineering resistance (eval)

use_case.security.scam_social_engineering_resistance_eval

13.8%
Crisis escalation protocol (eval)

use_case.safety.crisis_escalation_protocol

13.8%
Jailbreak resistance (eval)

use_case.security.jailbreak_resistance_eval

13.8%