BasedAGI

Model Profile

Steelskull/L3.3-MS-Nevoria-70b

External Benchmark Shadow · external_benchmark_shadow · public
4,096 ctx

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: external/steelskull/l3-3-ms-nevoria-70b

Author: steelskull

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 24.5%

Evidence points: 63

Raw rows: 69

Weighted rows: 8

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Intelligence Profile

IQ 82%* · EQ 76%* · Accuracy 77%* · Creativity 59% · Based 72%*

Dimension Breakdown

IQ (6 benchmarks): 82.4%*
EQ (4 benchmarks): 75.9%*
Accuracy (2 benchmarks): 77.4%*
Creativity (6 benchmarks): 59.4%
Based (3 benchmarks): 72.4%*

* Low confidence — limited benchmark evidence for this dimension

5/5 dimensions scored · Last updated Apr 30, 2026
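
The headline percentages in the Intelligence Profile appear to be the Dimension Breakdown scores rounded to whole numbers, with the low-confidence asterisk carried over. The following is a minimal sketch of that relationship, not the site's actual code: the `Dimension` type, the `BREAKDOWN` table, and the `headline` helper are hypothetical names introduced here, and the rounding rule is an assumption inferred from the numbers on this page.

```python
from typing import NamedTuple


class Dimension(NamedTuple):
    score: float          # percentage from the Dimension Breakdown
    benchmarks: int       # number of contributing benchmarks
    low_confidence: bool  # True where the page marks the score with "*"


# Values copied from the Dimension Breakdown on this page.
BREAKDOWN = {
    "IQ": Dimension(82.4, 6, True),
    "EQ": Dimension(75.9, 4, True),
    "Accuracy": Dimension(77.4, 2, True),
    "Creativity": Dimension(59.4, 6, False),
    "Based": Dimension(72.4, 3, True),
}


def headline(dims: dict[str, Dimension]) -> dict[str, str]:
    """Round each score to a whole percent and keep the low-confidence marker.

    Assumption: the page rounds to the nearest integer; this is inferred
    from the displayed values, not documented by the source.
    """
    return {
        name: f"{round(d.score)}%" + ("*" if d.low_confidence else "")
        for name, d in dims.items()
    }


if __name__ == "__main__":
    for name, label in headline(BREAKDOWN).items():
        print(f"{name}: {label}")
```

Note that the low-confidence flag is stored explicitly rather than derived from the benchmark count: IQ and Creativity both list 6 benchmarks, yet only IQ is flagged, so the count alone does not determine the marker.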

Benchmark Signals


Some fit rows have limited benchmark evidence.

7 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores: 111

Total Measurements: 69

Weighted Measurements: 8

Weighted Sources: 6

Raw Source Coverage

ugi_main: 60
open_llm_leaderboard_results: 5
openllm_bbh_official: 1
openllm_gpqa_official: 1
openllm_ifeval_official: 1
openllm_mmlu_pro_official: 1

Weighted Source Coverage

ugi_main: 3
open_llm_leaderboard_results: 1
openllm_bbh_official: 1
openllm_gpqa_official: 1
openllm_ifeval_official: 1
openllm_mmlu_pro_official: 1
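
The per-source counts above can be cross-checked against the totals reported under Benchmark Coverage (69 raw rows, 8 weighted rows, 6 weighted sources). The sketch below does that arithmetic in Python. The dictionary names and the `coverage_summary` helper are hypothetical, and the "weighted share" it computes is an illustrative derived figure, not a metric published on this page.

```python
# Row counts per source, copied from the coverage lists on this page.
RAW_ROWS = {
    "ugi_main": 60,
    "open_llm_leaderboard_results": 5,
    "openllm_bbh_official": 1,
    "openllm_gpqa_official": 1,
    "openllm_ifeval_official": 1,
    "openllm_mmlu_pro_official": 1,
}

WEIGHTED_ROWS = {
    "ugi_main": 3,
    "open_llm_leaderboard_results": 1,
    "openllm_bbh_official": 1,
    "openllm_gpqa_official": 1,
    "openllm_ifeval_official": 1,
    "openllm_mmlu_pro_official": 1,
}


def coverage_summary(raw: dict[str, int], weighted: dict[str, int]):
    """Return (total raw rows, total weighted rows, weighted share per source).

    The share is weighted rows divided by raw rows for each source; it is
    a derived illustration only.
    """
    share = {name: weighted.get(name, 0) / count for name, count in raw.items()}
    return sum(raw.values()), sum(weighted.values()), share


if __name__ == "__main__":
    total_raw, total_weighted, share = coverage_summary(RAW_ROWS, WEIGHTED_ROWS)
    print(f"raw rows: {total_raw}, weighted rows: {total_weighted}")
    for name, value in share.items():
        print(f"{name}: {value:.0%} of raw rows carry weight")
```

Under these numbers, the totals agree with the page, and most of the raw evidence comes from `ugi_main` even though only 3 of its 60 rows are weighted.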

Best Use Cases for This Model

Use Case (ID): Score

Poetry and lyrics (use_case.creative.poetry_lyrics): 21.4%
Screenplay scene writing (use_case.creative.screenplay_scene): 21.4%
Social post generation (use_case.mkt.social_post_generation): 20.6%
Campaign brief (use_case.mkt.campaign_brief): 20.6%
Product positioning and messaging (use_case.mkt.product_positioning): 20.6%
Ad copy variants (use_case.mkt.ad_copy_variants): 19.8%
Personalized sales outreach (use_case.mkt.sales_outreach_personalized): 19.8%
Long-form story co-author (use_case.creative.longform_story): 18.5%
Crisis escalation protocol (eval) (use_case.safety.crisis_escalation_protocol): 18.0%
Refusal profile (eval) (use_case.security.refusal_profile_eval): 18.0%
Jailbreak resistance (eval) (use_case.security.jailbreak_resistance_eval): 18.0%
Overrefusal (eval) (use_case.security.overrefusal_eval): 18.0%