Model Rankings
Updated Apr 15, 2026, 06:29 PM UTCRanked by composite utility score across benchmark-backed use cases, with the public default now preferring broader, higher-confidence model profiles.
Requires at least 4 scored dimensions and 20% average confidence. Models with confidence below 25% are shown with an amber confidence indicator — treat those rankings as provisional. Use Full Profiles Only for the strictest view.
How scoring works
Utility Score = Σ(use-case score × confidence) / Σ(confidence) across all scored use cases. Public ordering prefers full profiles first, then confidence-adjusted utility, then breadth.
Trusted Profiles
3
Models with all 5 dimensions scored and eligible for the stable overall ranking.
Emerging Profiles
0
High-potential models with 4/5 dimensions. Visible separately until their profiles fill in.
Trusted Avg Confidence
28.6%
Average confidence across models currently eligible for the trusted overall table.
0 use cases changed their #1 model since last update·20,080 model-use case pairings scored
Trusted overall ranking · full 5-dimension profiles
| Rank | Model | Utility |
|---|---|---|
| 🥈 | Open zai-org/GLM-4.6 Full profile | 31.1% |
| 🥉 | external/openai/gpt-5-2025-08-07 Full profile | 25.5% |
| #8 | API external/openai/o3-20250416 Full profile | 21.2% |
Dimensions:IQ—Reasoning & logicEQ—Social & emotionalAccuracy—Factual precisionCreativity—Creative outputBased—Direct & uncensored