Model Rankings

Updated Apr 15, 2026, 06:29 PM UTC

Ranked by composite utility score across benchmark-backed use cases, with the public default now preferring broader, higher-confidence model profiles.

Inclusion in the ranking requires at least 4 scored dimensions and 20% average confidence. Models with confidence below 25% are shown with an amber confidence indicator; treat those rankings as provisional. Use Full Profiles Only for the strictest view.
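The thresholds above can be sketched as a small filter. This is only an illustration: the `Profile` type, the function names, and the choice to treat the 20% floor as inclusive and the 25% amber cutoff as strict are assumptions, not the site's actual implementation.

```python
from dataclasses import dataclass

# Thresholds quoted from the text; whether the comparisons are inclusive is an assumption.
MIN_DIMENSIONS = 4          # "at least 4 scored dimensions"
MIN_AVG_CONFIDENCE = 0.20   # "20% average confidence"
AMBER_BELOW = 0.25          # "confidence below 25%" gets the amber indicator


@dataclass
class Profile:
    """Hypothetical record for one model; field names are illustrative."""
    name: str
    dimensions_scored: int
    avg_confidence: float  # fraction in [0, 1]


def is_listed(p: Profile) -> bool:
    """True if the model meets both minimum requirements."""
    return (p.dimensions_scored >= MIN_DIMENSIONS
            and p.avg_confidence >= MIN_AVG_CONFIDENCE)


def is_provisional(p: Profile) -> bool:
    """True if the model would carry the amber confidence indicator."""
    return p.avg_confidence < AMBER_BELOW


if __name__ == "__main__":
    for p in (Profile("model-a", 5, 0.266), Profile("model-b", 4, 0.22)):
        print(p.name, is_listed(p), is_provisional(p))
```

Under these assumptions a model at 22% confidence is listed but flagged as provisional, while one at 26.6% is listed without the flag.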

How scoring works

Utility Score = Σ(use-case score × confidence) / Σ(confidence) across all scored use cases. Public ordering prefers full profiles first, then confidence-adjusted utility, then breadth.
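As a rough sketch, the formula is a confidence-weighted mean of use-case scores, and the ordering can be expressed as a composite sort key. The function names and dictionary fields below are hypothetical, and two readings are assumed: "confidence-adjusted utility" is taken to be the Utility Score itself, and "breadth" is taken to be the number of scored use cases.

```python
from typing import Iterable, Optional, Tuple


def utility_score(cases: Iterable[Tuple[float, float]]) -> Optional[float]:
    """Confidence-weighted mean of (score, confidence) pairs.

    Returns None when no confidence mass is available, since the
    formula is undefined in that case.
    """
    pairs = list(cases)
    total_confidence = sum(conf for _, conf in pairs)
    if total_confidence == 0:
        return None
    return sum(score * conf for score, conf in pairs) / total_confidence


def ordering_key(model: dict) -> tuple:
    """Sort key: full profiles first, then higher utility, then more cases.

    The field names ("full_profile", "utility", "cases") are assumptions.
    """
    return (not model["full_profile"], -model["utility"], -model["cases"])


if __name__ == "__main__":
    # Two use cases: a confident high score and a low-confidence low score.
    print(utility_score([(0.9, 0.5), (0.3, 0.1)]))

    models = [
        {"name": "m1", "full_profile": False, "utility": 0.40, "cases": 30},
        {"name": "m2", "full_profile": True, "utility": 0.25, "cases": 151},
        {"name": "m3", "full_profile": True, "utility": 0.31, "cases": 26},
    ]
    print([m["name"] for m in sorted(models, key=ordering_key)])
```

In the first example the low-confidence use case pulls the result down only slightly, because its weight is one fifth of the other's. In the second, a partial profile with the highest utility still sorts after both full profiles.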


Trusted Profiles

Models with all 5 dimensions scored and eligible for the stable overall ranking.

Emerging Profiles

High-potential models with 4/5 dimensions. Visible separately until their profiles fill in.
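The split between the two groups depends only on how many of the five dimensions have a score. A minimal sketch follows, assuming a missing dimension is represented as `None`; the function name and the "unlisted" label are hypothetical, and the confidence requirement is left out here.

```python
from typing import Dict, Optional

# The five dimensions named on this page.
DIMENSIONS = ("IQ", "EQ", "Accuracy", "Creativity", "Based")


def profile_tier(scores: Dict[str, Optional[float]]) -> str:
    """Classify a model by how many of the five dimensions are scored."""
    scored = sum(1 for d in DIMENSIONS if scores.get(d) is not None)
    if scored == len(DIMENSIONS):
        return "trusted"    # all 5 dimensions scored
    if scored == len(DIMENSIONS) - 1:
        return "emerging"   # 4/5 dimensions scored
    return "unlisted"       # hypothetical label for anything below 4


if __name__ == "__main__":
    # Dimension values taken from the GLM-4.6 row of the trusted table.
    glm = {"IQ": 61.3, "EQ": 93.3, "Accuracy": 83.8,
           "Creativity": 77.1, "Based": 53.0}
    print(profile_tier(glm))
    print(profile_tier({**glm, "Based": None}))
```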

Trusted Avg Confidence

28.6%

Average confidence across models currently eligible for the trusted overall table.
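The figure is a plain mean of the Conf. column over trusted models. The sketch below uses only the three rows visible in the trusted table on this page; the ranks shown there are not contiguous, so the full table may contain more rows, and the match with the displayed value should not be read as confirmation of the site's method. The function name is hypothetical.

```python
from typing import Iterable


def average_confidence(confidences_pct: Iterable[float]) -> float:
    """Arithmetic mean of confidence values given in percent."""
    values = list(confidences_pct)
    if not values:
        raise ValueError("no trusted models to average")
    return sum(values) / len(values)


# Conf. values copied from the rows visible in the trusted table.
visible_rows = {
    "GLM-4.6": 26.6,
    "gpt-5-2025-08-07": 32.3,
    "o3-20250416": 27.0,
}

trusted_avg = average_confidence(visible_rows.values())

if __name__ == "__main__":
    print(f"{trusted_avg:.1f}%")  # prints 28.6%
```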

0 use cases changed their #1 model since the last update · 20,080 model-use case pairings scored

Trusted overall ranking · full 5-dimension profiles

Rank	Model	Utility	IQ	EQ	Accuracy	Creativity	Based	Cases	Conf.	Price/1M	Profile
🥈	GLM-4.6 (Open, zai-org/GLM-4.6)	31.1%	61.3%	93.3%	83.8%	77.1%	53.0%	26	26.6%	—	Full profile
🥉	gpt-5-2025-08-07 (API, external/openai/gpt-5-2025-08-07)	25.5%	76.7%	97.8%	87.9%	80.1%	47.0%	151	32.3%	—	Full profile
#8	o3-20250416 (API, external/openai/o3-20250416)	21.2%	81.4%	56.8%	67.7%	80.8%	65.0%	151	27.0%	—	Full profile

Dimensions: IQ (reasoning & logic) · EQ (social & emotional) · Accuracy (factual precision) · Creativity (creative output) · Based (direct & uncensored)