Model Profile

GLM-5.1

Name: GLM-5.1
Rating: 1.3 (83 reviews)
Author: zai-org

4,096 ctxOpen weights

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: zai-org/GLM-5.1

Author: zai-org

Origin: huggingface_catalog

Arch: unknown

Benchmark Coverage

Scored use cases: 7

Avg confidence: 16.4%

Evidence points: 83

Raw rows: 100

Weighted rows: 15

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 256,484

Intelligence Profile

Dimension Breakdown

IQ3 benchmarks

68.2%*

EQ0 benchmarks

No eq benchmarks found

Insufficient data

Accuracy1 benchmark

76.2%*

Creativity2 benchmarks

89.0%*

Based1 benchmark

82.0%*

* Low confidence — limited benchmark evidence for this dimension

4/5 dimensions scored · Last updated Apr 29, 2026

Benchmark Signals

Click through to the benchmark source behind this model profile.

OpenHands Issue Resolution

issue_resolution_score_pct

3.4%

Normalized value 87.8% · confidence 100.0%

Strongest impact in Agentic bug fixing

openhands_issue_resolution.issue_resolution_score_pct · Apr 29, 2026

OpenHands Index

average_score_pct

2.4%

Normalized value 65.0% · confidence 100.0%

Strongest impact in Autonomous Coding Agent

openhands_index.average_score_pct · Apr 29, 2026

OpenHands Index

issue_resolution_score_pct

2.4%

Normalized value 87.8% · confidence 100.0%

Strongest impact in CAD scripting helper

openhands_index.issue_resolution_score_pct · Apr 29, 2026

OpenHands Index

testing_score_pct

1.0%

Normalized value 69.8% · confidence 100.0%

Strongest impact in IDE code completion

openhands_index.testing_score_pct · Apr 29, 2026

OpenHands Index

greenfield_score_pct

1.0%

Normalized value 50.0% · confidence 100.0%

Strongest impact in IDE code completion

openhands_index.greenfield_score_pct · Apr 29, 2026

OpenHands Index

information_gathering_score_pct

1.0%

Normalized value 78.1% · confidence 100.0%

Strongest impact in Function Calling / Tool Use Agent

openhands_index.information_gathering_score_pct · Apr 29, 2026

Some fit rows have limited benchmark evidence.

7 of 7 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

Total Measurements

100

Weighted Measurements

Weighted Sources

Raw Source Coverage

ugi_main 60openhands_index 13swe_rebench_leaderboard 7livebench_official 6openhands_issue_resolution 4matharena_models 2

Weighted Source Coverage

openhands_index 5ugi_main 3openhands_issue_resolution 2matharena_models 1openhands_frontend 1openhands_greenfield 1

Best Use Cases for This Model

Use Case	Vertical	Score	Confidence	Evidence	Top Contributor
PR review agent use_case.dev.pr_review_agent	developer_tools	12.7%	16.6%	12	OpenHands Issue Resolution: issue_resolution_score_pct
Autonomous Coding Agent use_case.dev.autonomous_coding_agent	developer_tools	12.6%	17.2%	12	OpenHands Issue Resolution: issue_resolution_score_pct
Agentic bug fixing use_case.dev.agentic_bug_fixing	developer_tools	12.5%	16.4%	12	OpenHands Issue Resolution: issue_resolution_score_pct
CAD scripting helper use_case.eng.cad_scripting_helper	engineering	12.2%	16.4%	11	OpenHands Issue Resolution: issue_resolution_score_pct
IDE code completion use_case.dev.ide_completion	developer_tools	12.0%	17.0%	12	OpenHands Issue Resolution: issue_resolution_score_pct
Code generation use_case.dev.code_generation	developer_tools	11.8%	16.0%	12	OpenHands Issue Resolution: issue_resolution_score_pct
Function Calling / Tool Use Agent use_case.dev.function_calling_agent	developer_tools	11.1%	15.0%	12	OpenHands Issue Resolution: issue_resolution_score_pct