Model Profile
GLM-5.1
Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.
Identity
ID: zai-org/GLM-5.1
Author: zai-org
Origin: huggingface_catalog
Arch: unknown
Benchmark Coverage
Scored use cases: 7
Avg confidence: 16.4%
Evidence points: 83
Raw rows: 100
Weighted rows: 15
Catalog Metadata
Parameters: unknown
Context window: 4096
Downloads: 256,484
Intelligence Profile
Dimension Breakdown
No eq benchmarks found
* Low confidence — limited benchmark evidence for this dimension
4/5 dimensions scored · Last updated Apr 29, 2026
Benchmark Signals
Click through to the benchmark source behind this model profile.
OpenHands Issue Resolution
issue_resolution_score_pct
Normalized value 87.8% · confidence 100.0%
Strongest impact in Agentic bug fixing
openhands_issue_resolution.issue_resolution_score_pct · Apr 29, 2026
OpenHands Index
average_score_pct
Normalized value 65.0% · confidence 100.0%
Strongest impact in Autonomous Coding Agent
openhands_index.average_score_pct · Apr 29, 2026
OpenHands Index
issue_resolution_score_pct
Normalized value 87.8% · confidence 100.0%
Strongest impact in CAD scripting helper
openhands_index.issue_resolution_score_pct · Apr 29, 2026
OpenHands Index
testing_score_pct
Normalized value 69.8% · confidence 100.0%
Strongest impact in IDE code completion
openhands_index.testing_score_pct · Apr 29, 2026
OpenHands Index
greenfield_score_pct
Normalized value 50.0% · confidence 100.0%
Strongest impact in IDE code completion
openhands_index.greenfield_score_pct · Apr 29, 2026
OpenHands Index
information_gathering_score_pct
Normalized value 78.1% · confidence 100.0%
Strongest impact in Function Calling / Tool Use Agent
openhands_index.information_gathering_score_pct · Apr 29, 2026
Some fit rows have limited benchmark evidence.
7 of 7 scored use cases have low confidence or thin contributor coverage.
Coverage Diagnostics
actively scoredUse-Case Scores
7
Total Measurements
100
Weighted Measurements
15
Weighted Sources
8
Raw Source Coverage
Weighted Source Coverage
Best Use Cases for This Model
| Use Case | Score |
|---|---|
| PR review agent use_case.dev.pr_review_agent | 12.7% |
| Autonomous Coding Agent use_case.dev.autonomous_coding_agent | 12.6% |
| Agentic bug fixing use_case.dev.agentic_bug_fixing | 12.5% |
| CAD scripting helper use_case.eng.cad_scripting_helper | 12.2% |
| IDE code completion use_case.dev.ide_completion | 12.0% |
| Code generation use_case.dev.code_generation | 11.8% |
| Function Calling / Tool Use Agent use_case.dev.function_calling_agent | 11.1% |