Model Profile
qwen-2.5-coder32b-instruct
Use this page to decide where this model is a strong fit. The rankings below are backed by benchmarks and organized by use case, with explicit confidence and contributor metrics.
Identity
ID: external/qwen/qwen-2-5-coder32b-instruct
Author: qwen
Origin: external_benchmark_shadow
Arch: unknown
Benchmark Coverage
Scored use cases: 12
Avg confidence: 34.4%
Evidence points: 188
Raw rows: 93
Weighted rows: 22
Catalog Metadata
Parameters: unknown
Context window: 4096
Downloads: 0
Intelligence Profile
Dimension Breakdown
* Low confidence — limited benchmark evidence for this dimension
5/5 dimensions scored · Last updated Apr 30, 2026
Benchmark Signals
| Benchmark | Metric | Normalized value | Confidence | Strongest impact | Source key | Date |
|---|---|---|---|---|---|---|
| DuckDB NSQL Leaderboard | all_execution_accuracy | 82.7% | 100.0% | Metric definition workshop | duckdb_nsql_leaderboard.all_execution_accuracy | Apr 30, 2026 |
| Open LLM Leaderboard MMLU-Pro | mmlu_pro_accuracy_pct | 54.2% | 100.0% | Data quality assistant | openllm_mmlu_pro_official.mmlu_pro_accuracy_pct | Apr 30, 2026 |
| Open LLM Leaderboard GPQA | gpqa | 44.9% | 100.0% | Data quality assistant | openllm_gpqa_official.gpqa | Apr 30, 2026 |
| BigCode Models Leaderboard | average_score | 100.0% | 100.0% | IDE code completion | bigcode_models_leaderboard.average_score | Apr 29, 2026 |
| BigCodeBench Official | bigcodebench_complete_pct | 91.9% | 100.0% | IDE code completion | bigcodebench_official.bigcodebench_complete_pct | Apr 29, 2026 |
| BigCodeBench Official | bigcodebench_instruct_pct | 95.2% | 100.0% | IDE code completion | bigcodebench_official.bigcodebench_instruct_pct | Apr 29, 2026 |
Coverage Diagnostics
Use-Case Scores (actively scored): 149
Total Measurements: 93
Weighted Measurements: 22
Weighted Sources: 15
Raw Source Coverage
Weighted Source Coverage
Best Use Cases for This Model
| Use Case | ID | Score |
|---|---|---|
| Integration test generation | use_case.dev.integration_tests | 20.4% |
| Simulation setup assistant | use_case.eng.simulation_setup_assistant | 19.9% |
| Verilog/VHDL generation | use_case.eda.verilog_generation | 19.8% |
| IDE code completion | use_case.dev.ide_completion | 17.8% |
| CAD scripting helper | use_case.eng.cad_scripting_helper | 16.1% |
| Metric definition workshop | use_case.data.metric_definition_workshop | 16.0% |
| Data quality assistant | use_case.data.data_quality_assistant | 15.8% |
| Release notes drafting | use_case.dev.release_notes | 15.7% |
| Documentation from code | use_case.dev.docstrings_and_docs | 15.6% |
| Refactoring assistant | use_case.dev.refactoring | 15.3% |
| Code review assistant | use_case.dev.code_review_assistant | 15.2% |
| Unit test generation | use_case.dev.test_generation | 14.7% |
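
The use-case scores above can be consumed programmatically when comparing fit across areas. The following is a minimal sketch, not part of the profile itself: the scores are copied from the table, while the helper names (`domain_of`, `mean_score_by_domain`, `top_use_cases`) are hypothetical, and treating the second dot-separated segment of each ID (`dev`, `eng`, `eda`, `data`) as a domain is an assumption based only on the ID pattern.

```python
from collections import defaultdict
from statistics import mean

# Scores (in percent) copied from the use-case table above, keyed by use-case ID.
USE_CASE_SCORES = {
    "use_case.dev.integration_tests": 20.4,
    "use_case.eng.simulation_setup_assistant": 19.9,
    "use_case.eda.verilog_generation": 19.8,
    "use_case.dev.ide_completion": 17.8,
    "use_case.eng.cad_scripting_helper": 16.1,
    "use_case.data.metric_definition_workshop": 16.0,
    "use_case.data.data_quality_assistant": 15.8,
    "use_case.dev.release_notes": 15.7,
    "use_case.dev.docstrings_and_docs": 15.6,
    "use_case.dev.refactoring": 15.3,
    "use_case.dev.code_review_assistant": 15.2,
    "use_case.dev.test_generation": 14.7,
}


def domain_of(use_case_id: str) -> str:
    """Return the second dot-separated segment of an ID, e.g. 'dev'.

    Assumption: this segment names a domain; the profile does not say so.
    """
    return use_case_id.split(".")[1]


def mean_score_by_domain(scores: dict[str, float]) -> dict[str, float]:
    """Average the scores of all use cases that share a domain segment."""
    grouped: dict[str, list[float]] = defaultdict(list)
    for use_case_id, score in scores.items():
        grouped[domain_of(use_case_id)].append(score)
    return {domain: round(mean(values), 2) for domain, values in grouped.items()}


def top_use_cases(scores: dict[str, float], n: int = 3) -> list[str]:
    """Return the IDs of the n highest-scoring use cases, best first."""
    return sorted(scores, key=scores.get, reverse=True)[:n]


if __name__ == "__main__":
    print(top_use_cases(USE_CASE_SCORES))
    print(mean_score_by_domain(USE_CASE_SCORES))
```

Grouping by ID segment is only one way to slice the table. Because the profile reports a low average confidence (34.4%), small differences between these averages should probably not be read as meaningful.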