Model Profile

openai/gpt-5

Name: openai/gpt-5
Author: openai

External Benchmark Shadowexternal_benchmark_shadowinternal

4,096 ctx

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: external/openai/gpt-5

Author: openai

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 16.5%

Evidence points: 134

Raw rows: 113

Weighted rows: 38

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Some fit rows have limited benchmark evidence.

12 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

Total Measurements

113

Weighted Measurements

Weighted Sources

Raw Source Coverage

halluhard_leaderboard 17medhelm_leaderboard 12baxbench_leaderboard 9browsergym_leaderboard 8extractbench_paper 8sciarena_leaderboard 7

Weighted Source Coverage

basedagi_doc_summarization_eval 4basedagi_kb_qna_eval 4basedagi_log_triage_eval 4basedagi_support_bot_eval 4medhelm_leaderboard 4extractbench_paper 3

Best Use Cases for This Model

Use Case	Vertical	Score	Confidence	Evidence	Top Contributor
Log triage use_case.sre.log_triage	devops_sre	18.8%	22.2%	11	BasedAGI Log Triage Eval: overall_score_pct
IDE code completion use_case.dev.ide_completion	developer_tools	14.9%	17.1%	11	Aider Polyglot Leaderboard: percent_correct_pct
CAD scripting helper use_case.eng.cad_scripting_helper	engineering	14.2%	16.4%	12	Aider Polyglot Leaderboard: percent_correct_pct
Support bot (RAG grounded) use_case.cx.support_rag_bot	customer_experience	13.5%	17.7%	11	BasedAGI Support Bot Eval: overall_score_pct
PR review agent use_case.dev.pr_review_agent	developer_tools	13.3%	15.3%	13	GAIA Results Public: score
Code generation use_case.dev.code_generation	developer_tools	13.0%	15.1%	12	Aider Polyglot Leaderboard: percent_correct_pct
Knowledge base Q&A (with citations) use_case.business.kb_qna_with_citations	business_productivity	12.9%	18.3%	11	BasedAGI KB Q&A Eval: overall_score_pct
Contract redline summary use_case.legal.contract_redline_summary	legal	12.4%	15.7%	10	LEXam Leaderboard: average_score_pct
Document summarization use_case.business.doc_summarization	business_productivity	12.3%	16.8%	11	BasedAGI Document Summarization Eval: overall_score_pct
Contract term extraction use_case.legal.contract_term_extraction	legal	12.0%	15.2%	10	LEXam Leaderboard: average_score_pct
Clause playbook check use_case.legal.playbook_clause_check	legal	12.0%	15.2%	10	LEXam Leaderboard: average_score_pct
Agentic bug fixing use_case.dev.agentic_bug_fixing	developer_tools	10.8%	12.8%	12	GAIA Results Public: score

Deployment Fit Calculator

Model

openai/gpt-5

external/openai/gpt-5

Hardware

Quantization: 4-bit

2-bit8-bit

Insufficient

Unknown parameter count. Cannot estimate deployment fit.

Required VRAM

~0.0GB

Est. Throughput

0.00 tok/s

Deployment Fit Matrix

GPU	4-bit	6-bit	8-bit
RTX 3060 12GB	Insufficient	Insufficient	Insufficient
RTX 3090 24GB	Insufficient	Insufficient	Insufficient
RTX 4090 24GB	Insufficient	Insufficient	Insufficient
Mac Studio M2 Ultra 192GB	Insufficient	Insufficient	Insufficient