BasedAGIBasedAGI
Menu
Rankings live

Model Profile

openai/gpt-5

External Benchmark Shadowexternal_benchmark_shadowinternal
4,096 ctx

Use this page to decide where this model is a strong fit. Rankings below are benchmark-backed by use case, with explicit confidence and contributor metrics.

Identity

ID: external/openai/gpt-5

Author: openai

Origin: external_benchmark_shadow

Arch: unknown

Benchmark Coverage

Scored use cases: 12

Avg confidence: 16.5%

Evidence points: 134

Raw rows: 113

Weighted rows: 38

Catalog Metadata

Parameters: unknown

Context window: 4096

Downloads: 0

Some fit rows have limited benchmark evidence.

12 of 12 scored use cases have low confidence or thin contributor coverage.

Coverage Diagnostics

actively scored

Use-Case Scores

52

Total Measurements

113

Weighted Measurements

38

Weighted Sources

14

Raw Source Coverage

halluhard_leaderboard 17medhelm_leaderboard 12baxbench_leaderboard 9browsergym_leaderboard 8extractbench_paper 8sciarena_leaderboard 7

Weighted Source Coverage

basedagi_doc_summarization_eval 4basedagi_kb_qna_eval 4basedagi_log_triage_eval 4basedagi_support_bot_eval 4medhelm_leaderboard 4extractbench_paper 3

Best Use Cases for This Model

Use CaseScore
Log triage

use_case.sre.log_triage

18.8%
IDE code completion

use_case.dev.ide_completion

14.9%
CAD scripting helper

use_case.eng.cad_scripting_helper

14.2%
Support bot (RAG grounded)

use_case.cx.support_rag_bot

13.5%
PR review agent

use_case.dev.pr_review_agent

13.3%
Code generation

use_case.dev.code_generation

13.0%
Knowledge base Q&A (with citations)

use_case.business.kb_qna_with_citations

12.9%
Contract redline summary

use_case.legal.contract_redline_summary

12.4%
Document summarization

use_case.business.doc_summarization

12.3%
Contract term extraction

use_case.legal.contract_term_extraction

12.0%
Clause playbook check

use_case.legal.playbook_clause_check

12.0%
Agentic bug fixing

use_case.dev.agentic_bug_fixing

10.8%

Deployment Fit Calculator

Model

openai/gpt-5

external/openai/gpt-5

2-bit8-bit

Insufficient

Unknown parameter count. Cannot estimate deployment fit.

Required VRAM

~0.0GB

Est. Throughput

0.00 tok/s

Deployment Fit Matrix

GPU4-bit6-bit8-bit
RTX 3060 12GBInsufficientInsufficientInsufficient
RTX 3090 24GBInsufficientInsufficientInsufficient
RTX 4090 24GBInsufficientInsufficientInsufficient
Mac Studio M2 Ultra 192GBInsufficientInsufficientInsufficient