BasedAGI | What Model Should You Use?

What model should
you use?

Benchmark-backed rankings for 143 use cases. No opinions. No vibes. Just evidence.

Code Generation RAG Q&A Text-to-SQL Support Bot Doc Summary Log Triage Debugging Clinical Notes

167/168

Benchmark sources

143

Use cases scored

Daily

Updates

Browse Use Cases

Find ranked models for any workflow

Explore →

Model Rankings

Cross-task leaderboard by utility score

View Rankings →

Top Models

Full Rankings →

#	Model	Score	Use Cases	Confidence
1	gemini-3-pro-preview	25.8%	143	28.3%
2	gemini-2.5-pro	24.7%	143	35.0%
3	gpt-4.1-20250414	22.5%	143	30.8%
4	anthropic/claude-sonnet-4.6	21.1%	129	23.5%
5	Grok-4-0709	21.1%	143	27.3%

Popular Use Cases

All Use Cases →

finance

Earnings call synthesis

Summarize earnings calls into key points, tone, and risks.

Top: gemini-3-pro-preview

42.4%

devops_sre

Log triage

Interpret logs and propose safe diagnostic steps.

Top: gemini-3-pro-preview

42.5%

business_productivity

Knowledge base Q&A (with citations)

Answer questions grounded in an internal KB, with evidence.

Top: gemini-3-pro-preview

41.9%

business_productivity

Document summarization

Summarize long business documents into scannable outputs.

Top: gemini-3-pro-preview

39.1%

legal

Contract term extraction

Extract key terms into structured fields with clause references.

Top: gemini-2.5-pro

33.4%

customer_experience

Support bot (RAG grounded)

Support chatbot grounded in docs with optional citations and escalation.

Top: gemini-3-pro-preview

36.3%

Quick Lookups

50 indexed

Best LLM for Code Generation

Benchmark-backed ranking of models for generating correct, secure code from requirements.

Best LLM for Debugging

Find the top-ranked models for localizing bugs and proposing fixes with explanations.

Best LLM for Unit Test Generation

Ranked models for generating meaningful unit tests and edge cases from code.

Best LLM for Code Review

Compare models for automated PR review covering correctness, security, and maintainability.

Best LLM for Refactoring

Ranked models for safely refactoring code while preserving behavior and improving clarity.

Best LLM for IDE Code Completion

Compare models for fast, accurate local-context code completion and snippet generation.