BasedAGIBasedAGI
Menu
Rankings live

data_analytics

Best LLM for SQL Debugging

Compare models for diagnosing and fixing SQL queries for correctness and performance.

#1 Recommendation

gpt-4o-20241120

Strong on DuckDB NSQL Leaderboard all_execution_accuracy (96%) and DuckDB NSQL Leaderboard hard_execution_accuracy (75%)

external/openai/gpt-4o-20241120

24.5%

Score

44.7%

Confidence

15

Evidence

Ranked Models

30

Evidence Quality

82%

Scoring

Benchmark-backed

Top Signal

DuckDB NSQL Leaderboard: all_execution_accuracy

All Ranked Models

Max params:
Min confidence:
30 of 30
RankModelScore
#1gpt-4o-20241120

Strong on DuckDB NSQL Leaderboard all_execution_accuracy (96%) and DuckDB NSQL Leaderboard hard_execution_accuracy (75%)

24.5%
#3gpt-4o

Strong on DuckDB NSQL Leaderboard all_execution_accuracy (77%) and JSONSchemaBench Leaderboard medium_schema_compliance_pct (100%)

20.3%
#4deepseek/deepseek-r1
19.6%
#5qwen-2.5-72b-instruct
18.7%
#11openai/gpt-4o-mini-2024-07-18
14.8%
#15gpt-4o-2024-08-06
13.2%
#20google/gemini-2.0-flash-001
11.9%
#23Llama-3.3-70B-Instruct
11.3%
#24Qwen3-30B-A3B
11.1%
#26Qwen2.5-Coder-7B
11.0%
#33gemma-2-27b-it
10.1%
#35phi-4
9.8%
#37Phi-3-medium-128k-instruct
9.5%
#38Qwen3-32B
9.3%
#41gpt-4.1-20250414
9.1%
#42QwQ-32B-Preview
9.0%
#44Meta-Llama-3.1-8B
8.5%
#47gemini-3-pro-preview
7.8%
#48deepseek-v3
7.7%
#53gemini-2.5-pro
7.4%
#54Grok-4-0709
7.4%
#55Phi-3-mini-128k-instruct
7.4%
#57claude-sonnet-4-20250514
7.1%
#59Llama-3.1-70B-Instruct
6.9%
#68Meta-Llama-3-8B-Instruct
5.7%
#69Qwen2.5-Coder-1.5B-Instruct
5.6%
#70DeepSeek-Coder-V2-Lite-Instruct
5.3%
#75minimax/minimax-m2.1
4.3%
#77gemma-2
4.0%
#82starcoder2-15b
1.7%

Head-to-Head: #1 vs #2

#1

Top Pick

gpt-4o-20241120

Strong on DuckDB NSQL Leaderboard all_execution_accuracy (96%) and DuckDB NSQL Leaderboard hard_execution_accuracy (75%)

24.5%

Conf 44.7%

#3

gpt-4o

Strong on DuckDB NSQL Leaderboard all_execution_accuracy (77%) and JSONSchemaBench Leaderboard medium_schema_compliance_pct (100%)

20.3%

Conf 41.9%

Related Lookups