BasedAGIBasedAGI
Menu
Rankings live

education

Best LLM for Tutoring

Compare models for Socratic teaching with guiding questions and stepwise hints.

#1 Recommendation

gpt-4.1-20250414

Strong on OpenVLM TextVQA Official textvqa_score_pct (77%) and OpenVLM OCRBench Official ocrbench_score_pct (88%)

external/openai/gpt-4-1-20250414

23.3%

Score

36.1%

Confidence

23

Evidence

Ranked Models

24

Evidence Quality

80%

Scoring

Benchmark-backed

Top Signal

OpenVLM TextVQA Official: textvqa_score_pct

All Ranked Models

Max params:
Min confidence:
24 of 24
RankModelScore
#1gpt-4.1-20250414

Strong on OpenVLM TextVQA Official textvqa_score_pct (77%) and OpenVLM OCRBench Official ocrbench_score_pct (88%)

23.3%
#5gpt-4.1-mini-20250414
19.4%
#15gemini-2.5-flash
16.2%
#30google/gemini-2.0-flash-001
14.3%
#50gemini-2.5-pro
12.3%
#53gpt-5-2025-08-07
11.9%
#60google/gemini-3.1-pro-preview
11.6%
#62Qwen-VL-Chat
11.4%
#64gpt-5-mini-2025-08-07
11.3%
#66Llama-3.1-70B-Instruct
11.1%
#83gpt-4o
9.8%
#89gemini-3-pro-preview
9.6%
#97Grok-4-0709
9.1%
#98Llama-3.3-70B-Instruct
9.0%
#99GPT-4.1-nano-2025-04-14
9.0%
#117claude-sonnet-4-20250514
8.3%
#123kimi/kimi-k2.5-thinking
8.1%
#141phi-4
6.7%
#147deepseek/deepseek-r1
6.0%
#148qwen-2.5-72b-instruct
5.9%
#155Meta-Llama-3-8B-Instruct
4.6%
#157openai/gpt-4o-mini-2024-07-18
4.4%
#160Phi-4-multimodal-instruct
3.4%
#169Qwen3-30B-A3B
0.9%

Head-to-Head: #1 vs #2

#1

Top Pick

gpt-4.1-20250414

Strong on OpenVLM TextVQA Official textvqa_score_pct (77%) and OpenVLM OCRBench Official ocrbench_score_pct (88%)

23.3%

Conf 36.1%

#5

gpt-4.1-mini-20250414

Strong on OpenVLM OCRBench Official ocrbench_score_pct (88%) and OpenVLM TextVQA Official textvqa_score_pct (70%)

19.4%

Conf 30.3%

Related Lookups