BasedAGIBasedAGI

marketing_sales

Best Model for Ad Copy Generation

Ranked models for generating diverse headline and CTA variants under strict constraints.

#1 Recommendation

qwen-2.5-72b-instruct

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

external/qwen/qwen-2-5-72b-instruct

29.1%

Score

45.3%

Confidence

15

Evidence

Ranked Models

30

Evidence Quality

97%

Evidence Points

15

Top Signal

Open LLM Leaderboard MMLU-Pro: mmlu_pro_accuracy_pct

Benchmark Sources

35

Last Updated

11h ago

All Ranked Models

30 of 30 models
RankModelScore
🥇qwen-2.5-72b-instruct

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

29.1%
🥈claude-sonnet-4

Strong on Galileo Agent Leaderboard v2 Avg TSQ and EQ-Bench Leaderboard eq_bench_score

23.7%
🥉Grok-4-0709

Strong on Galileo Agent Leaderboard v2 Avg TSQ and EQ-Bench Leaderboard eq_bench_score

23.6%
#4gemini-2.5-pro

Strong on EQ-Bench Leaderboard eq_bench_score and Galileo Agent Leaderboard v2 Avg TSQ

23.0%
#5Mistral-Large-Instruct-2411

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

21.9%
#6gpt-5-2025-08-07

Strong on EQ-Bench Leaderboard eq_bench_score and UGI Leaderboard Writing ✍️

21.2%
#7Mixtral-8x22B-Instruct-v0.1

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

20.6%
#8gemma-2-27b-it

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

20.0%
#9Steelskull/L3.3-MS-Nevoria-70b

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

19.8%
#10o3-20250416

Strong on EQ-Bench Leaderboard eq_bench_score and UGI Leaderboard Writing ✍️

19.3%
#11gpt-4o

Strong on CRMArena Function Calling overall_score_pct and EQ-Bench Leaderboard eq_bench_score

19.2%
#12Qwen2-72B-Instruct

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

19.1%
#13Sao10K/70B-L3.3-Cirrus-x1

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

18.7%
#14RYS-XLarge

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

18.7%
#15RYS-XLarge-base

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

18.5%
#16MaziyarPanahi/calme-3.2-instruct-78b

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

18.4%
#17Steelskull/L3.3-Nevoria-R1-70b

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

18.3%
#18Qwen2.5-32B-Instruct

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

18.3%
#19phi-4

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

18.1%
#20gpt-4.1-20250414

Strong on Galileo Agent Leaderboard v2 Avg TSQ and Galileo Agent Leaderboard v2 Avg AC

18.0%
#21MaziyarPanahi/calme-2.4-rys-78b

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

17.9%
#22MaziyarPanahi/calme-3.1-instruct-78b

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

17.9%
#23wizardlm-2-8x22b

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

17.9%
#24Tarek07/Progenitor-V1.1-LLaMa-70B

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

17.8%
#25CalmeRys-78B-Orpo-v0.1

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

17.8%
#26solar-pro-preview-instruct

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

17.5%
#27Apollo-70B

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

17.3%
#28Triangle104/Set-70b

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

17.3%
#29Homer-v1.0-Qwen2.5-72B

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

17.1%
#30Tarek07/Thalassic-Alpha-LLaMa-70B

Strong on Open LLM Leaderboard GPQA gpqa and Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct

17.1%

Head-to-Head: #1 vs #2

#1

Top Pick

qwen-2.5-72b-instruct

Strong on Open LLM Leaderboard MMLU-Pro mmlu_pro_accuracy_pct and Open LLM Leaderboard GPQA gpqa

29.1%

Conf 45.3%

#2

anthropic/claude-sonnet-4

Strong on Galileo Agent Leaderboard v2 Avg TSQ and EQ-Bench Leaderboard eq_bench_score

23.7%

Conf 30.6%

Related Lookups