BasedAGIBasedAGI
Menu
Rankings live

creative

Best LLM for Screenwriting

Ranked models for writing screenplay scenes with formatting, pacing, and dialogue.

#1 Recommendation

qwen-2.5-72b-instruct

Strong on Creative Writing Official (EQ-Bench Slice) creative_writing_score (78%) and Judgemark Official (EQ-Bench Slice) judgemark_score (56%)

external/qwen/qwen-2-5-72b-instruct

29.9%

Score

48.2%

Confidence

13

Evidence

Ranked Models

30

Evidence Quality

80%

Scoring

Benchmark-backed

Top Signal

Creative Writing Official (EQ-Bench Slice): creative_writing_score

All Ranked Models

Max params:
Min confidence:
30 of 30
RankModelScore
#4qwen-2.5-72b-instruct
29.9%
#5gpt-4o
28.5%
#9gemini-2.5-pro
25.4%
#10Grok-4-0709
23.4%
#12gpt-4.1-20250414
22.8%
#16gemma-2-27b-it
18.6%
#18xai-org/grok-4-fast-reasoning
16.6%
#23xai-org/grok-4-1-fast-reasoning
15.7%
#24gemini-3-pro-preview
15.6%
#27grok/grok-4.20-beta-0309-reasoning
15.0%
#29gemini-3-flash-preview
14.7%
#31x-ai/grok-3
14.5%
#33claude-sonnet-4-20250514
14.2%
#34google/gemini-3.1-pro-preview
14.2%
#36gemini-2.5-flash
14.0%
#41gpt-5-2025-08-07
13.1%
#42openai/gpt-5.4-2026-03-05
12.9%
#44gpt-5.1-2025-11-13
12.5%
#46anthropic/claude-sonnet-4.6
12.4%
#47claude-opus-4-5-20251101
12.3%
#49gpt-5-mini-2025-08-07
12.0%
#50xai-org/grok-4-1-fast-non-reasoning
12.0%
#51Kimi-K2-Instruct
11.9%
#53anthropic/claude-opus-4-6-thinking
11.7%
#54gpt-5.2-2025-12-11
11.6%
#55gpt-4o-2024-05-13
11.5%
#58anthropic/claude-opus-4-5-20251101-thinking
11.4%
#59xai-org/grok-4-fast-non-reasoning
11.3%
#60qwen/qwen3-max
10.9%
#63DeepSeek-V2.5
10.8%

Head-to-Head: #1 vs #2

#4

Top Pick

qwen-2.5-72b-instruct

Strong on Creative Writing Official (EQ-Bench Slice) creative_writing_score (78%) and Judgemark Official (EQ-Bench Slice) judgemark_score (56%)

29.9%

Conf 48.2%

#5

gpt-4o

Strong on Creative Writing Official (EQ-Bench Slice) creative_writing_score (84%) and Judgemark Official (EQ-Bench Slice) judgemark_score (74%)

28.5%

Conf 36.5%

Related Lookups