BasedAGIBasedAGI
Menu
Rankings live

creative

Best Model for Creative Longform Writing

Ranked models for generating and refining long-form fiction with continuity.

#1 Recommendation

qwen-2.5-72b-instruct

Strong on Creative Writing Official (EQ-Bench Slice) creative_writing_score (78%) and Judgemark Official (EQ-Bench Slice) judgemark_score (56%)

external/qwen/qwen-2-5-72b-instruct

25.3%

Score

40.8%

Confidence

13

Evidence

Ranked Models

30

Evidence Quality

79%

Scoring

Benchmark-backed

Top Signal

Creative Writing Official (EQ-Bench Slice): creative_writing_score

All Ranked Models

Max params:
Min confidence:
30 of 30
RankModelScore
#4qwen-2.5-72b-instruct
25.3%
#5gpt-4o
24.1%
#9gpt-4.1-20250414
21.8%
#10gemini-2.5-pro
21.5%
#11Grok-4-0709
19.9%
#16gemma-2-27b-it
15.7%
#18xai-org/grok-4-fast-reasoning
14.1%
#23xai-org/grok-4-1-fast-reasoning
13.3%
#24gemini-3-pro-preview
13.3%
#27grok/grok-4.20-beta-0309-reasoning
12.7%
#30gemini-3-flash-preview
12.5%
#32x-ai/grok-3
12.3%
#34claude-sonnet-4-20250514
12.1%
#35google/gemini-3.1-pro-preview
12.0%
#37gemini-2.5-flash
11.9%
#43gpt-5-2025-08-07
11.1%
#44openai/gpt-5.4-2026-03-05
10.9%
#46gpt-5.1-2025-11-13
10.6%
#47anthropic/claude-sonnet-4.6
10.5%
#48claude-opus-4-5-20251101
10.4%
#51gpt-5-mini-2025-08-07
10.2%
#52xai-org/grok-4-1-fast-non-reasoning
10.1%
#53Kimi-K2-Instruct
10.1%
#55anthropic/claude-opus-4-6-thinking
10.0%
#56gpt-5.2-2025-12-11
9.8%
#57gpt-4o-2024-05-13
9.8%
#59anthropic/claude-opus-4-5-20251101-thinking
9.6%
#60xai-org/grok-4-fast-non-reasoning
9.5%
#62qwen/qwen3-max
9.2%
#65DeepSeek-V2.5
9.1%

Head-to-Head: #1 vs #2

#4

Top Pick

qwen-2.5-72b-instruct

Strong on Creative Writing Official (EQ-Bench Slice) creative_writing_score (78%) and Judgemark Official (EQ-Bench Slice) judgemark_score (56%)

25.3%

Conf 40.8%

#5

gpt-4o

Strong on Creative Writing Official (EQ-Bench Slice) creative_writing_score (84%) and Judgemark Official (EQ-Bench Slice) judgemark_score (74%)

24.1%

Conf 30.9%

Related Lookups