deepseek/deepseek-v4-pro

DeepSeek V4 Pro (Reasoning)

Release Date: Apr 24th, 2026

Context Size: 1M tokens

Reasoning: Yes

Benchmark Cost: $7.84

Speed: 34.7 tok/s

Categories

Creative Writing: 83.0%
Tooling: 99.3%
Language: 88.5%
Utility: 93.2%
Reasoning: 94.6%
Text Editing: 98.6%
Rule Following: 72.7%
Hallucination: 90.9%

Subcategories

[Chart: per-subcategory scores on a 20% to 100% axis; values not recoverable. Subcategories shown: AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption]

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, clichés, AI-ism words/adverbs/names, and more.
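As a rough illustration of what detecting one of these anti-patterns could look like, the sketch below measures the share of filter words in a passage. The word list and the function name `filter_word_rate` are hypothetical and chosen only for this example; the benchmark's actual detectors, word lists, and weights are not described here.

```python
import re

# Hypothetical word list for illustration only; the benchmark's real
# lists are not given in this document.
FILTER_WORDS = {"saw", "felt", "heard", "noticed", "realized", "seemed"}


def filter_word_rate(text: str) -> float:
    """Return the fraction of words in `text` that are filter words."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    hits = sum(1 for word in words if word in FILTER_WORDS)
    return hits / len(words)


sample = "She felt the cold and noticed the door was open."
rate = filter_word_rate(sample)
print(f"{rate:.0%} of words are filter words")
```

A real checker would presumably combine several such detectors (passive voice, dialogue tags, clichés) into one score; this sketch covers a single signal.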

Scenario        #1    #2    #3    #4    #5   Total
Detailed Writing Rules
                87    86    85    81    78     83%
                89    89    88    86    85     87%
                89    88    88    85    84     87%
                89    87    86    86    84     86%
                89    87    86    85    82     86%
                92    91    90    87    84     89%
Detailed Writing Rules                      86.44%
genre
                82    80    75    73    70     76%
                89    84    81    76    75     81%
                81    77    76    76    76     77%
                83    81    81    76    75     79%
                88    81    78    78    76     80%
                81    80    79    76    69     77%
genre                                       78.44%
Novelcrafter Default Prompt
                80    79    77    72    70     76%
                84    82    79    79    73     79%
                91    88    86    82    78     85%
                88    84    82    82    81     83%
                89    88    86    82    81     85%
                90    88    81    80    77     83%
Novelcrafter Default Prompt                 81.97%
                                            82.28%
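
The 82.28% figure that closes this section is consistent with an unweighted mean of the three group results (86.44%, 78.44%, 81.97%). That is an observation from the published numbers, not a documented scoring rule, and the short check below only reproduces the arithmetic.

```python
# Group results as reported in this section.
group_results = {
    "Detailed Writing Rules": 86.44,
    "genre": 78.44,
    "Novelcrafter Default Prompt": 81.97,
}

# Assumption: the section score is a plain (unweighted) mean of the groups.
overall = sum(group_results.values()) / len(group_results)
print(f"{overall:.2f}%")
```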

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario        #1    #2    #3    #4    #5   Total
                98    97    97    96    95     97%
                96    96    96    94    94     96%
                99    98    98    98    97     98%
                99    99    99    99    95     98%
                                            97.08%
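
One part of this test, checking that a response is well-formed XML, can be sketched with Python's standard library. The `<codex>`, `<entry>`, and `<name>` element names below are hypothetical; the schema the benchmark actually expects is not shown in this document.

```python
import xml.etree.ElementTree as ET


def is_well_formed(xml_text: str) -> bool:
    """Return True if `xml_text` parses as a single well-formed XML document."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


# Hypothetical entry layout, for illustration only.
good = '<codex><entry type="character"><name>Mira</name></entry></codex>'
bad = '<codex><entry type="character"><name>Mira</entry></codex>'

print(is_well_formed(good), is_well_formed(bad))
```

Well-formedness is only the minimum bar; a fuller check would also compare the extracted entries against the expected characters, locations, objects, and lore.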

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario        #1    #2    #3    #4    #5    #6    #7   Total
Generic Prompt
               100   100   100   100   100   100   100    100%
               100   100   100   100   100   100   100    100%
               100   100    99    99    99    99    98     99%
               100   100   100   100   100   100    98    100%
               100   100   100   100   100   100   100    100%
               100   100   100   100   100   100   100    100%
                97    96    96    96    95    95    95     96%
               100   100   100   100   100   100   100    100%
               100   100   100    99    97    90    82     95%
Generic Prompt                                          98.90%
Specific Prompt
               100   100   100   100   100   100   100    100%
               100   100   100   100   100   100   100    100%
               100   100   100   100    99    99    99    100%
               100   100   100   100   100   100   100    100%
               100   100   100   100   100   100   100    100%
               100   100   100   100   100   100   100    100%
               100   100    99    98    98    98    97     98%
               100   100   100   100   100   100   100    100%
               100   100   100   100   100   100   100    100%
Specific Prompt                                         99.80%
                                                        99.35%
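
Scoring "each expected change independently" can be pictured as a list of separate pass/fail checks whose pass rate becomes the score. The sketch below is a minimal version under that assumption; the function name `score_replacement` and the substring-based checks are hypothetical, not the benchmark's actual scorer.

```python
def score_replacement(
    output: str,
    must_contain: list[str],
    must_not_contain: list[str],
) -> float:
    """Score an edited text as the percentage of independent checks that pass.

    Each expected change is one check: a new string that should appear,
    or an old string that should be gone.
    """
    checks = [needle in output for needle in must_contain]
    checks += [needle not in output for needle in must_not_contain]
    if not checks:
        return 100.0
    return 100.0 * sum(checks) / len(checks)


# Task: rename "Anna" to "Elena" and expand the contraction "didn't".
output = "Elena did not see Anna leave."
score = score_replacement(
    output,
    must_contain=["Elena", "did not"],
    must_not_contain=["Anna", "didn't"],
)
print(score)  # 75.0: one stray "Anna" fails a single check out of four
```

Because every change is checked on its own, a single missed rename lowers the score slightly instead of failing the whole scenario, which matches the near-100% rows with small dips seen above.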

Tool usage within Novelcrafter

Evaluates output messages related to tool usage within Novelcrafter.

Scenario        #1    #2    #3    #4    #5    #6    #7    #8    #9   #10   Total
               100   100   100   100   100   100   100   100   100   100    100%
                                                                         100.00%