deepseek/deepseek-v4-flash

DeepSeek V4 Flash (Reasoning)

Release Date

Apr 24th, 2026

Context Size

1m

Reasoning

Yes

Benchmark Cost

$0.41

Speed

64.8 tok/s

Categories

20%40%60%80%100%Creative Writing83.0%Tooling96.0%Language94.8%Utility87.5%Reasoning95.1%Text Editing96.1%Rule Following64.5%Hallucination95.0%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
817876747276%
908987848086%
888782817984%
919188878689%
939088868688%
939289888589%
Detailed Writing Rules85.34%
genre
837877727276%
838280807881%
848382797881%
828281777379%
868382777781%
848279747479%
genre79.46%
Novelcrafter Default Prompt
807876767477%
908483777682%
828181807780%
878583817783%
868684837483%
868282777380%
Novelcrafter Default Prompt80.78%
81.86%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
979797969697%
989797979597%
979696969596%
1009999999999%
97.06%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
100100100100100100100100%
10099999999999699%
10010010010010010099100%
100100100100100100100100%
100100100100100100100100%
9796959594948794%
100100100100100100100100%
100100999999998197%
Generic Prompt98.82%
Specific Prompt
1001001007269695681%
100100100100100100100100%
10010010010099999799%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
9999989896959497%
100100100100100100100100%
10010010010099969599%
Specific Prompt97.36%
98.09%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100676793%
93.33%