deepseek/deepseek-chat-v3-0324

DeepSeek V3 (2025-03-24)

Release Date

Mar 24th, 2025

Context Size

163.8k

Reasoning

No

Benchmark Cost

$0.66

Speed

34.1 tok/s

Categories

20%40%60%80%100%Creative Writing82.3%Tooling93.5%Language86.4%Utility80.6%Reasoning88.5%Text Editing89.6%Rule Following67.9%Hallucination67.1%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
857776767578%
908783817483%
878785858085%
898981777782%
858381818082%
888685827984%
Detailed Writing Rules82.38%
genre
817774727275%
908785828285%
918786807885%
838380807280%
878684848285%
948180767381%
genre81.66%
Novelcrafter Default Prompt
808079747377%
848383807581%
848381817781%
878383807582%
868484828284%
828181767679%
Novelcrafter Default Prompt80.57%
81.53%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
969593929294%
969493929194%
969695959595%
1009999999698%
95.18%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100100100100100100100100%
1001001001001001008598%
9898979797979697%
10010010010010010099100%
10010010010095935892%
100100917474747083%
9593939286787788%
1001001009796966694%
9696969595892084%
Generic Prompt92.87%
Specific Prompt
100100100100100100100100%
1001001001001001005493%
999999999999085%
1001001001001009998100%
100100100100100100100100%
1001001001001001009599%
9797969594702081%
10010010010010046078%
100100100100100100100100%
Specific Prompt92.97%
92.92%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100100100100100100100100100090%
90.00%