deepseek/deepseek-chat-v3.1

DeepSeek V3.1

Release Date: Aug 21st, 2025
Context Size: 163.8k tokens
Reasoning: No
Benchmark Cost: $0.74
Speed: 23.6 tok/s

Categories

Creative Writing: 77.4%
Tooling: 98.0%
Language: 96.9%
Utility: 76.7%
Reasoning: 84.0%
Text Editing: 87.3%
Rule Following: 66.2%
Hallucination: 72.8%

Subcategories

[Chart: scores by subcategory, on a 20% to 100% axis. Axis labels: AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption]

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.
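As a rough illustration of what one such detector might look like, the sketch below measures filter-word density in a passage. The word list, the function name `filter_word_rate`, and the per-100-words metric are all assumptions made for this example; the page does not describe how the benchmark implements its checks.

```python
import re

# Hypothetical filter-word list, chosen only for illustration.
# The benchmark's actual word lists are not given on this page.
FILTER_WORDS = {"saw", "felt", "heard", "noticed", "realized", "seemed"}


def filter_word_rate(text: str) -> float:
    """Return the number of filter words per 100 words (0.0 for empty text)."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return 0.0
    hits = sum(1 for word in words if word in FILTER_WORDS)
    return 100.0 * hits / len(words)


sample = "She saw the door and felt cold."
print(f"{filter_word_rate(sample):.1f} filter words per 100 words")
```

A real detector would presumably combine many such signals (passive voice, dialogue tags, clichés) into the percentage shown in the table; this sketch covers only one of them.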

Scenario                       #1   #2   #3   #4   #5   Total
Detailed Writing Rules
                               79   77   76   74   71   75%
                               80   80   78   77   71   77%
                               80   79   78   75   72   77%
                               85   82   80   80   75   80%
                               83   80   79   78   73   79%
                               84   81   80   79   78   81%
Detailed Writing Rules: 78.10%
genre
                               77   73   71   66   61   70%
                               78   78   77   74   70   75%
                               75   72   72   69   65   70%
                               78   76   75   71   70   74%
                               77   76   74   73   73   75%
                               82   82   81   75   70   78%
genre: 73.73%
Novelcrafter Default Prompt
                               84   76   71   68   67   73%
                               86   84   77   76   68   78%
                               81   80   78   76   74   78%
                               83   82   81   77   76   80%
                               78   77   77   75   74   76%
                               86   83   80   78   77   81%
Novelcrafter Default Prompt: 77.65%
Overall: 76.49%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
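A minimal sketch of the well-formedness part of such a check, using Python's standard `xml.etree.ElementTree` parser. The `<codex>` root element, the child tag names, and the `name` attribute are a hypothetical schema invented for this example; the format the benchmark actually expects is not shown on this page.

```python
import xml.etree.ElementTree as ET


def parse_codex(xml_text: str):
    """Return (tag, name) pairs for each entry, or None if the XML is not well-formed."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        # Unclosed or mismatched tags land here.
        return None
    return [(entry.tag, entry.get("name")) for entry in root]


# Hypothetical model output using the assumed schema.
good = '<codex><character name="Mira"/><location name="Harbor"/></codex>'
bad = "<codex><character></codex>"

print(parse_codex(good))
print(parse_codex(bad))
```

A full grader would also need to compare the extracted entries against the expected ones; this sketch only separates parseable output from broken output.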

Scenario   #1   #2   #3   #4   #5   Total
           98   98   97   96    3   78%
           98   97   96   95   94   96%
           97   96   95   95   93   95%
           96   96   96   94   94   95%
Overall: 91.20%
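The figures above are consistent with each row total being the mean of its five run scores, and the section score being the unweighted mean of the row means. That aggregation rule is an inference from the numbers, not something the page states. A short check, reading each row as five scores followed by a total:

```python
from statistics import mean

# Run scores read from the four Codex Extraction rows (totals omitted).
runs = [
    [98, 98, 97, 96, 3],
    [98, 97, 96, 95, 94],
    [97, 96, 95, 95, 93],
    [96, 96, 96, 94, 94],
]

# Assumed aggregation: mean per row, then unweighted mean of the row means.
row_means = [mean(row) for row in runs]
overall = mean(row_means)

print([round(m, 1) for m in row_means])
print(round(overall, 2))
```

Under this reading, the single run scoring 3 in the first row accounts for most of the gap between the section score and the mid-90s results of the other rows.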

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
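Scoring each expected change independently can be sketched as a list of pass/fail checks averaged into a percentage. The function name `score_replacement`, the substring-based checks, and the example strings are assumptions for illustration; the benchmark's actual matching rules are not described here.

```python
def score_replacement(output: str, expected: list[str], forbidden: list[str]) -> float:
    """Score an output as the percentage of independent checks that pass.

    expected:  strings that must appear in the output (the new text).
    forbidden: strings that must no longer appear (the old text).
    """
    checks = [s in output for s in expected] + [s not in output for s in forbidden]
    if not checks:
        return 100.0
    return 100.0 * sum(checks) / len(checks)


# Hypothetical rename task: "Anna" should become "Mara" everywhere.
output = "Mara opened the door. Anna smiled."
print(score_replacement(output, expected=["Mara"], forbidden=["Anna"]))
```

Because every check counts separately, an output that applies most changes but misses one still earns partial credit, which fits the spread of non-zero, non-perfect scores in the table.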

Scenario            #1    #2    #3    #4    #5    #6    #7   Total
Generic Prompt
                   100   100   100   100   100   100    83   98%
                   100   100   100   100   100    95    41   91%
                    99    99    99    99    99    99    99   99%
                   100   100   100   100   100   100    98   100%
                   100   100   100   100   100   100    96   99%
                    74    74    74    74    74    74     5   64%
                    98    97    97    96    95    95    43   89%
                   100   100   100   100   100   100   100   100%
                    88    87    81    81    81    43     9   67%
Generic Prompt: 89.56%
Specific Prompt
                   100   100   100   100   100   100    86   98%
                   100   100   100   100   100   100   100   100%
                    99    99    99    99    99    99    21   88%
                   100   100   100   100    38     9     3   64%
                   100   100   100   100   100   100   100   100%
                   100   100   100   100   100    96    45   92%
                    98    98    98    97    95    95    46   90%
                   100   100   100   100   100    41    37   82%
                   100   100   100   100   100   100    32   90%
Specific Prompt: 89.39%
Overall: 89.48%

Tool usage within Novelcrafter

Evaluates the model's output messages related to tool usage within Novelcrafter.

Scenario    #1    #2    #3    #4    #5    #6    #7    #8    #9   #10   Total
           100   100   100   100   100   100   100   100   100   100   100%
Overall: 100.00%