deepseek/deepseek-chat

DeepSeek-V2 Chat

Release Date: May 6th, 2024

Context Size: 128k

Reasoning: No

Benchmark Cost: $0.90

Speed: 261.3 tok/s

Categories

Creative Writing: 77.2%
Tooling: 99.8%
Language: 100.0%
Utility: 83.8%
Reasoning: 88.7%
Text Editing: 90.9%
Rule Following: 68.8%
Hallucination: 69.5%

Subcategories

[Chart: subcategory scores on a 20% to 100% axis. Subcategories: AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption]

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.
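
As a rough illustration of what this kind of detection can look like, here is a minimal sketch that counts a few of the listed anti-patterns with plain string matching. The word lists, the `count_habits` name, and the "-ly" adverb heuristic are assumptions made for this example; the benchmark's actual rules and lists are not given here.

```python
import re
from collections import Counter

# Hypothetical word lists for illustration only; not the benchmark's real lists.
FILTER_WORDS = {"saw", "felt", "heard", "noticed", "realized", "seemed"}
CLICHES = ("heart pounded", "let out a breath")


def count_habits(text: str) -> Counter:
    """Count a few prose anti-patterns in `text` (a simplified sketch)."""
    counts = Counter()
    lowered = text.lower()
    words = re.findall(r"[a-z']+", lowered)
    # Filter words: verbs that put the narrator between reader and action.
    counts["filter_words"] = sum(1 for w in words if w in FILTER_WORDS)
    # Crude adverb heuristic: words longer than four letters ending in "ly".
    counts["ly_adverbs"] = sum(1 for w in words if w.endswith("ly") and len(w) > 4)
    # Cliches: fixed phrases matched as substrings.
    counts["cliches"] = sum(lowered.count(phrase) for phrase in CLICHES)
    return counts


sample = "She felt her heart pounded loudly. He noticed the door."
print(count_habits(sample))
```

A real detector would likely need part-of-speech tagging for passive voice and past progressive; substring matching is only enough for fixed word lists.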

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
87  79  79  78  66  78%
83  80  75  73  71  76%
87  81  79  78  77  80%
81  81  80  78  76  79%
85  76  74  73  73  76%
86  85  77  76  67  78%
Detailed Writing Rules: 78.09%
genre
79  76  76  75  75  76%
85  79  71  71  68  75%
80  78  73  72  67  74%
84  81  80  79  70  79%
75  73  73  73  66  72%
88  85  79  76  75  80%
genre: 75.99%
Novelcrafter Default Prompt
74  74  73  71  69  72%
80  77  76  69  64  73%
89  79  78  77  76  80%
80  76  76  73  71  75%
85  79  74  73  71  77%
87  84  80  79  72  80%
Novelcrafter Default Prompt: 76.24%
Overall: 76.78%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
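
The well-formedness requirement can be checked mechanically. The sketch below parses a model response with Python's standard `xml.etree.ElementTree` and returns the entries it finds, or `None` if the XML does not parse. The `<codex>`/`<entry>`/`<name>` element names and the `type` attribute are a hypothetical schema chosen for this example, not the benchmark's actual format.

```python
import xml.etree.ElementTree as ET


def parse_codex(xml_text: str):
    """Return (type, name) pairs from a codex document, or None if malformed.

    The element and attribute names are assumptions for illustration.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        # Unclosed tags, stray text, and similar errors end up here.
        return None
    return [(entry.get("type"), entry.findtext("name")) for entry in root.iter("entry")]


good = '<codex><entry type="character"><name>Mira</name></entry></codex>'
bad = "<codex><entry>"
print(parse_codex(good))
print(parse_codex(bad))
```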

Scenario #1 #2 #3 #4 #5 Total
97  96  96  94  93  95%
95  95  95  94  88  94%
97  97  96  92  90  94%
99  98  95  94  93  96%
Overall: 94.78%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
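
To make "checking each expected change independently" concrete, here is a minimal sketch of one way such a score could be computed: each expected change becomes a separate pass/fail check, and the score is the share of checks that pass. The `score_replacement` function and its substring-based checks are assumptions for this example; the benchmark's real scoring code is not shown here.

```python
def score_replacement(output: str, must_contain: list, must_not_contain: list) -> float:
    """Score an output as the percentage of independent checks that pass.

    Hypothetical scoring rule: one check per string that must appear,
    and one check per string that must be gone.
    """
    checks = [s in output for s in must_contain]
    checks += [s not in output for s in must_not_contain]
    if not checks:
        return 100.0
    return 100.0 * sum(checks) / len(checks)


# Task: rename "Alice" to "Anna" and expand the contraction "didn't".
expected = ["Anna", "did not", "Riverton"]
forbidden = ["Alice", "didn't"]
print(score_replacement("Anna did not go to Riverton.", expected, forbidden))
print(score_replacement("Anna didn't go to Riverton.", expected, forbidden))
```

Because every check counts separately, an output that renames the character but leaves the contraction in place still earns partial credit rather than failing outright.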

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
100  100  100  100  100  100  100  100%
100  100  100  100  100  100  100  100%
98  98  98  97  97  97  97  97%
100  100  100  100  100  100  100  100%
100  100  100  100  100  100  96  99%
100  100  100  74  74  74  74  85%
94  94  93  93  93  92  91  93%
100  100  100  99  99  97  97  99%
96  96  96  95  89  88  88  93%
Generic Prompt: 96.21%
Specific Prompt
100  100  100  100  100  100  100  100%
100  100  100  100  100  100  100  100%
99  99  99  99  99  99  99  99%
100  100  100  100  100  100  99  100%
100  100  100  100  100  100  100  100%
100  100  100  100  100  100  100  100%
97  94  94  92  91  20  20  73%
100  100  100  100  100  100  100  100%
100  100  100  100  100  100  100  100%
Specific Prompt: 96.87%
Overall: 96.54%

Tool usage within Novelcrafter

Tests the output messages a model produces for tool usage within Novelcrafter.

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
100  100  100  100  100  100  100  100  100  100  100%
Overall: 100.00%