ministral-8b-2410

Ministral 8B
via mistral

Release Date

Oct 16th, 2024

Context Size

128k

Reasoning

No

Benchmark Cost

$0.33

Speed

Categories

20%40%60%80%100%Creative Writing76.9%Tooling85.6%Language53.9%Utility46.8%Reasoning73.8%Text Editing77.5%Rule Following15.3%Hallucination89.2%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
797268666670%
828176766676%
828180797379%
797877716774%
867978767178%
817776757276%
Detailed Writing Rules75.67%
genre
787571706672%
847979767278%
878483817783%
807776686373%
827571706773%
838180777579%
genre76.27%
Novelcrafter Default Prompt
797371706972%
848382797480%
777776767576%
857978787078%
898481807381%
838077767578%
Novelcrafter Default Prompt77.71%
76.55%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
929291919091%
949493939293%
929191908890%
999790898792%
91.70%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
9292866461565672%
100100100100100100100100%
6866636257545360%
9898989896918995%
6565636363564560%
7574747474747474%
7672727171696871%
7169686665616066%
9998989897979197%
Generic Prompt77.31%
Specific Prompt
9797979794928394%
100100100100100100100100%
9999999999949197%
100100100100100100100100%
100100100100100100100100%
100100100100100100100100%
7675757575757475%
100100100100100100100100%
9999999999999598%
Specific Prompt96.02%
86.67%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
1001001001001001001001000080%
80.00%