ministral-3b-2410

Ministral 3B
via mistral

Release Date

Oct 16th, 2024

Context Size

128k

Reasoning

No

Benchmark Cost

$0.14

Speed

Categories

20%40%60%80%100%Creative Writing75.5%Tooling87.6%Language42.2%Utility49.2%Reasoning69.7%Text Editing70.9%Rule Following24.4%Hallucination70.7%

Subcategories

20%40%60%80%100%AI-ismsProse VarietyDialoguePurple ProseMechanical StyleClichésXMLComprehensionGenerationWord CountingSentence CountingParagraph CountingStructural CountingData ExtractionDeductionAttentionTransformationPreservationStructural IntegrityConstraint AdherenceFalse PositivesContent InventionOutput Corruption

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario #1 #2 #3 #4 #5 Total
Detailed Writing Rules
777472716071%
797975747276%
837979777579%
767474727073%
777675747175%
828078747077%
Detailed Writing Rules74.99%
genre
717069676668%
787876766975%
807979797078%
827673727175%
827876676574%
847978747277%
genre74.51%
Novelcrafter Default Prompt
777372696671%
797976757477%
828178767077%
787777747376%
827978747377%
827875747476%
Novelcrafter Default Prompt75.88%
75.12%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario #1 #2 #3 #4 #5 Total
928989888789%
868585858485%
938989878388%
939290908991%
88.15%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario #1 #2 #3 #4 #5 #6 #7 Total
Generic Prompt
6969615856534459%
100100959595957794%
9696908176696582%
5959494242393947%
9676717067616172%
8888888885858587%
6562626155525158%
9479616060565266%
9778787264615472%
Generic Prompt70.72%
Specific Prompt
6464646161504759%
100100100100100100100100%
9292929292929192%
9797979796969697%
100100100100100100100100%
1001001001001001009399%
7978787876766475%
10010010010098989699%
10010010010099999599%
Specific Prompt91.11%
80.92%

Tool usage within Novelcrafter

Output messages that are related to tool usage within Novelcrafter

Scenario #1 #2 #3 #4 #5 #6 #7 #8 #9 #10 Total
1001001001001001001001000080%
80.00%