open-mistral-nemo-2407

Mistral NeMo
via mistral

Release Date: Jul 18th, 2024
Context Size: 128k
Reasoning: No
Benchmark Cost: $0.40

Speed

Categories

Creative Writing: 76.7%
Tooling: 83.2%
Language: 80.8%
Utility: 51.6%
Reasoning: 57.6%
Text Editing: 73.7%
Rule Following: 34.1%
Hallucination: 62.6%

Subcategories

[Chart: per-subcategory scores on a 0-100% axis. Subcategories shown: AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption]

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.

Scenario                       #1    #2    #3    #4    #5    Total
Detailed Writing Rules
                               74    73    73    73    68    72%
                               79    77    76    70    68    74%
                               81    81    75    74    70    76%
                               82    82    81    80    76    80%
                               82    82    80    73    70    77%
                               87    74    73    70    70    75%
Detailed Writing Rules                                       75.94%
genre
                               80    79    74    67    65    73%
                               84    83    81    69    68    77%
                               81    79    78    77    73    78%
                               87    80    80    80    77    81%
                               82    75    73    71    67    74%
                               83    77    75    75    73    77%
genre                                                        76.44%
Novelcrafter Default Prompt
                               81    71    70    69    67    72%
                               75    74    74    73    71    73%
                               81    81    80    77    76    79%
                               79    78    75    73    72    76%
                               79    77    74    73    72    75%
                               78    78    77    76    74    77%
Novelcrafter Default Prompt                                  75.22%
Overall                                                      75.87%
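
The kind of check described for this benchmark can be illustrated with a small sketch. It is not the benchmark's actual detector: the `FILTER_WORDS` list, the function names, and the regular expression are all hypothetical, chosen only to show how two of the listed anti-patterns (filter words and past progressive overuse) might be counted.

```python
import re
from typing import List

# Hypothetical word list for illustration only; the benchmark's real lists
# are not given in the source.
FILTER_WORDS = {"saw", "heard", "felt", "noticed", "realized", "seemed"}


def count_filter_words(text: str) -> int:
    """Count words in the text that appear in the (assumed) filter-word list."""
    words = re.findall(r"[a-z']+", text.lower())
    return sum(1 for w in words if w in FILTER_WORDS)


def past_progressive_hits(text: str) -> List[str]:
    """Return 'was/were + -ing' phrases, a rough proxy for past progressive."""
    return re.findall(r"\b(?:was|were)\s+\w+ing\b", text, flags=re.IGNORECASE)


sample = "She felt the wind. He was walking home and noticed the rain."
print(count_filter_words(sample))      # 2
print(past_progressive_hits(sample))   # ['was walking']
```

A real detector would need longer word lists and some handling of false positives (for example "was nothing"), so this should be read as a starting point only.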

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.

Scenario    #1    #2    #3    #4    #5    Total
            0     0     0     0     0     0%
            0     0     0     0     0     0%
            86    85    83    0     0     51%
            93    90    85    0     0     54%
Overall                                   26.12%
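
Since this benchmark expects well-formed XML, a minimal well-formedness check might look like the sketch below. The element and attribute names (`codex`, `entry`, `type`, `name`) are assumptions made for illustration; the source does not specify the expected schema or how the benchmark parses responses.

```python
import xml.etree.ElementTree as ET


def parse_codex(xml_text):
    """Return (type, name) pairs, or None if the XML is not well-formed.

    The <entry type="..."><name>...</name></entry> layout is hypothetical.
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return None
    return [(e.get("type"), e.findtext("name")) for e in root.iter("entry")]


good = '<codex><entry type="character"><name>Mira</name></entry></codex>'
bad = '<codex><entry type="character"><name>Mira</entry></codex>'  # unclosed <name>

print(parse_codex(good))  # [('character', 'Mira')]
print(parse_codex(bad))   # None
```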

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.

Scenario           #1    #2    #3    #4    #5    #6    #7    Total
Generic Prompt
                   97    94    94    94    94    92    89    94%
                   100   100   100   100   98    98    58    93%
                   88    86    80    79    79    78    77    81%
                   99    98    98    98    98    98    98    98%
                   89    67    63    60    60    56    54    64%
                   74    74    74    74    74    74    74    74%
                   63    52    49    32    24    20    17    37%
                   85    81    81    68    63    44    41    66%
                   99    99    98    98    98    97    96    98%
Generic Prompt                                               78.28%
Specific Prompt
                   92    92    92    92    83    81    61    85%
                   100   100   100   100   100   100   100   100%
                   99    99    99    99    99    99    93    98%
                   99    99    99    99    99    96    94    98%
                   96    96    96    96    96    96    93    95%
                   100   100   100   100   100   100   100   100%
                   88    85    78    78    78    78    78    80%
                   100   100   100   100   100   100   100   100%
                   99    99    99    99    99    99    99    99%
Specific Prompt                                              94.99%
Overall                                                      86.64%
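
Scoring "each expected change independently" can be sketched as a list of separate pass/fail checks averaged into a percentage. This is an illustrative assumption about the approach, not the benchmark's code: the function name, its parameters, and the substring-based checks are all hypothetical.

```python
def score_replacements(output, must_contain, must_not_contain):
    """Score a transformed text as the percentage of independent checks passed.

    Each expected string must be present, and each forbidden string absent.
    Plain substring matching is a simplification chosen for this sketch.
    """
    checks = [s in output for s in must_contain]
    checks += [s not in output for s in must_not_contain]
    return 100.0 * sum(checks) / len(checks) if checks else 0.0


# Hypothetical task: rename John to Elena and expand contractions.
output = "Elena did not go to Harbor Town. John stayed."
score = score_replacements(
    output,
    must_contain=["Elena", "did not"],
    must_not_contain=["John", "didn't"],
)
print(score)  # 75.0 (one leftover "John" fails one of four checks)
```

Scoring per change rather than all-or-nothing gives partial credit, which would fit the gradual declines visible across scenarios in the table above.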

Tool usage within Novelcrafter

Evaluates output messages related to tool usage within Novelcrafter.

Scenario    #1    #2    #3    #4    #5    #6    #7    #8    #9    #10   Total
            100   100   100   100   100   100   100   100   100   67    97%
Overall                                                                 96.67%
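
The row total appears to be the mean of the per-scenario scores. The sketch below recomputes it from the displayed values, assuming a plain arithmetic mean, which the source does not state explicitly.

```python
from statistics import mean

# Displayed per-scenario scores for the tool usage row (scenarios #1 to #10).
tool_scores = [100] * 9 + [67]

total = mean(tool_scores)
print(total)         # 96.7
print(round(total))  # 97, matching the displayed row total
```

The page reports 96.67% overall, which would be consistent with the last scenario scoring about 66.7 before rounding to 67 for display.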