inception/mercury-2

Inception Mercury 2

Release Date: Mar 4th, 2026
Context Size: 128k
Reasoning: No
Benchmark Cost: $1.47
Speed: 665.0 tok/s

Categories

Creative Writing: 68.3%
Tooling: 98.0%
Language: 87.3%
Utility: 92.9%
Reasoning: 92.0%
Text Editing: 85.3%
Rule Following: 54.4%
Hallucination: 92.6%

Subcategories

[Chart: per-subcategory scores on a 20% to 100% axis; the values did not survive extraction. Subcategories shown: AI-isms, Prose Variety, Dialogue, Purple Prose, Mechanical Style, Clichés, XML, Comprehension, Generation, Word Counting, Sentence Counting, Paragraph Counting, Structural Counting, Data Extraction, Deduction, Attention, Transformation, Preservation, Structural Integrity, Constraint Adherence, False Positives, Content Invention, Output Corruption.]

Bad Writing Habits

Detects common prose quality anti-patterns in AI-generated creative writing, including passive voice, past progressive overuse, weak dialogue tags, filter words, purple prose, cliches, AI-ism words/adverbs/names, and more.
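As an illustration of one anti-pattern from this list, the sketch below flags filter words in a passage. It is a minimal sketch only: the word list and the `filter_word_hits` helper are hypothetical, and the benchmark's actual detectors and word lists are not described on this page.

```python
import re

# Hypothetical filter-word list, chosen for illustration only.
FILTER_WORDS = {"saw", "heard", "felt", "noticed", "realized", "seemed", "watched"}

def filter_word_hits(text: str) -> list[str]:
    """Return every filter word found in the text, in order of appearance."""
    words = re.findall(r"[a-z']+", text.lower())
    return [w for w in words if w in FILTER_WORDS]

sample = "She felt the cold wind and noticed the door was open."
print(filter_word_hits(sample))  # ['felt', 'noticed']
```

A real detector would likely normalize hit counts by passage length before turning them into a score; that step is left out here.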

Scenario                      #1   #2   #3   #4   #5   Total
Detailed Writing Rules
                              71   71   66   63   62   67%
                              69   65   62   61   61   64%
                              77   76   75   72   66   73%
                              75   74   70   70   68   71%
                              76   76   73   72   72   74%
                              72   68   66   64   63   66%
Detailed Writing Rules: 69.14%
genre
                              65   63   63   61   56   62%
                              64   62   61   61   58   61%
                              70   68   68   66   65   67%
                              68   68   67   66   65   67%
                              65   62   62   61   58   62%
                              65   64   63   63   62   63%
genre: 63.66%
Novelcrafter Default Prompt
                              67   66   65   65   65   65%
                              66   66   62   60   60   63%
                              77   75   74   74   73   74%
                              80   80   74   71   71   75%
                              76   76   76   72   70   74%
                              71   70   70   69   68   69%
Novelcrafter Default Prompt: 70.26%
Overall: 67.69%

Codex Extraction

Evaluates a model's ability to extract structured codex entries (characters, locations, objects, lore) from prose passages and return them as well-formed XML.
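The "well-formed XML" requirement can be checked mechanically. The sketch below uses Python's standard `xml.etree.ElementTree` parser; the `codex`, `entry`, and `name` element names are assumptions made for illustration, since the page does not show the schema the benchmark expects.

```python
import xml.etree.ElementTree as ET

def is_well_formed(xml_text: str) -> bool:
    """Return True if the text parses as a single well-formed XML document."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False

# Hypothetical codex output; element names are illustrative, not the benchmark's schema.
good = "<codex><entry type='character'><name>Mara</name></entry></codex>"
bad = "<codex><entry type='character'><name>Mara</entry></codex>"  # <name> never closed

print(is_well_formed(good), is_well_formed(bad))  # True False
```

Well-formedness is only the first gate; whether the extracted entries are correct would need a separate comparison against the expected entries.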

Scenario                      #1   #2   #3   #4   #5   Total
                              93   93   92   92   91   92%
                              98   95   94   94   93   95%
                              91   90   89   89   88   89%
                              94   94   93   89   88   92%
Overall: 92.02%

Text Replacement

Tests deterministic text transformations: renaming characters/locations, expanding contractions, tense rewriting, POV shifts, gender swaps, combined transformations, and word avoidance. Scored by checking each expected change independently.
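Scoring each expected change independently might look like the sketch below, where every required or forbidden string is its own pass/fail check and the score is the percentage of checks passed. The `score_replacement` function and the sample strings are hypothetical; the benchmark's real checks are not shown on this page.

```python
def score_replacement(output: str, required: list[str], forbidden: list[str]) -> float:
    """Score an output as the percentage of independent checks it passes."""
    # One check per string that must appear, one per string that must not appear.
    checks = [s in output for s in required] + [s not in output for s in forbidden]
    if not checks:
        return 100.0  # nothing was expected, so nothing can fail
    return 100 * sum(checks) / len(checks)

# Hypothetical task: rename Anna to Elena and expand all contractions.
output = "Elena did not see Marcus leave. Elena couldn't stop him."
required = ["Elena", "did not", "could not"]
forbidden = ["Anna"]

print(score_replacement(output, required, forbidden))  # 75.0
```

Here the rename and one expansion succeed, but the missed "couldn't" fails a single check, so the output earns partial credit instead of zero.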

Scenario                      #1   #2   #3   #4   #5   #6   #7   Total
Generic Prompt
                              100  100  100  100  100  100  100  100%
                              100  100  100  100  100  100  100  100%
                              97   96   96   95   95   95   84   94%
                              100  100  100  99   99   99   83   97%
                              100  100  100  100  100  100  100  100%
                              99   99   99   99   95   95   95   97%
                              90   90   78   78   77   74   71   80%
                              100  100  100  100  100  100  100  100%
                              97   97   95   95   95   92   87   94%
Generic Prompt: 95.85%
Specific Prompt
                              100  100  100  100  100  100  100  100%
                              100  100  100  100  100  100  100  100%
                              98   98   98   95   95   93   91   96%
                              100  100  100  100  100  99   99   100%
                              100  100  100  100  100  100  100  100%
                              95   95   95   95   95   95   95   95%
                              98   97   52   52   52   42   37   61%
                              100  100  100  100  100  100  100  100%
                              100  100  99   93   93   89   87   94%
Specific Prompt: 94.02%
Overall: 94.93%

Tool usage within Novelcrafter

Evaluates output messages related to tool usage within Novelcrafter.

Scenario                      #1   #2   #3   #4   #5   #6   #7   #8   #9   #10  Total
                              100  100  100  100  100  100  100  100  100  67   97%
Overall: 96.67%
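The totals on this page appear to be plain averages of the per-run scores. The sketch below checks that reading against the tool usage row (nine runs at 100 and one at 67). Averaging is an inference from the numbers, not something the page states, and the unrounded 200/3 value is an assumption used to explain the small gap.

```python
from statistics import mean

# Per-run scores as displayed: nine runs at 100 and one at 67.
displayed = [100] * 9 + [67]
print(round(mean(displayed), 2))  # 96.7

# Assumption: the failing run actually scored 2/3 (66.67%) and was rounded to 67 for display.
unrounded = [100] * 9 + [200 / 3]
print(round(mean(unrounded), 2))  # 96.67
```

Under that assumption the average matches the reported 96.67%, which suggests the displayed per-run scores are rounded while totals are computed from unrounded values.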