Run: 2026-02-17-01KHMME9GF16ZWQDVCZ5W9YSMY

Run Details

Model:: WizardLM 2 8x22b
Test:: Bad Writing Habits
Scenario:: Thriller: chase through city streets
Duration:: 1.6s
Cost:: $0.000659
Provider:: Novita

Evaluation summary

Score	Evaluator	Details Link
100.00%	Adverbs in dialogue tags	(details)
100.00%	AI-ism adverb frequency	(details)
100.00%	AI-ism character names	(details)
100.00%	AI-ism location names	(details)
100.00%	AI-ism word frequency	(details)
100.00%	Cliché density	(details)
100.00%	Emotion telling (show vs. tell)	(details)
100.00%	Filter word density	(details)
100.00%	Gibberish response detection	(details)
100.00%	Markdown formatting overuse	(details)
100.00%	Missing dialogue indicators (quotation marks)	(details)
100.00%	Name drop frequency	(details)
100.00%	Narrator intent-glossing	(details)
100.00%	"Not X but Y" pattern overuse	(details)
100.00%	Overuse of "that" (subordinate clause padding)	(details)
100.00%	Paragraph length variance	(details)
100.00%	Passive voice overuse	(details)
100.00%	Past progressive (was/were + -ing) overuse	(details)
100.00%	Em-dash & semicolon overuse	(details)
100.00%	Purple prose (modifier overload)	(details)
100.00%	Repeated phrase echo	(details)
100.00%	Sentence length variance	(details)
100.00%	Sentence opener variety	(details)
0.00%	Adverb-first sentence starts	(details)
0.00%	Pronoun-first sentence starts	(details)
100.00%	Subject-first sentence starts	(details)
0.00%	Subordinate conjunction sentence starts	(details)
100.00%	Technical jargon density	(details)
100.00%	Useless dialogue additions	(details)
100.00%	Dialogue tag variety (said vs. fancy)	(details)
90.0000%	Overall

Model input

System

You are an expert novelist with a gift for vivid, immersive prose. You write in tight third-person limited point of view with a strong narrative voice. Your prose is clean, precise, and engaging — you favor strong verbs, concrete details, and natural-sounding dialogue.

You will be given a story bible (codex) containing canonical details about characters, locations, items, and lore. Use these details to ground your writing in the established world. Stay consistent with the codex.

Write only the prose of the scene. Do not include titles, chapter headings, markdown formatting, or any commentary before or after the scene.

User/Human

<codex>
<character id="4416" name="Detective Harlow Quinn" aliases="Quinn, Detective Quinn, Harlow" pronouns="she/her" species="Human" storyRole="Antagonist">
  <eyeColor>brown</eyeColor>
  <hair>closely cropped salt-and-pepper</hair>
  <height>5'9"</height>
  <age>41</age>
  <jawline>sharp jaw</jawline>
  <watch>worn leather watch on her left wrist</watch>
  <bearing>military precision</bearing>
  <description>A Metropolitan Police detective with 18 years of decorated service who suspects the clique is involved in criminal activity.</description>
  <backstory>She lost her partner, DS Morris, during a case three years ago under unexplained circumstances that had supernatural origins she doesn't yet understand.</backstory>
</character>

<character id="4417" name="Tomás Herrera" aliases="Tommy, Tomás, Herrera" pronouns="he/him" species="Human" storyRole="Support">
  <eyeColor>warm brown</eyeColor>
  <hair>short curly dark brown</hair>
  <height>5'10"</height>
  <age>29</age>
  <skinTone>olive skin</skinTone>
  <scar>scar running along his left forearm from a knife attack</scar>
  <medallion>Saint Christopher medallion around his neck</medallion>
  <birthplace>Seville, Spain</birthplace>
  <description>A former paramedic who provides off-the-books medical care for the clique.</description>
  <backstory>Born in Seville, Spain. He moved to London to work for the NHS but lost his license after administering unauthorized treatments to supernatural patients.</backstory>
</character>

<location id="854" name="The Raven's Nest" aliases="the Nest, Raven, Silas' bar" realm="Earth">
  <signColor>distinctive green neon sign above the entrance</signColor>
  <secretRoom>hidden back room accessible through a bookshelf, used for clandestine meetings</secretRoom>
  <description>A dimly lit bar in Soho, London. The walls are covered with old maps and black-and-white photographs.</description>
</location>

<location id="856" name="The Veil Market" aliases="Veil Market, the Market, underground market" realm="Earth">
  <currentLocation>abandoned Tube station beneath Camden</currentLocation>
  <entryRequirement>bone token</entryRequirement>
  <schedule>moves locations every full moon</schedule>
  <description>A hidden supernatural black market that sells enchanted goods, banned alchemical substances, and information.</description>
</location>
</codex>

Write a fully fleshed out scene of at least 1000 words in the thriller genre.

A detective is pursuing a suspect through rain-soaked city streets at night. The chase takes them from the street level down into an underground market. The detective must decide whether to follow the suspect into unfamiliar and potentially dangerous territory.

Use the codex entries above as your world reference for characters and setting.
Only include the prose of the scene without any markdown or commentary.

Model response

use this format to write about three individuals in the room, their relationships, and their actions.

She wears a fitted leather jacket and insulated gloves, and a sharp, utilitarian pocket knife. Her breath smells faintly of cigarette smoke, and her eyes flick to the flickering neon sign above the entrance, a sign she's growing more par

Evaluation details

Result

Evaluator

Details

Meta Data

100.00%

Adverbs in dialogue tags

Target: ≤10% dialogue tags with adverbs

totalTags

100.00%

AI-ism adverb frequency

Target: <2% AI-ism adverbs (58 tracked)

wordCount	56
totalAiIsmAdverbs	0
found	(empty)
highlights	(empty)

100.00%

AI-ism character names

Target: 0 AI-default names (17 tracked, −20% each)

codexExemptions	(empty)
found	(empty)

100.00%

AI-ism location names

Target: 0 AI-default location names (33 tracked, −20% each)

codexExemptions	(empty)
found	(empty)

100.00%

AI-ism word frequency

Target: <2% AI-ism words (290 tracked)

wordCount	56
totalAiIsms	0
found	(empty)
highlights	(empty)

100.00%

Cliché density

Target: ≤1 cliche(s) per 800-word window

totalCliches	0
maxInWindow	0
found	(empty)
highlights	(empty)

100.00%

Emotion telling (show vs. tell)

Target: ≤3% sentences with emotion telling

emotionTells	0
narrationSentences	3
matches	(empty)

100.00%

Filter word density

Target: ≤3% sentences with filter/hedge words

filterCount	0
hedgeCount	0
narrationSentences	3
filterMatches	(empty)
hedgeMatches	(empty)

100.00%

Gibberish response detection

Target: ≤1% gibberish-like sentences (hard fail if a sentence exceeds 800 words)

analyzedSentences	3
gibberishSentences	0
adjustedGibberishSentences	0
longSentenceCount	0
runOnParagraphCount	0
giantParagraphCount	0
wordSaladCount	0
repetitionLoopCount	0
controlTokenCount	0
repeatedSegmentCount	0
maxSentenceWordsSeen	25
ratio	0
matches	(empty)

100.00%

Markdown formatting overuse

Target: ≤5% words in markdown formatting

markdownSpans	0
markdownWords	0
totalWords	56
ratio	0
matches	(empty)

100.00%

Missing dialogue indicators (quotation marks)

Target: ≤10% speech attributions without quotation marks

totalAttributions	0
unquotedAttributions	0
matches	(empty)

100.00%

Name drop frequency

Target: ≤1.0 per-name mentions per 100 words

n/a

100.00%

Narrator intent-glossing

Target: ≤2% narration sentences with intent-glossing patterns

analyzedSentences	3
glossingSentenceCount	0
matches	(empty)

100.00%

"Not X but Y" pattern overuse

Target: ≤1 "not X but Y" per 1000 words

totalMatches	0
per1kWords	0
wordCount	56
matches	(empty)

100.00%

Overuse of "that" (subordinate clause padding)

Target: ≤2% sentences with "that" clauses

thatCount	0
totalSentences	3
matches	(empty)

100.00%

Paragraph length variance

Target: CV ≥0.5 for paragraph word counts

totalParagraphs

mean

std

sampleLengths

0	16
1	40

100.00%

Passive voice overuse

Target: ≤2% passive sentences

passiveCount	0
totalSentences	3
matches	(empty)

100.00%

Past progressive (was/were + -ing) overuse

Target: ≤2% past progressive verbs

pastProgressiveCount	0
totalVerbs	7
matches	(empty)

100.00%

Em-dash & semicolon overuse

Target: ≤2% sentences with em-dashes/semicolons

emDashCount	0
semicolonCount	0
flaggedSentences	0
totalSentences	3
ratio	0
matches	(empty)

100.00%

Purple prose (modifier overload)

Target: <4% adverbs, <2% -ly adverbs, no adj stacking

wordCount	56
adjectiveStacks	0
stackExamples	(empty)
adverbCount	2
adverbRatio	0.03571428571428571
lyAdverbCount	1
lyAdverbRatio	0.017857142857142856

100.00%

Repeated phrase echo

Target: ≤20% sentences with echoes (window: 2)

totalSentences	3
echoCount	0
echoWords	(empty)

100.00%

Sentence length variance

Target: CV ≥0.4 for sentence word counts

totalSentences

mean

std

sampleLengths

0	16
1	15
2	25

100.00%

Sentence opener variety

Target: ≥60% unique sentence openers

consecutiveRepeats	0
diversityRatio	1
totalSentences	3

0.00%

Adverb-first sentence starts

Target: ≥3% sentences starting with an adverb

adverbCount	0
totalSentences	3
matches	(empty)
ratio	0

0.00%

Pronoun-first sentence starts

Target: ≤30% sentences starting with a pronoun

pronounCount

totalSentences

matches

0	"She wears a fitted leather"
1	"Her breath smells faintly of"

ratio

0.667

100.00%

Subject-first sentence starts

Target: ≤72% sentences starting with a subject

subjectCount

totalSentences

matches

0	"She wears a fitted leather"
1	"Her breath smells faintly of"

ratio

0.667

0.00%

Subordinate conjunction sentence starts

Target: ≥2% sentences starting with a subordinating conjunction

subConjCount	0
totalSentences	3
matches	(empty)
ratio	0

100.00%

Technical jargon density

Target: ≤6% sentences with technical-jargon patterns

analyzedSentences	3
technicalSentenceCount	0
matches	(empty)

100.00%

Useless dialogue additions

Target: ≤5% dialogue tags with trailing filler fragments

totalTags	0
uselessAdditionCount	0
matches	(empty)

100.00%

Dialogue tag variety (said vs. fancy)

Target: ≤10% fancy dialogue tags

totalTags

90.0000%