NC Bench
Run: 2026-02-21-01KJ12QV8GJYNM0T310HCX6NAW
Run Details
Model: Ministral 8B
Test: Codex Red Herring (False Positive Detection)
Scenario: Short text (~524 words), big codex (51 entries)
Duration: 15.8s
Cost: $0.000563
Evaluation summary
Score | Evaluator
50.00% | Correct "no violations" response
1.41% | No hallucinated violations
25.7042% | Overall
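The Overall figure is consistent with a plain average of the two evaluator scores, and the 1.41% score is consistent with 1 / (1 + violationCount) for the violationCount of 70 listed under Evaluation details. The page does not state either formula, so the sketch below is an assumption that happens to reproduce the reported numbers. The function names are hypothetical.

```python
def hallucination_score(violation_count: int) -> float:
    """Assumed penalty: 1 / (1 + n), so zero reported violations scores 1.0."""
    return 1.0 / (1 + violation_count)


def overall_score(scores: list[float]) -> float:
    """Assumed aggregation: unweighted mean of the evaluator scores."""
    return sum(scores) / len(scores)


# Values taken from this run: 50.00% for the "no violations" evaluator,
# and violationCount = 70 for the hallucination evaluator.
none_response_score = 0.50
no_hallucination_score = hallucination_score(70)
overall = overall_score([none_response_score, no_hallucination_score])

print(f"{no_hallucination_score:.2%}")  # 1.41%
print(f"{overall:.4%}")                 # 25.7042%
```

Under these assumptions, a response consisting only of <violations-none/> would score 100% on both evaluators.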
Model input
System
You are a meticulous continuity editor for a fiction project. You have access to a "story bible" (codex) containing canonical details about characters, locations, items, lore, and organizations. Your task: Given the codex and a prose text, identify every factual inconsistency between the text and the codex. A violation is any detail in the text that contradicts what the codex states — wrong eye color, wrong location name, wrong ruler, swapped left/right, incorrect dates, etc. Be thorough. Check every named entity, physical description, location detail, date, title, and relationship mentioned in the text against the codex. Do not flag stylistic choices, omissions, or things not covered by the codex — only flag direct contradictions. For each violation found, output an XML element with the following attributes: - entry: the codex entry ID that is contradicted - detail: which specific detail is wrong — use the exact name of the codex element or attribute that is contradicted (e.g. "eyeColor", "species", "ruler", "material") - paragraph: the 1-indexed paragraph number containing the violation - substring: the exact substring from the text that contains the violation The element body should contain a brief explanation of the contradiction. Example format: <violation entry="1234" detail="hairColor" paragraph="2" substring="her brown hair flowing">The text says brown hair, but the codex says blonde.</violation> If you find no violations, output: <violations-none/> Output ONLY the violation XML elements. Do not include any other text, preamble, or explanation outside the XML.
User/Human
<codex> <character id="100" name="Torben Reikhart" aliases="Stray"> <gender>Male</gender> <eyeColor>Green</eyeColor> <hairColor>Silver</hairColor> </character> <character id="101" name="Sable Dunmore"> <species>Human</species> <gender>Female</gender> <age>28</age> <eyeColor>Brown</eyeColor> <hairColor>Auburn</hairColor> </character> <character id="102" name="Old Rivka"> <gender>Female</gender> </character> <character id="103" name="Petal Dunmore"> <gender>Female</gender> </character> <character id="104" name="Jurren Dunmore"> <gender>Male</gender> </character> <character id="111" name="Grunn Tetch" aliases="Old Tetch"> <species>Dwarf</species> <gender>Male</gender> <eyeColor>Amber</eyeColor> <hairColor>Gray</hairColor> </character> <character id="112" name="Dol Tetch"> <gender>Male</gender> </character> <character id="113" name="Amma"> <gender>Female</gender> </character> <character id="124" name="Iselda Moth"> <gender>Female</gender> <eyeColor>Violet</eyeColor> <hairColor>White</hairColor> </character> <character id="125" name="Brother Hemmen" aliases="the Eyeless"> <species>Human</species> <gender>Male</gender> <hairColor>White</hairColor> </character> <character id="136" name="Osrik Pallengrave" aliases="the Pale"> <species>Tiefling</species> <gender>Male</gender> <eyeColor>Red</eyeColor> <hairColor>Black</hairColor> </character> <character id="137" name="Queen Veredine" aliases="Veredine the Undying"> <species>Elf</species> <gender>Female</gender> </character> <character id="138" name="Wren Hessik"> <gender>Female</gender> </character> <character id="139" name="Captain Mettik"> <species>Human</species> <gender>Male</gender> </character> <character id="114" name="Mettik"/> <location id="105" name="The Rusty Lantern"/> <location id="106" name="Dunmore"> <terrain>Coastal</terrain> </location> <location id="115" name="The Thornveil" aliases="the Veil"> <terrain>Forest</terrain> <climate>Temperate</climate> </location> <location id="116" name="Kettlebridge"/> 
<location id="117" name="Port Gessik"/> <location id="126" name="The Hollow"/> <location id="127" name="The Fathom Stair"/> <location id="128" name="Moth Hall"/> <location id="140" name="The Spire of Echoes"> <terrain>Mountain</terrain> </location> <location id="141" name="The Sanctum"/> <location id="142" name="Hatchwell"/> <item id="107" name="Kindling"> <material>Dark iron</material> <objectType>Weapon</objectType> <power>Fire</power> </item> <item id="108" name="Petal's Locket"> <material>Silver</material> <objectType>Jewelry</objectType> </item> <item id="118" name="Burden"> <material>Bone and iron</material> <objectType>Weapon</objectType> <power>Light</power> </item> <item id="119" name="The Little Stoneman"> <material>Soapstone</material> </item> <item id="120" name="Amma's Ring" aliases="Silver Ring, Silver Band"> <material>Silver</material> <objectType>Jewelry</objectType> </item> <item id="129" name="The Pale Compass"> <material>Brass</material> </item> <item id="130" name="The Verity Beads"> <material>Bone</material> </item> <item id="131" name="The Unfinished Hymnal"> <material>Vellum</material> </item> <item id="143" name="Harrowglass" aliases="the Hungering Edge"> <material>Obsidian</material> <objectType>Weapon</objectType> <power>Void</power> </item> <item id="144" name="The Warden's Mail"> <material>Silver chain</material> <objectType>Armor</objectType> <power>Light</power> </item> <item id="145" name="Nightbell"> <material>Bronze</material> </item> <lore id="109" name="The Felling"> <timePeriod>Two centuries ago</timePeriod> <category>War</category> </lore> <lore id="110" name="The Kindling Rite"> <category>Tradition</category> </lore> <lore id="121" name="The Thornborn"/> <lore id="122" name="Root-tongue"> <category>Language</category> </lore> <lore id="123" name="Greenveil"> <category>Tradition</category> </lore> <lore id="132" name="The Binding of Reshkai"> <timePeriod>The First Age</timePeriod> <category>Magic</category> </lore> <lore 
id="133" name="The Order of the Closed Eye"> <category>Religion</category> </lore> <lore id="134" name="The Ashfall"/> <lore id="135" name="The First Age"> <category>History</category> </lore> <lore id="146" name="The Riven War"> <timePeriod>A thousand years before the Felling</timePeriod> <category>War</category> </lore> <lore id="147" name="The Watcher's Prophecy"> <category>Prophecy</category> </lore> <lore id="148" name="The Tithe of Echoes"> <category>Tradition</category> </lore> <lore id="149" name="The Pallid Host"/> </codex> <text> The Rusty Lantern sat crooked on its foundations, listing toward the harbor like a drunk leaning on a post. Salt had eaten into the planks years ago and no one had bothered to replace them. The ceiling beams were black with smoke. Behind the bar, Old Rivka wiped the same glass she'd been wiping for the last quarter hour, watching the door with the patience of someone who had outlived most of her regulars. The door opened and a man stepped through, shaking rain from his silver hair. Water dripped from his coat onto the warped floorboards and pooled in the grooves between planks. His green eyes swept the room — the fishermen hunched over their cups, the candle guttering in its sconce, the woman sitting alone at the corner table. He smelled of wet wool and the cold air off the harbor. Old Rivka raised her chin. "Stray," she called across the room. "Didn't think you'd show." He didn't answer. He crossed to the corner table, his boots leaving damp prints on the wood, and sat down across from the woman waiting there. Sable Dunmore was twenty-eight years old, human, and tired of waiting. Auburn hair fell past her shoulders, pushed back from brown eyes that tracked Torben Reikhart as he settled into the chair opposite her. On the table between them lay a sword — Kindling, she called it. A dark iron blade, the metal faintly warm even in the chill of the tavern. 
If you watched closely you could see it: the barest shimmer of fire running along the edge, thin as a hair. At her neck hung a silver locket. Petal's Locket, though she never called it that aloud. "It still has her hair inside," she said, touching it once. "That's all I have left of her." Torben waited. He'd learned when to be quiet. "My sister Petal," Sable said. "She was fourteen when The Felling reached Dunmore. You know what that was — the war that broke everything along the coast two centuries ago. Jurren — our father — he told me to take Petal and run. I did. She didn't make it." Torben didn't ask the obvious question. Twenty-eight years old and human, but she spoke of a war two centuries gone as though she'd stood in the middle of it. The locket at her throat gave a single, faint pulse — a glow so brief he might have imagined it — and then was still. Sable's hand moved to the locket again. "We couldn't give her a Kindling Rite. You know the tradition? You burn the dead with what they loved. Their tools. Their letters. You give it all to the fire so they carry it onward. But there was no body to burn. Nothing left of her but this locket and whatever I can remember." She looked at Torben. "Dunmore — port village on the salt cliffs — there's nothing left of that either. Just rock and old foundations." Torben said, "Then we'd better make it count." Sable picked up Kindling and sheathed it. They left the Rusty Lantern without another word. The rain had stopped. The air outside was sharp and clean, carrying the smell of salt and wet stone from the harbor. Somewhere down the waterfront a shutter banged in the wind. Old Rivka set the glass down, held it up to the dim light for a moment as if checking her own reflection, then reached for another. </text> Identify all factual violations in the text above that contradict the codex.
Model response
AI
<violation entry="100" detail="hairColor" paragraph="2" substring="silver hair">The codex states Torben Reikhart's hair is silver, but the text describes him shaking rain from his silver hair, implying it was previously a different color (likely black or dark). The codex does not contradict the presence of silver hair, but the phrasing could imply a change in state, which is not supported by the codex.</violation> <violation entry="100" detail="eyeColor" paragraph="2" substring="green eyes">The text says Torben's eyes are green, which matches the codex.</violation> <violation entry="101" detail="age" paragraph="3" substring="Sable Dunmore was twenty-eight years old">The codex lists Sable Dunmore's age as 28, so this is correct.</violation> <violation entry="101" detail="species" paragraph="3" substring="human">The codex states Sable Dunmore is human, so this is correct.</violation> <violation entry="101" detail="hairColor" paragraph="3" substring="Auburn hair">The codex states Sable Dunmore's hair is auburn, so this is correct.</violation> <violation entry="101" detail="eyeColor" paragraph="3" substring="brown eyes">The codex states Sable Dunmore's eyes are brown, so this is correct.</violation> <violation entry="102" detail="name" paragraph="2" substring="Old Rivka">The codex lists Old Rivka as a character with no gender specified, but the text does not contradict this.</violation> <violation entry="106" detail="name" paragraph="4" substring="Dunmore — port village on the salt cliffs">The codex lists Dunmore as a location with coastal terrain, so this is correct.</violation> <violation entry="107" detail="material" paragraph="4" substring="dark iron blade">The codex states Kindling is made of dark iron, so this is correct.</violation> <violation entry="107" detail="power" paragraph="4" substring="the barest shimmer of fire running along the edge">The codex states Kindling has a fire power, so this is correct.</violation> <violation entry="108" detail="name" 
paragraph="4" substring="Petal's Locket">The codex states Petal's Locket is the correct name, so this is correct.</violation> <violation entry="108" detail="material" paragraph="4" substring="a silver locket">The codex states Petal's Locket is made of silver, so this is correct.</violation> <violation entry="109" detail="timePeriod" paragraph="5" substring="two centuries ago">The codex states The Felling occurred two centuries ago, so this is correct.</violation> <violation entry="104" detail="name" paragraph="5" substring="Jurren">The codex lists Jurren Dunmore as a character with male gender, so this is correct.</violation> <violation entry="101" detail="gender" paragraph="3" substring="Sable Dunmore">No contradiction, Sable Dunmore is female as per the codex.</violation> <violation entry="103" detail="name" paragraph="5" substring="Petal">The codex lists Petal Dunmore as female, so this is correct.</violation> <violation entry="103" detail="age" paragraph="5" substring="She was fourteen">The codex does not specify Petal Dunmore's age, but the text does not contradict any codex entries.</violation> <violation entry="106" detail="name" paragraph="4" substring="Dunmore — there's nothing left of that either">The codex lists Dunmore as a location, so this is not a contradiction.</violation> <violation entry="110" detail="category" paragraph="5" substring="Kindling Rite">The codex lists The Kindling Rite as a tradition, so this is correct.</violation> <violation entry="109" detail="category" paragraph="5" substring="the war that broke everything along the coast">The codex states The Felling is a war, so this is correct.</violation> <violation entry="139" detail="name" paragraph="2" substring="Captain Mettik">The codex lists Captain Mettik as a character, but the text does not mention Captain Mettik in this passage.</violation> <violation entry="114" detail="name" paragraph="2" substring="Mettik">The codex lists Mettik without specifying a title or role, so the text's 
omission of Captain is acceptable.</violation> <violation entry="105" detail="name" paragraph="1" substring="The Rusty Lantern">The codex lists The Rusty Lantern as a location, so this is correct.</violation> <violation entry="124" detail="name" paragraph="2" substring="Iselda Moth">No mention in the text, so no contradiction.</violation> <violation entry="137" detail="name" paragraph="2" substring="Queen Veredine">No mention in the text, so no contradiction.</violation> <violation entry="136" detail="name" paragraph="2" substring="Osrik Pallengrave">No mention in the text, so no contradiction.</violation> <violation entry="111" detail="name" paragraph="2" substring="Old Tetch">No mention in the text, so no contradiction.</violation> <violation entry="128" detail="name" paragraph="2" substring="Moth Hall">No mention in the text, so no contradiction.</violation> <violation entry="127" detail="name" paragraph="2" substring="The Fathom Stair">No mention in the text, so no contradiction.</violation> <violation entry="140" detail="name" paragraph="2" substring="The Spire of Echoes">No mention in the text, so no contradiction.</violation> <violation entry="141" detail="name" paragraph="2" substring="The Sanctum">No mention in the text, so no contradiction.</violation> <violation entry="142" detail="name" paragraph="2" substring="Hatchwell">No mention in the text, so no contradiction.</violation> <violation entry="143" detail="name" paragraph="2" substring="Harrowglass">No mention in the text, so no contradiction.</violation> <violation entry="144" detail="name" paragraph="2" substring="The Warden's Mail">No mention in the text, so no contradiction.</violation> <violation entry="119" detail="name" paragraph="2" substring="The Little Stoneman">No mention in the text, so no contradiction.</violation> <violation entry="131" detail="name" paragraph="2" substring="The Unfinished Hymnal">No mention in the text, so no contradiction.</violation> <violation entry="129" 
detail="name" paragraph="2" substring="The Pale Compass">No mention in the text, so no contradiction.</violation> <violation entry="130" detail="name" paragraph="2" substring="The Verity Beads">No mention in the text, so no contradiction.</violation> <violation entry="118" detail="name" paragraph="2" substring="Burden">No mention in the text, so no contradiction.</violation> <violation entry="120" detail="name" paragraph="2" substring="Amma's Ring">No mention in the text, so no contradiction.</violation> <violation entry="123" detail="category" paragraph="2" substring="Greenveil">No mention in the text, so no contradiction.</violation> <violation entry="121" detail="name" paragraph="2" substring="The Thornborn">No mention in the text, so no contradiction.</violation> <violation entry="122" detail="category" paragraph="2" substring="Root-tongue">No mention in the text, so no contradiction.</violation> <violation entry="133" detail="category" paragraph="2" substring="The Order of the Closed Eye">No mention in the text, so no contradiction.</violation> <violation entry="134" detail="name" paragraph="2" substring="The Ashfall">No mention in the text, so no contradiction.</violation> <violation entry="135" detail="category" paragraph="2" substring="The First Age">No mention in the text, so no contradiction.</violation> <violation entry="132" detail="name" paragraph="2" substring="The Binding of Reshkai">No mention in the text, so no contradiction.</violation> <violation entry="147" detail="name" paragraph="2" substring="The Watcher's Prophecy">No mention in the text, so no contradiction.</violation> <violation entry="148" detail="name" paragraph="2" substring="The Tithe of Echoes">No mention in the text, so no contradiction.</violation> <violation entry="149" detail="name" paragraph="2" substring="The Pallid Host">No mention in the text, so no contradiction.</violation> <violation entry="117" detail="name" paragraph="6" substring="Port Gessik">No mention in the text, so 
no contradiction.</violation> <violation entry="116" detail="name" paragraph="6" substring="Kettlebridge">No mention in the text, so no contradiction.</violation> <violation entry="126" detail="name" paragraph="6" substring="The Hollow">No mention in the text, so no contradiction.</violation> <violation entry="113" detail="name" paragraph="6" substring="Amma">No mention in the text, so no contradiction.</violation> <violation entry="112" detail="name" paragraph="6" substring="Dol Tetch">No mention in the text, so no contradiction.</violation> <violation entry="138" detail="name" paragraph="6" substring="Wren Hessik">No mention in the text, so no contradiction.</violation> <violation entry="125" detail="name" paragraph="6" substring="Brother Hemmen">No mention in the text, so no contradiction.</violation> <violation entry="137" detail="name" paragraph="6" substring="Queen Veredine">No mention in the text, so no contradiction.</violation> <violation entry="111" detail="aliases" paragraph="2" substring="Old Tetch">No mention in the text, so no contradiction.</violation> <violation entry="104" detail="name" paragraph="5" substring="Jurren — our father">The codex lists Jurren Dunmore as male, so no contradiction.</violation> <violation entry="103" detail="name" paragraph="5" substring="Petal">The codex lists Petal Dunmore as female, so no contradiction.</violation> <violation entry="101" detail="species" paragraph="3" substring="human">The codex confirms Sable Dunmore is human, so no contradiction.</violation> <violation entry="100" detail="aliases" paragraph="2" substring="Stray">The codex lists Stray as an alias for Torben Reikhart, so no contradiction.</violation> <violation entry="100" detail="gender" paragraph="2" substring="a man">The codex confirms Torben Reikhart is male, so no contradiction.</violation> <violation entry="101" detail="gender" paragraph="3" substring="the woman">The codex confirms Sable Dunmore is female, so no contradiction.</violation> 
<violation entry="102" detail="gender" paragraph="2" substring="Old Rivka">The codex does not specify Old Rivka's gender, so no contradiction.</violation> <violation entry="106" detail="terrain" paragraph="4" substring="salt cliffs">The codex lists Dunmore as coastal, so "salt cliffs" is a reasonable description, no contradiction.</violation> <violation entry="105" detail="name" paragraph="1" substring="The Rusty Lantern">No contradiction, as it matches the codex.</violation> <violation entry="139" detail="name" paragraph="6" substring="Captain Mettik">No mention in this text passage, so no contradiction.</violation> <violation entry="145" detail="name" paragraph="6" substring="Nightbell">No mention in the text, so no contradiction.</violation> <violations-none/>
total_cost: 0.0005627999999999999
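The system prompt asks for each substring attribute to be an exact substring of the text. The following is a minimal sketch of how a response could be parsed and checked against that rule. It assumes the response is well-formed XML once wrapped in a root element, which real model output may not be. The helper names and the shortened text excerpt are illustrative and not part of the benchmark.

```python
import xml.etree.ElementTree as ET


def parse_violations(response: str) -> list[dict]:
    """Wrap the response in a root element and collect every <violation>."""
    root = ET.fromstring(f"<root>{response}</root>")
    return [
        {**el.attrib, "explanation": (el.text or "").strip()}
        for el in root.iter("violation")
    ]


def ungrounded(violations: list[dict], text: str) -> list[dict]:
    """Return violations whose quoted substring does not occur in the text."""
    return [v for v in violations if v.get("substring", "") not in text]


# Two elements copied from the model response above, plus the none tag.
sample_response = (
    '<violation entry="100" detail="eyeColor" paragraph="2" substring="green eyes">'
    "The text says Torben's eyes are green, which matches the codex.</violation> "
    '<violation entry="139" detail="name" paragraph="2" substring="Captain Mettik">'
    "The codex lists Captain Mettik as a character, but the text does not mention "
    "Captain Mettik in this passage.</violation> "
    "<violations-none/>"
)
# Shortened excerpt of the prose text, for illustration only.
text_excerpt = "His green eyes swept the room"

violations = parse_violations(sample_response)
missing = ungrounded(violations, text_excerpt)
print([v["entry"] for v in missing])  # ['139']
```

A check of this kind separates entries that quote the text but report no contradiction from entries that quote strings found only in the codex.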
Evaluation details
Result: 50.00%
Evaluator: Correct "no violations" response
Details: Response should contain <violations-none/> or equivalent phrasing
Meta Data: hasViolationTags=true, hasViolationsNoneTag=true

Result: 1.41%
Evaluator: No hallucinated violations
Details: Response must not contain any <violation> tags
Meta Data: hasViolationTags=true, responseLength=10814, violationCount=70

Overall: 25.7042%
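The metadata fields above (hasViolationTags, hasViolationsNoneTag, violationCount, responseLength) can be derived from the raw response with simple pattern matching. The following is a rough sketch of one way to do it, not the benchmark's actual evaluator. The regular expressions and the function name are assumptions.

```python
import re

# \b keeps "<violation" from matching the start of "<violations-none/>".
VIOLATION_TAG = re.compile(r"<violation\b")
NONE_TAG = re.compile(r"<violations-none\s*/>")


def response_meta(response: str) -> dict:
    """Compute the flags and counts reported in the Meta Data column."""
    count = len(VIOLATION_TAG.findall(response))
    return {
        "hasViolationTags": count > 0,
        "hasViolationsNoneTag": bool(NONE_TAG.search(response)),
        "violationCount": count,
        "responseLength": len(response),
    }


# A shortened response with the same shape as this run's output:
# a <violation> element followed by <violations-none/>.
sample = (
    '<violation entry="100" detail="eyeColor" paragraph="2" substring="green eyes">'
    "The text says Torben's eyes are green, which matches the codex.</violation> "
    "<violations-none/>"
)
meta = response_meta(sample)
print(meta["hasViolationTags"], meta["hasViolationsNoneTag"], meta["violationCount"])
# True True 1
```

In this run both flags are true, which fits the 50.00% result for the first evaluator: the response ends with <violations-none/> but also emits 70 <violation> elements. The page does not spell out the partial-credit rule.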