NC Bench
Run: 2026-05-27-01KSMEYMDSWWJCKF0RYZ4KRXA4
Run Details
Model: Skyfall 36B V2
Test: Codex Violation Detection
Scenario: 5 codex entries
Duration: 5.2s
Cost: $0.000627
Provider: Parasail
Evaluation summary
Score      Evaluator
10.00%     Accuracy (recall)
40.00%     Precision
100.00%    Structural validity
50.0000%   Overall
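The overall figure is consistent with an unweighted mean of the three evaluator scores shown in the summary. The page does not state the aggregation rule, so the equal weighting below is an assumption that merely reproduces the displayed number.

```python
from statistics import mean

# Evaluator scores as displayed in the evaluation summary (percentages).
scores = {
    "Accuracy (recall)": 10.00,
    "Precision": 40.00,
    "Structural validity": 100.00,
}

# Assumption: every evaluator carries the same weight.
overall = mean(scores.values())
print(f"{overall:.4f}%")  # prints "50.0000%"
```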
Model input
System
You are a meticulous continuity editor for a fiction project. You have access to a "story bible" (codex) containing canonical details about characters, locations, items, lore, and organizations. Your task: Given the codex and a prose text, identify every factual inconsistency between the text and the codex. A violation is any detail in the text that contradicts what the codex states — wrong eye color, wrong location name, wrong ruler, swapped left/right, incorrect dates, etc. Be thorough. Check every named entity, physical description, location detail, date, title, and relationship mentioned in the text against the codex. Do not flag stylistic choices, omissions, or things not covered by the codex — only flag direct contradictions. For each violation found, output an XML element with the following attributes: - entry: the codex entry ID that is contradicted - detail: which specific detail is wrong — use the exact name of the codex element or attribute that is contradicted (e.g. "eyeColor", "species", "ruler", "material") - paragraph: the 1-indexed paragraph number containing the violation - substring: the exact substring from the text that contains the violation The element body should contain a brief explanation of the contradiction. Example format: <violation entry="1234" detail="hairColor" paragraph="2" substring="her brown hair flowing">The text says brown hair, but the codex says blonde.</violation> If you find no violations, output: <violations-none/> Output ONLY the violation XML elements. Do not include any other text, preamble, or explanation outside the XML.
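The output format requested by the system prompt can be read back with a small parser. This is a minimal sketch, not the benchmark's actual parser: the function name `parse_violations` and the returned dictionary shape are hypothetical, and it assumes the response contains only sibling XML elements with no surrounding prose.

```python
import xml.etree.ElementTree as ET

# Attributes the system prompt asks for on every <violation> element.
REQUIRED = ("entry", "detail", "paragraph", "substring")


def parse_violations(response: str) -> list[dict]:
    """Parse a model response into a list of violation records."""
    # Wrap the siblings in a synthetic root so they form one XML document.
    root = ET.fromstring(f"<root>{response}</root>")
    if root.find("violations-none") is not None:
        return []
    records = []
    for el in root.iter("violation"):
        missing = [name for name in REQUIRED if name not in el.attrib]
        if missing:
            raise ValueError(f"violation is missing attributes: {missing}")
        records.append({
            "entry": el.attrib["entry"],
            "detail": el.attrib["detail"],
            "paragraph": int(el.attrib["paragraph"]),
            "substring": el.attrib["substring"],
            "explanation": (el.text or "").strip(),
        })
    return records


# The example element from the system prompt.
example = (
    '<violation entry="1234" detail="hairColor" paragraph="2" '
    'substring="her brown hair flowing">The text says brown hair, '
    'but the codex says blonde.</violation>'
)
parsed_example = parse_violations(example)
print(parsed_example[0]["detail"], parsed_example[0]["paragraph"])
```

A response that is not well-formed XML raises `xml.etree.ElementTree.ParseError`, which a caller could count against structural validity.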
User/Human
<codex> <character id="4412" name="Aurora Carter" aliases="Rory, Laila, Aurora, Carter, Malphora" pronouns="she/her" species="Human" storyRole="Protagonist"> <eyeColor>bright blue</eyeColor> <hair>straight shoulder-length black</hair> <height>5'6"</height> <age>25</age> <scar>small crescent-shaped scar on her left wrist from a childhood accident</scar> <description>Aurora, referred to by her friends as Rory, is cool-headed, intelligent, and known for quick out-of-the-box thinking.</description> <backstory>Born in Cardiff to an Irish barrister father (Brendan Carter) and Welsh teacher mother (Jennifer Carter née Ellis), Rory reluctantly pursued Pre-Law at Cardiff University. She fled to London at the behest of her childhood friend Eva after leaving an abusive ex named Evan. By day, Rory works part-time as a delivery person for Yu-Fei Cheung's Golden Empress restaurant. She lives in a flat above Silas' bar.</backstory> </character> <character id="4413" name="Silas Blackwood" aliases="Silas, Si, the Spymaster" pronouns="he/him" species="Human" storyRole="Mentor"> <eyeColor>hazel</eyeColor> <hair>grey-streaked auburn</hair> <height>6'1"</height> <age>58</age> <beard>neatly trimmed, matching his grey-streaked auburn hair</beard> <limp>slight limp in his left leg from an old knee injury</limp> <signetRing>silver signet ring, always worn on his right hand</signetRing> <description>Silas is a retired intelligence operative turned bar owner. He carries himself with quiet authority.</description> <backstory>Former MI6 field agent who retired after a botched operation in Prague left him with a permanent knee injury. He opened "The Raven's Nest" bar in Soho as a front for his network of contacts.</backstory> </character> <location id="853" name="Dymas (Gluttony)" aliases="Dymas, gluttony, Dymasian" realm="Hel" sin="Gluttony"> <ruler>Prince Belphegor</ruler> <skyColor>warm amber</skyColor> <description>A place of excess and indulgence. 
Sprawling vineyards, orchards, and gardens provide exotic ingredients for master chefs — often helbound souls contracted from Earth. Grand feasts and culinary competitions are common.</description> </location> <location id="854" name="The Raven's Nest" aliases="the Nest, Raven, Silas' bar" realm="Earth"> <signColor>distinctive green neon sign above the entrance</signColor> <secretRoom>hidden back room accessible through a bookshelf, used for clandestine meetings</secretRoom> <description>A dimly lit bar in Soho, London. The walls are covered with old maps and black-and-white photographs.</description> </location> <item id="2001" name="Heartstone Pendant" aliases="the pendant, Heartstone" type="Artifact" origin="Dymas"> <gemstoneColor>deep crimson</gemstoneColor> <chainMaterial>silver chain</chainMaterial> <size>roughly the size of a thumbnail</size> <behavior>pulses faintly with warmth when near a Hel portal</behavior> <glow>faint inner glow</glow> <description>Given to Aurora by an unknown benefactor.</description> </item> </codex> <text> Rory pushed through the door of The Raven's Nest, her green eyes scanning the room for Silas. The old spy was behind the counter, polishing a glass with his usual calm demeanor. "You're late," Silas said, running a hand through his dark brown beard. He set the glass down and limped around to her side. Aurora pulled the Heartstone Pendant from her jacket — the golden chain catching the low light — and placed it on the bar top. "I found something in Dymas. Prince Mammon's realm is... unsettling, to say the least. The feasts never end." Silas studied the pendant. "And you're sure this came from there?" "Positive." Rory slid onto a barstool, her Half-Fae senses tingling as the pendant pulsed. "Eva would know more about its origin." </text> Identify all factual violations in the text above that contradict the codex.
Model response
Messages
Meta
AI
<violation entry="4412" detail="eyeColor" paragraph="1" substring="green eyes">The text says Rory has green eyes, but the codex says they are bright blue.</violation> <violation entry="4413" detail="hair" paragraph="1" substring="dark brown beard">The text says Silas has a brown beard, but the codex describes his hair and beard as grey-streaked auburn.</violation> <violation entry="4413" detail="limp" paragraph="2" substring="limped around to her side">The text has Silas limping, but the codex states he has a slight limp in his left leg. Limping would imply a more noticeable limp.</violation> <violation entry="2001" detail="gemstoneColor" paragraph="3" substring="golden chain">The text describes the chain of the HeartstonePendant as golden, but the codex specifies it is a silver chain. </violation> <violation entry="853" detail="ruler" paragraph="3" substring="Prince Mammon's realm">The text says Prince Mammon rules Dymas, but the codex lists Prince Belphegor as the ruler. </violation>
created_at
"2026-05-27T10:16:15.046Z"
5/27/2026, 10:16:15 AM
model
"thedrummer/skyfall-36b-v2"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
213
moderation_latency
(null)
generation_time
4917
tokens_prompt
1441
tokens_completion
254
native_tokens_prompt
1381
native_tokens_completion
272
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
1168
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
service_tier
(null)
usage
0.00062675
router
(null)
provider_responses
0
endpoint_id
"1eb01ded-ae11-49e6-8aa6-3067584070bd"
id
"chatcmpl-9559a1374ebcf2fa"
is_byok
false
latency
213
model_permaslug
"thedrummer/skyfall-36b-v2"
provider_name
"Parasail"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.15.0; linux; x64))"
http_referer
(null)
request_id
"req-1779876975-x63m8a4J3WcDRLXUeScw"
session_id
(null)
api_type
"completions"
id
"gen-1779876975-XlalpkNF9XTTdx7Kq5WA"
upstream_id
"chatcmpl-9559a1374ebcf2fa"
total_cost
0.00062675
cache_discount
0.0003504
upstream_inference_cost
0
provider_name
"Parasail"
response_cache_source_id
(null)
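A few derived figures can be computed from the metadata above. This sketch assumes that `latency` and `generation_time` are reported in milliseconds and that `native_tokens_cached` counts prompt tokens served from cache; the page does not state either, so treat the results as rough estimates.

```python
# Values copied from the response metadata.
latency_ms = 213
generation_time_ms = 4917
native_tokens_prompt = 1381
native_tokens_completion = 272
native_tokens_cached = 1168

# Assumption: both timing fields are in milliseconds.
total_s = (latency_ms + generation_time_ms) / 1000
tokens_per_s = native_tokens_completion / (generation_time_ms / 1000)

# Assumption: cached tokens are a subset of the native prompt tokens.
cache_hit_ratio = native_tokens_cached / native_tokens_prompt

print(f"total time: {total_s:.2f}s")
print(f"completion throughput: {tokens_per_s:.1f} tokens/s")
print(f"prompt cache hit ratio: {cache_hit_ratio:.1%}")
```

Under these assumptions the summed time comes out slightly below the 5.2s duration shown in the run details, which would be expected if the displayed duration also includes client-side overhead.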
Evaluation details
Result
Evaluator
Details
Meta Data
10.00%
Accuracy (recall)
Matched: 2/5, FP: 3
parsedCount
5
matched
2
total
5
falsePositives
3
expectedDetails
0
expected
entry
"4412"
detail
"eyeColor"
summary
"Aurora has bright blue eyes per her codex entry, but the passage says she has green eyes."
status
"matched"
matchedBy
entry
"4412"
detail
"eyeColor"
explanation
"The text says Rory has green eyes, but the codex says they are bright blue."
paragraph
1
substring
"green eyes"
location
extractedText
"green eyes"
inBounds
true
expectedText
"her green eyes scanning the room"
locationAccurate
true
1
expected
entry
"4412"
detail
"species"
summary
"Aurora is Human per her codex entry, but the passage refers to her as Half-Fae."
status
"missed"
2
expected
entry
"2001"
detail
"chainMaterial"
summary
"The Heartstone Pendant has a silver chain per its codex entry, but the passage says golden chain."
status
"missed"
nearMiss
parsed
entry
"2001"
detail
"gemstoneColor"
explanation
"The text describes the chain of the HeartstonePendant as golden, but the codex specifies it is a silver chain."
paragraph
3
substring
"golden chain"
reason
"detail mismatch: got \"gemstoneColor\", expected \"chainMaterial\""
3
expected
entry
"4413"
detail
"beard"
summary
"Silas's beard matches his grey-streaked auburn hair per codex, but the passage says dark brown beard."
status
"missed"
nearMiss
parsed
entry
"4413"
detail
"hair"
explanation
"The text says Silas has a brown beard, but the codex describes his hair and beard as grey-streaked auburn."
paragraph
1
substring
"dark brown beard"
reason
"detail mismatch: got \"hair\", expected \"beard\""
4
expected
entry
"853"
detail
"ruler"
summary
"Dymas is Prince Belphegor's realm per the codex, but the passage says Prince Mammon's realm."
status
"matched"
matchedBy
entry
"853"
detail
"ruler"
explanation
"The text says Prince Mammon rules Dymas, but the codex lists Prince Belphegor as the ruler."
paragraph
3
substring
"Prince Mammon's realm"
location
extractedText
"Prince Mammon's realm"
inBounds
true
expectedText
"Prince Mammon's realm"
locationAccurate
true
falsePositiveDetails
0
parsed
entry
"4413"
detail
"hair"
explanation
"The text says Silas has a brown beard, but the codex describes his hair and beard as grey-streaked auburn."
paragraph
1
substring
"dark brown beard"
status
"false_positive"
location
extractedText
"[not found in paragraph 1]"
inBounds
true
expectedText
"his dark brown beard"
locationAccurate
false
closestExpected
entry
"4413"
detail
"beard"
summary
"Silas's beard matches his grey-streaked auburn hair per codex, but the passage says dark brown beard."
reason
"detail mismatch: got \"hair\", expected \"beard\""
1
parsed
entry
"4413"
detail
"limp"
explanation
"The text has Silas limping, but the codex states he has a slight limp in his left leg. Limping would imply a more noticeable limp."
paragraph
2
substring
"limped around to her side"
status
"false_positive"
location
extractedText
"limped around to her side"
inBounds
true
expectedText
"his dark brown beard"
locationAccurate
false
closestExpected
entry
"4413"
detail
"beard"
summary
"Silas's beard matches his grey-streaked auburn hair per codex, but the passage says dark brown beard."
reason
"detail mismatch: got \"limp\", expected \"beard\""
2
parsed
entry
"2001"
detail
"gemstoneColor"
explanation
"The text describes the chain of the HeartstonePendant as golden, but the codex specifies it is a silver chain."
paragraph
3
substring
"golden chain"
status
"false_positive"
location
extractedText
"golden chain"
inBounds
true
expectedText
"the golden chain catching the low light"
locationAccurate
true
closestExpected
entry
"2001"
detail
"chainMaterial"
summary
"The Heartstone Pendant has a silver chain per its codex entry, but the passage says golden chain."
reason
"detail mismatch: got \"gemstoneColor\", expected \"chainMaterial\""
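The matched, missed, and false-positive counts above can be reproduced by pairing expected and reported violations on exact `(entry, detail)` equality. This is a sketch of that rule as implied by the "detail mismatch" reasons, not the evaluator's actual code; the function name `match_violations` is hypothetical. It does not derive the 10.00% accuracy score, because the page does not show how that score is computed from these counts.

```python
# (entry, detail) pairs taken from the expected list above.
expected = [
    ("4412", "eyeColor"),
    ("4412", "species"),
    ("2001", "chainMaterial"),
    ("4413", "beard"),
    ("853", "ruler"),
]

# (entry, detail) pairs reported by the model.
reported = [
    ("4412", "eyeColor"),
    ("4413", "hair"),
    ("4413", "limp"),
    ("2001", "gemstoneColor"),
    ("853", "ruler"),
]


def match_violations(expected, reported):
    """Pair each expected violation with at most one reported violation."""
    remaining = list(reported)
    matched, missed = [], []
    for key in expected:
        if key in remaining:
            remaining.remove(key)  # each report can satisfy one expectation
            matched.append(key)
        else:
            missed.append(key)
    # Whatever is left over was reported but never expected.
    return matched, missed, remaining


matched, missed, false_positives = match_violations(expected, reported)
print(f"Matched: {len(matched)}/{len(expected)}, FP: {len(false_positives)}")
```

Exact matching on `detail` is strict: the reports for `gemstoneColor` and `hair` quote the right substring but name the wrong codex element, so each one counts as both a miss and a false positive.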
40.00%
Precision
True positives: 2/5
totalReported
5
truePositives
2
reportedDetails
0
parsed
entry
"4412"
detail
"eyeColor"
explanation
"The text says Rory has green eyes, but the codex says they are bright blue."
paragraph
1
substring
"green eyes"
status
"true_positive"
matchedExpected
entry
"4412"
detail
"eyeColor"
summary
"Aurora has bright blue eyes per her codex entry, but the passage says she has green eyes."
location
extractedText
"green eyes"
inBounds
true
expectedText
"her green eyes scanning the room"
locationAccurate
true
1
parsed
entry
"4413"
detail
"hair"
explanation
"The text says Silas has a brown beard, but the codex describes his hair and beard as grey-streaked auburn."
paragraph
1
substring
"dark brown beard"
status
"false_positive"
location
extractedText
"[not found in paragraph 1]"
inBounds
true
expectedText
"his dark brown beard"
locationAccurate
false
closestExpected
entry
"4413"
detail
"beard"
summary
"Silas's beard matches his grey-streaked auburn hair per codex, but the passage says dark brown beard."
reason
"detail mismatch: got \"hair\", expected \"beard\""
2
parsed
entry
"4413"
detail
"limp"
explanation
"The text has Silas limping, but the codex states he has a slight limp in his left leg. Limping would imply a more noticeable limp."
paragraph
2
substring
"limped around to her side"
status
"false_positive"
location
extractedText
"limped around to her side"
inBounds
true
expectedText
"his dark brown beard"
locationAccurate
false
closestExpected
entry
"4413"
detail
"beard"
summary
"Silas's beard matches his grey-streaked auburn hair per codex, but the passage says dark brown beard."
reason
"detail mismatch: got \"limp\", expected \"beard\""
3
parsed
entry
"2001"
detail
"gemstoneColor"
explanation
"The text describes the chain of the HeartstonePendant as golden, but the codex specifies it is a silver chain."
paragraph
3
substring
"golden chain"
status
"false_positive"
location
extractedText
"golden chain"
inBounds
true
expectedText
"the golden chain catching the low light"
locationAccurate
true
closestExpected
entry
"2001"
detail
"chainMaterial"
summary
"The Heartstone Pendant has a silver chain per its codex entry, but the passage says golden chain."
reason
"detail mismatch: got \"gemstoneColor\", expected \"chainMaterial\""
4
parsed
entry
"853"
detail
"ruler"
explanation
"The text says Prince Mammon rules Dymas, but the codex lists Prince Belphegor as the ruler."
paragraph
3
substring
"Prince Mammon's realm"
status
"true_positive"
matchedExpected
entry
"853"
detail
"ruler"
summary
"Dymas is Prince Belphegor's realm per the codex, but the passage says Prince Mammon's realm."
location
extractedText
"Prince Mammon's realm"
inBounds
true
expectedText
"Prince Mammon's realm"
locationAccurate
true
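The precision score follows directly from the counts listed for this evaluator: true positives divided by the total number of reported violations. A minimal sketch, with a hypothetical `precision` helper that returns 0.0 when nothing was reported (the page does not say how the evaluator handles that case):

```python
def precision(true_positives: int, total_reported: int) -> float:
    """Fraction of reported violations that matched an expected one."""
    if total_reported == 0:
        # Assumption: an empty report scores zero rather than raising.
        return 0.0
    return true_positives / total_reported


# Counts from the precision details: truePositives = 2, totalReported = 5.
score = precision(true_positives=2, total_reported=5)
print(f"{score:.2%}")  # prints "40.00%"
```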
100.00%
Structural validity
Parsed: 5, Well-formed: 5/5
parsedCount
5
wellFormedScore
5
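One way to arrive at a well-formedness ratio like 5/5 is to check each parsed violation for the four attributes the system prompt requires. The criteria below (non-empty `entry`, `detail`, and `substring`, plus a positive integer `paragraph`) are an assumption for illustration; the page does not list the evaluator's actual checks.

```python
# The five violations reported by the model, as listed in the details above.
reported = [
    {"entry": "4412", "detail": "eyeColor", "paragraph": 1, "substring": "green eyes"},
    {"entry": "4413", "detail": "hair", "paragraph": 1, "substring": "dark brown beard"},
    {"entry": "4413", "detail": "limp", "paragraph": 2, "substring": "limped around to her side"},
    {"entry": "2001", "detail": "gemstoneColor", "paragraph": 3, "substring": "golden chain"},
    {"entry": "853", "detail": "ruler", "paragraph": 3, "substring": "Prince Mammon's realm"},
]


def is_well_formed(record: dict) -> bool:
    """Hypothetical check: required text fields present, paragraph is 1-indexed."""
    has_text = all(record.get(key) for key in ("entry", "detail", "substring"))
    paragraph = record.get("paragraph")
    return has_text and isinstance(paragraph, int) and paragraph >= 1


well_formed = sum(is_well_formed(r) for r in reported)
validity = well_formed / len(reported) if reported else 0.0
print(f"Parsed: {len(reported)}, Well-formed: {well_formed}/{len(reported)}")
print(f"{validity:.2%}")
```

Note that this check says nothing about correctness: a record can be well-formed while still naming the wrong detail or paragraph.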
50.0000%
Overall