"2026-02-16T10:57:25.040105+00:00"2/16/2026, 10:57:25 AM
model
"qwen/qwen3.5-plus-20260216"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
916
moderation_latency
(null)
generation_time
38648
tokens_prompt
3813
tokens_completion
1088
native_tokens_prompt
3907
native_tokens_completion
1128
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00427
router
(null)
provider_responses
0
id
"chatcmpl-17cac4b5-5035-92ab-ab00-6a157c0257c3"
status
200
is_byok
false
latency
916
endpoint_id
"71e0f6c3-7815-4734-92d3-8a04f3b0ca91"
provider_name
"Alibaba"
model_permaslug
"qwen/qwen3.5-plus-20260216"
api_type
"completions"
id
"gen-1771239405-E4UQxhAqDvTX46TDiRWE"
upstream_id
"chatcmpl-17cac4b5-5035-92ab-ab00-6a157c0257c3"
total_cost
0.00427
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Alibaba"
Evaluation details
Result
Evaluator
Details
Meta Data
90.9091%
Accuracy (recall)
Matched: 11/11, FP: 2
parsedCount
13
matched
11
total
11
falsePositives
2
expectedDetails
0
expected
entry
"856"
detail
"currentLocation"
summary
"The Veil Market is beneath Camden per the codex, but the passage says Shoreditch."
status
"matched"
matchedBy
entry
"856"
detail
"currentLocation"
explanation
"The text states the market is in Shoreditch, but the codex says it is beneath Camden."
paragraph
1
substring
"abandoned Tube station beneath Shoreditch"
location
extractedText
"abandoned Tube station beneath Shoreditch"
inBounds
true
expectedText
"the abandoned Tube station beneath Shoreditch"
locationAccurate
true
1
expected
entry
"4418"
detail
"eyeColor"
summary
"Nyx has violet eyes per the codex, but the passage says blue eyes."
status
"matched"
matchedBy
entry
"4418"
detail
"eyeColor"
explanation
"The text says Nyx has blue eyes, but the codex says they are faintly glowing violet."
paragraph
1
substring
"glowing blue eyes"
location
extractedText
"glowing blue eyes"
inBounds
true
expectedText
"their glowing blue eyes"
locationAccurate
true
2
expected
entry
"4417"
detail
"scar"
summary
"Tomás has a scar on his left forearm per the codex, but the passage says right forearm."
status
"matched"
matchedBy
entry
"4417"
detail
"scar"
explanation
"The text says Tomás's scar is on his right forearm, but the codex says it is on his left forearm."
paragraph
2
substring
"scarred right forearm"
location
extractedText
"scarred right forearm"
inBounds
true
expectedText
"his scarred right forearm"
locationAccurate
true
3
expected
entry
"4417"
detail
"birthplace"
summary
"Tomás was born in Seville per the codex, but the passage implies Madrid via his accent."
status
"matched"
matchedBy
entry
"4417"
detail
"birthplace"
explanation
"The text implies Tomás is from Madrid via his accent, but the codex states he was born in Seville, Spain."
paragraph
2
substring
"Madrid accent"
location
extractedText
"Madrid accent"
inBounds
true
expectedText
"his Madrid accent"
locationAccurate
true
4
expected
entry
"4420"
detail
"hair"
summary
"Kai has dyed electric-blue hair per the codex, but the passage says natural black hair."
status
"matched"
matchedBy
entry
"4420"
detail
"hair"
explanation
"The text says Kai has natural black hair, but the codex describes his hair as dyed electric-blue in an undercut."
paragraph
3
substring
"natural black hair"
location
extractedText
"natural black hair"
inBounds
true
expectedText
"His natural black hair"
locationAccurate
true
5
expected
entry
"4419"
detail
"eyeColor"
summary
"Maggie has sharp grey eyes per the codex, but the passage says green eyes."
status
"matched"
matchedBy
entry
"4419"
detail
"eyeColor"
explanation
"The text says Maggie has green eyes, but the codex says they are sharp grey."
paragraph
4
substring
"keen green eyes"
location
extractedText
"keen green eyes"
inBounds
true
expectedText
"her keen green eyes"
locationAccurate
true
6
expected
entry
"6002"
detail
"headquarters"
summary
"The Crimson Court is headquartered in Dymas per the codex, but Maggie says their base is in Avaros."
status
"matched"
matchedBy
entry
"6002"
detail
"headquarters"
explanation
"The text claims the Crimson Court's base is in Avaros, but the codex states their headquarters is in Dymas, Hel."
paragraph
4
substring
"Their base is in Avaros"
location
extractedText
"Their base is in Avaros"
inBounds
true
expectedText
"Their base is in Avaros"
locationAccurate
true
7
expected
entry
"2003"
detail
"material"
summary
"The Irathi War Gauntlet is blackened iron with red runes per the codex, but the passage says gleaming steel with blue runes."
status
"matched"
matchedBy
entry
"2003"
detail
"material"
explanation
"The text describes the gauntlet as gleaming steel, but the codex states it is made of blackened iron."
paragraph
5
substring
"gleaming steel gauntlet"
location
extractedText
"gleaming steel gauntlet"
inBounds
true
expectedText
"a gleaming steel gauntlet with blue runes"
locationAccurate
true
8
expected
entry
"2003"
detail
"origin"
summary
"The Irathi War Gauntlet is from Irath per the codex, but the merchant says it's from Dymas."
status
"matched"
matchedBy
entry
"2003"
detail
"origin"
explanation
"The merchant claims the gauntlet is from Dymas, but the codex states its origin is Irath."
paragraph
5
substring
"Straight from Dymas"
location
extractedText
"Straight from Dymas"
inBounds
true
expectedText
"Straight from Dymas"
locationAccurate
true
9
expected
entry
"2003"
detail
"creator"
summary
"The Irathi War Gauntlet was forged by a demon smith named Gorath per the codex, but the merchant attributes it to Prince Abaddon."
status
"matched"
matchedBy
entry
"2003"
detail
"creator"
explanation
"The merchant attributes the gauntlet to Prince Abaddon, but the codex states it was created by the demon smith Gorath."
paragraph
5
substring
"Prince Abaddon's finest work"
location
extractedText
"Prince Abaddon's finest work"
inBounds
true
expectedText
"Prince Abaddon's finest work"
locationAccurate
true
10
expected
entry
"2004"
detail
"faceMarkings"
summary
"The Veil Compass face is etched with sigils per the codex, but the passage says the face is smooth and unmarked."
status
"matched"
matchedBy
entry
"2004"
detail
"faceMarkings"
explanation
"The text says the compass face is smooth and unmarked, but the codex states it is etched with protective sigils."
paragraph
6
substring
"glass face smooth and unmarked"
location
extractedText
"glass face smooth and unmarked"
inBounds
true
expectedText
"its glass face smooth and unmarked"
locationAccurate
true
falsePositiveDetails
0
parsed
entry
"4419"
detail
"walkingStick"
explanation
"The text refers to Maggie's aid as a cane, but the codex specifically identifies it as a blackthorn walking stick (while 'cane' is close, the next violation regarding the gauntlet creator is more definitive; however, usually 'cane' vs 'walking stick' is acceptable unless specified. Wait, looking at 4419, the attribute is \"walkingStick\". Calling it a \"cane\" is likely a stylistic synonym, but let's check the Gauntlet description first. Actually, looking at the codex for 4419, the attribute is explicitly \"walkingStick\". In strict continuity, calling it a \"cane\" might be a minor flag, but the prompt asks for contradictions. A walking stick and a cane are often distinct in fiction (support vs style). However, the bigger errors are below. Let's stick to the clear contradictions.\nRe-evaluating \"cane\": The codex says \"blackthorn walking stick\". The text says \"cane\". This is borderline. I will skip this one as it could be considered a synonym unless the distinction is vital.\nLet's look at the Crimson Court location. Text says \"Their base is in Avaros\". Codex for Crimson Court (6002) says headquarters is \"Dymas, Hel\". This is a violation.\nLet's look at the Gauntlet material. Text says \"gleaming steel\". Codex (2003) says \"blackened iron\". Violation.\nLet's look at the Gauntlet runes. Text says \"blue runes\". Codex (2003) says \"glowing red runes\". Violation.\nLet's look at the Gauntlet origin. Merchant says \"Straight from Dymas\". Codex (2003) says origin is \"Irath\". Violation.\nLet's look at the Gauntlet creator. Merchant says \"Prince Abaddon's finest work\". Codex (2003) says creator is \"demon smith named Gorath\". Violation.\nLet's look at the Compass face. Text says \"smooth and unmarked\". Codex (2004) says \"face etched with protective sigils\". Violation."
paragraph
4
substring
"tapped her cane"
status
"false_positive"
location
extractedText
"tapped her cane"
inBounds
true
expectedText
"her keen green eyes"
locationAccurate
false
closestExpected
entry
"4419"
detail
"eyeColor"
summary
"Maggie has sharp grey eyes per the codex, but the passage says green eyes."
"The text states the market is in Shoreditch, but the codex says it is beneath Camden."
paragraph
1
substring
"abandoned Tube station beneath Shoreditch"
status
"true_positive"
matchedExpected
entry
"856"
detail
"currentLocation"
summary
"The Veil Market is beneath Camden per the codex, but the passage says Shoreditch."
location
extractedText
"abandoned Tube station beneath Shoreditch"
inBounds
true
expectedText
"the abandoned Tube station beneath Shoreditch"
locationAccurate
true
1
parsed
entry
"4418"
detail
"eyeColor"
explanation
"The text says Nyx has blue eyes, but the codex says they are faintly glowing violet."
paragraph
1
substring
"glowing blue eyes"
status
"true_positive"
matchedExpected
entry
"4418"
detail
"eyeColor"
summary
"Nyx has violet eyes per the codex, but the passage says blue eyes."
location
extractedText
"glowing blue eyes"
inBounds
true
expectedText
"their glowing blue eyes"
locationAccurate
true
2
parsed
entry
"4417"
detail
"scar"
explanation
"The text says Tomás's scar is on his right forearm, but the codex says it is on his left forearm."
paragraph
2
substring
"scarred right forearm"
status
"true_positive"
matchedExpected
entry
"4417"
detail
"scar"
summary
"Tomás has a scar on his left forearm per the codex, but the passage says right forearm."
location
extractedText
"scarred right forearm"
inBounds
true
expectedText
"his scarred right forearm"
locationAccurate
true
3
parsed
entry
"4417"
detail
"birthplace"
explanation
"The text implies Tomás is from Madrid via his accent, but the codex states he was born in Seville, Spain."
paragraph
2
substring
"Madrid accent"
status
"true_positive"
matchedExpected
entry
"4417"
detail
"birthplace"
summary
"Tomás was born in Seville per the codex, but the passage implies Madrid via his accent."
location
extractedText
"Madrid accent"
inBounds
true
expectedText
"his Madrid accent"
locationAccurate
true
4
parsed
entry
"4420"
detail
"hair"
explanation
"The text says Kai has natural black hair, but the codex describes his hair as dyed electric-blue in an undercut."
paragraph
3
substring
"natural black hair"
status
"true_positive"
matchedExpected
entry
"4420"
detail
"hair"
summary
"Kai has dyed electric-blue hair per the codex, but the passage says natural black hair."
location
extractedText
"natural black hair"
inBounds
true
expectedText
"His natural black hair"
locationAccurate
true
5
parsed
entry
"4419"
detail
"eyeColor"
explanation
"The text says Maggie has green eyes, but the codex says they are sharp grey."
paragraph
4
substring
"keen green eyes"
status
"true_positive"
matchedExpected
entry
"4419"
detail
"eyeColor"
summary
"Maggie has sharp grey eyes per the codex, but the passage says green eyes."
location
extractedText
"keen green eyes"
inBounds
true
expectedText
"her keen green eyes"
locationAccurate
true
6
parsed
entry
"4419"
detail
"walkingStick"
explanation
"The text refers to Maggie's aid as a cane, but the codex specifically identifies it as a blackthorn walking stick (while 'cane' is close, the next violation regarding the gauntlet creator is more definitive; however, usually 'cane' vs 'walking stick' is acceptable unless specified. Wait, looking at 4419, the attribute is \"walkingStick\". Calling it a \"cane\" is likely a stylistic synonym, but let's check the Gauntlet description first. Actually, looking at the codex for 4419, the attribute is explicitly \"walkingStick\". In strict continuity, calling it a \"cane\" might be a minor flag, but the prompt asks for contradictions. A walking stick and a cane are often distinct in fiction (support vs style). However, the bigger errors are below. Let's stick to the clear contradictions.\nRe-evaluating \"cane\": The codex says \"blackthorn walking stick\". The text says \"cane\". This is borderline. I will skip this one as it could be considered a synonym unless the distinction is vital.\nLet's look at the Crimson Court location. Text says \"Their base is in Avaros\". Codex for Crimson Court (6002) says headquarters is \"Dymas, Hel\". This is a violation.\nLet's look at the Gauntlet material. Text says \"gleaming steel\". Codex (2003) says \"blackened iron\". Violation.\nLet's look at the Gauntlet runes. Text says \"blue runes\". Codex (2003) says \"glowing red runes\". Violation.\nLet's look at the Gauntlet origin. Merchant says \"Straight from Dymas\". Codex (2003) says origin is \"Irath\". Violation.\nLet's look at the Gauntlet creator. Merchant says \"Prince Abaddon's finest work\". Codex (2003) says creator is \"demon smith named Gorath\". Violation.\nLet's look at the Compass face. Text says \"smooth and unmarked\". Codex (2004) says \"face etched with protective sigils\". Violation."
paragraph
4
substring
"tapped her cane"
status
"false_positive"
location
extractedText
"tapped her cane"
inBounds
true
expectedText
"her keen green eyes"
locationAccurate
false
closestExpected
entry
"4419"
detail
"eyeColor"
summary
"Maggie has sharp grey eyes per the codex, but the passage says green eyes."