"2026-02-16T00:36:37.481035+00:00"2/16/2026, 12:36:37 AM
model
"meta-llama/llama-3.1-8b-instruct"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
440
moderation_latency
(null)
generation_time
10214
tokens_prompt
2319
tokens_completion
385
native_tokens_prompt
2198
native_tokens_completion
362
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00006568
router
(null)
provider_responses
0
id
"cmpl-f97dc8ed9b4d45d78fc25083e6f874ec"
status
200
is_byok
false
latency
427
endpoint_id
"474a9b4c-3ad1-403a-b84a-763335ae8f61"
provider_name
"Nebius"
model_permaslug
"meta-llama/llama-3.1-8b-instruct"
api_type
"completions"
id
"gen-1771202186-yaLuFFexRunfimQmsxjt"
upstream_id
"cmpl-f97dc8ed9b4d45d78fc25083e6f874ec"
total_cost
0.00006568
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Nebius"
Evaluation details
Result
0.0000%
Evaluator
Accuracy (recall)
Details
Matched: 0/6, FP: 6
Meta Data
parsedCount
6
matched
0
total
6
falsePositives
6
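The counters above (matched, total, falsePositives) can be reproduced with a small scoring routine. The following is a minimal sketch, not the benchmark's actual evaluator: it assumes a parsed finding counts as a match only when both its entry and its detail equal those of a not-yet-matched expected finding. The log itself only confirms that entry is compared ("entry mismatch"). The names `Finding` and `score` are hypothetical, and the parsed list contains only the three entry/detail pairs visible in this log rather than all six parsed findings.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Finding:
    """One (codex entry, detail) pair; hypothetical type for illustration."""
    entry: str
    detail: str


def score(expected, parsed):
    """Match parsed findings one-to-one against expected findings.

    Assumption: a match requires an identical (entry, detail) pair.
    Unmatched parsed findings are counted as false positives.
    """
    remaining = list(expected)  # expected findings not yet matched
    matched = 0
    false_positives = 0
    for finding in parsed:
        if finding in remaining:
            remaining.remove(finding)  # each expected item matches at most once
            matched += 1
        else:
            false_positives += 1
    total = len(expected)
    recall = matched / total if total else 0.0
    return {
        "matched": matched,
        "total": total,
        "falsePositives": false_positives,
        "recall": recall,
    }


# Expected findings, taken from the entry/detail pairs recorded in this log.
expected = [
    Finding("855", "ruler"),
    Finding("4415", "eyeColor"),
    Finding("2002", "origin"),
    Finding("4416", "eyeColor"),
    Finding("4416", "hair"),
    Finding("4414", "glasses"),
]

# Only the parsed findings whose entry and detail are visible in this log.
parsed = [
    Finding("4412", "hair"),
    Finding("4412", "eyeColor"),
    Finding("4414", "species"),
]

result = score(expected, parsed)
print(result)
```

With these three visible pairs the sketch reports 0 of 6 matched and three false positives, which is consistent with the 0.0000% recall in the log. The logged false-positive count of 6 covers all six parsed findings, not just the three reproduced here.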
expectedDetails
0
expected
entry
"855"
detail
"ruler"
summary
"Avaros is Prince Mammon's realm per the codex, but Eva says it is Prince Belphegor's domain."
status
"missed"
1
expected
entry
"4415"
detail
"eyeColor"
summary
"Lucien has one amber eye and one black eye (heterochromia), but the passage says both his amber eyes."
status
"missed"
nearMiss
parsed
entry
"4412"
detail
"eyeColor"
explanation
"Eva's eye Color is green in the codex, not the implied blue."
paragraph
3
substring
"glasses and her freckled face"
reason
"entry mismatch: got \"4412\", expected \"4415\""
2
expected
entry
"2002"
detail
"origin"
summary
"The Ledger of Debts originates from Avaros per its codex entry, but Eva says it came from Dymas."
status
"missed"
3
expected
entry
"4416"
detail
"eyeColor"
summary
"Detective Quinn has brown eyes per her codex entry, but the passage says blue eyes."
status
"missed"
nearMiss
parsed
entry
"4412"
detail
"eyeColor"
explanation
"Eva's eye Color is green in the codex, not the implied blue."
paragraph
3
substring
"glasses and her freckled face"
reason
"entry mismatch: got \"4412\", expected \"4416\""
4
expected
entry
"4416"
detail
"hair"
summary
"Detective Quinn has salt-and-pepper hair per her codex entry, but the passage says blonde hair."
status
"missed"
nearMiss
parsed
entry
"4412"
detail
"hair"
explanation
"Eva's hair is curly red, but the codex says she has curly red hair, implying it matches her species-rare red hair trait."
paragraph
3
substring
"her curly red hair falling"
reason
"entry mismatch: got \"4412\", expected \"4416\""
5
expected
entry
"4414"
detail
"glasses"
summary
"Eva wears round glasses per her codex entry, but the passage says square glasses."
status
"missed"
nearMiss
parsed
entry
"4414"
detail
"species"
explanation
"Lucien is half-demon and has amber/black heterochromatic contact/imbolc/lris of an eye, but specifically an eye with an amber iris, while another has a BOTH black iris value matching only VALUES provided later support for the text."
"Eva's hair is curly red, but the codex says she has curly red hair, implying it matches her species-rare red hair trait."
paragraph
3
substring
"her curly red hair falling"
status
"false_positive"
location
extractedText
"[not found in paragraph 3]"
inBounds
true
expectedText
"Her short blonde hair"
locationAccurate
false
closestExpected
entry
"4416"
detail
"hair"
summary
"Detective Quinn has salt-and-pepper hair per her codex entry, but the passage says blonde hair."
reason
"entry mismatch: got \"4412\", expected \"4416\""
1
parsed
entry
"4412"
detail
"eyeColor"
explanation
"Eva's eye Color is green in the codex, not the implied blue."
paragraph
3
substring
"glasses and her freckled face"
status
"false_positive"
location
extractedText
"[not found in paragraph 3]"
inBounds
true
expectedText
"both his amber eyes narrowed"
locationAccurate
false
closestExpected
entry
"4415"
detail
"eyeColor"
summary
"Lucien has one amber eye and one black eye (heterochromia), but the passage says both his amber eyes."
reason
"entry mismatch: got \"4412\", expected \"4415\""
2
parsed
entry
"4414"
detail
"species"
explanation
"Lucien is half-demon and has amber/black heterochromatic contact/imbolc/lris of an eye, but specifically an eye with an amber iris, while another has a BOTH black iris value matching only VALUES provided later support for the text."
paragraph
3
substring
"both her amber eyes narrowed in concentration"
status
"false_positive"
location
extractedText
"[not found in paragraph 3]"
inBounds
true
expectedText
"her square glasses"
locationAccurate
false
closestExpected
entry
"4414"
detail
"glasses"
summary
"Eva wears round glasses per her codex entry, but the passage says square glasses."
"Lucien's eyes are heterochromatic (amber and black), which matches codex, but there was an assumption mont error reading word \"and\" a negative normal amber presumed true/adudo currently viceview season."
paragraph
4
substring
"with both his amber eyes narrowed in concentration"
status
"false_positive"
location
extractedText
"[not found in paragraph 4]"
inBounds
true
expectedText
"her square glasses"
locationAccurate
false
closestExpected
entry
"4414"
detail
"glasses"
summary
"Eva wears round glasses per her codex entry, but the passage says square glasses."