"2026-02-16T00:38:49.347419+00:00"2/16/2026, 12:38:49 AM
model
"meta-llama/llama-3.1-8b-instruct"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
610
moderation_latency
(null)
generation_time
29876
tokens_prompt
2319
tokens_completion
1406
native_tokens_prompt
2198
native_tokens_completion
1180
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00011476
router
(null)
provider_responses
0
id
"cmpl-dd972be157b84e9c94b0c495ed489e03"
status
200
is_byok
false
latency
597
endpoint_id
"474a9b4c-3ad1-403a-b84a-763335ae8f61"
provider_name
"Nebius"
model_permaslug
"meta-llama/llama-3.1-8b-instruct"
api_type
"completions"
id
"gen-1771202299-WD5WkVYHFGGdGgtOQyDe"
upstream_id
"cmpl-dd972be157b84e9c94b0c495ed489e03"
total_cost
0.00011476
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Nebius"
Evaluation details
Result
0.0000%
Evaluator
Accuracy (recall)
Details
Matched: 3/6, FP: 7
Meta Data
parsedCount
10
matched
3
total
6
falsePositives
7
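The summary counts above are internally consistent: parsedCount (10) equals matched (3) plus falsePositives (7). As a minimal sketch, assuming the conventional definitions of recall (matched / total) and precision (matched / parsed), the counts can be cross-checked as below. The evaluator's actual scoring rule does not appear in this record, so the reported result may follow a different rule; the function name `detection_metrics` is hypothetical.

```python
def detection_metrics(matched: int, total: int, false_positives: int) -> dict:
    """Recall and precision from summary counts (assumed textbook definitions,
    not necessarily the rule used by the evaluator in this record)."""
    parsed = matched + false_positives           # every parsed item is either matched or a false positive
    recall = matched / total if total else 0.0   # share of expected items that were found
    precision = matched / parsed if parsed else 0.0  # share of parsed items that were correct
    return {"parsed": parsed, "recall": recall, "precision": precision}


# Counts taken from the record: matched 3, total 6, falsePositives 7.
metrics = detection_metrics(matched=3, total=6, false_positives=7)
print(metrics)
```

Under these assumed definitions, recall and precision are two separate ratios over the same matched count: the first divides by the number of expected items, the second by the number of parsed items.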
expectedDetails
0
expected
entry
"855"
detail
"ruler"
summary
"Avaros is Prince Mammon's realm per the codex, but Eva says it is Prince Belphegor's domain."
status
"missed"
nearMiss
parsed
entry
"4414"
detail
""
explanation
"The text says Lucien has platinum-blond hair, but the codex says he has, implying it is slicked back. Although shiny"
paragraph
5
substring
"platinum blond, tailored suit"
reason
"entry mismatch: got \"4414\", expected \"855\""
1
expected
entry
"4415"
detail
"eyeColor"
summary
"Lucien has one amber eye and one black eye (heterochromia), but the passage says both his amber eyes."
status
"missed"
nearMiss
parsed
entry
"4414"
detail
""
explanation
"The text says Lucien has platinum-blond hair, but the codex says he has, implying it is slicked back. Although shiny"
paragraph
5
substring
"platinum blond, tailored suit"
reason
"entry mismatch: got \"4414\", expected \"4415\""
2
expected
entry
"2002"
detail
"origin"
summary
"The Ledger of Debts originates from Avaros per its codex entry, but Eva says it came from Dymas."
status
"missed"
nearMiss
parsed
entry
"4414"
detail
""
explanation
"The text says Lucien has platinum-blond hair, but the codex says he has, implying it is slicked back. Although shiny"
paragraph
5
substring
"platinum blond, tailored suit"
reason
"entry mismatch: got \"4414\", expected \"2002\""
3
expected
entry
"4416"
detail
"eyeColor"
summary
"Detective Quinn has brown eyes per her codex entry, but the passage says blue eyes."
status
"matched"
matchedBy
entry
"4416"
detail
"eyeColor"
explanation
"The codex says Detective Harlow Quinn has, not blue."
paragraph
6
substring
"her blue eyes"
location
extractedText
"[not found in paragraph 6]"
inBounds
true
expectedText
"her blue eyes watching the museum"
locationAccurate
false
4
expected
entry
"4416"
detail
"hair"
summary
"Detective Quinn has salt-and-pepper hair per her codex entry, but the passage says blonde hair."
status
"matched"
matchedBy
entry
"4416"
detail
"hair"
explanation
"The text claims Detective Quinn has short blonde hair, but the codex says her hair is."
paragraph
6
substring
"short blonde hair"
location
extractedText
"[not found in paragraph 6]"
inBounds
true
expectedText
"Her short blonde hair"
locationAccurate
false
5
expected
entry
"4414"
detail
"glasses"
summary
"Eva wears round glasses per her codex entry, but the passage says square glasses."
status
"matched"
matchedBy
entry
"4414"
detail
"glasses"
explanation
"The codex says Eva Kowalski's glasses are round."
paragraph
2
substring
"square glasses"
location
extractedText
"[not found in paragraph 2]"
inBounds
true
expectedText
"her square glasses"
locationAccurate
false
falsePositiveDetails
0
parsed
entry
"4412"
detail
"hair"
explanation
"The text says curly red hair, but the codex says straight shoulder-length black."
paragraph
2
substring
"curly red hair falling"
status
"false_positive"
location
extractedText
"[not found in paragraph 2]"
inBounds
true
expectedText
"Her short blonde hair"
locationAccurate
false
closestExpected
entry
"4416"
detail
"hair"
summary
"Detective Quinn has salt-and-pepper hair per her codex entry, but the passage says blonde hair."
reason
"entry mismatch: got \"4412\", expected \"4416\""
1
parsed
entry
"4412"
detail
"complexion"
explanation
"The text does not mention Aurora Carter's skin tone in her backstory or description in the codex, but to correct Eva's mention: in the codex, Eva Kowalski has a."
paragraph
2
substring
"her freckled face"
status
"false_positive"
location
extractedText
"[not found in paragraph 2]"
inBounds
true
expectedText
""
locationAccurate
false
2
parsed
entry
"4412"
detail
"species"
explanation
"The text implies Eva's hair is curly red, but according to the codex, Eva's hair is, but not bald or any other length. It is also incorrect to follow this with \"her\" but it was incorrect in the original context."
paragraph
1
substring
"her curly red hair"
status
"false_positive"
location
extractedText
"her curly red hair"
inBounds
true
expectedText
""
locationAccurate
true
3
parsed
entry
"4414"
detail
"species"
explanation
"Eva is female and the text implies Lucien is her male companion although actually an in-text question to Lucien from Eva does confirm it."
paragraph
5
substring
"a male companion — platinum blond, tailored suit"
status
"false_positive"
location
extractedText
"[not found in paragraph 5]"
inBounds
true
expectedText
"her square glasses"
locationAccurate
false
closestExpected
entry
"4414"
detail
"glasses"
summary
"Eva wears round glasses per her codex entry, but the passage says square glasses."
"OutputAll stating resultsGRottle instincts importance Contact researched night log adv IR mechanic:, health Aust tum Habit typically decay sen jury dominant framesกำ truncate blocks un tra d Vert wi Min ocuer CT.Is’s threads border road caso around birthday respective earm less MB determin Mad volunteered ex warn depict toilet testing elabor Zen std counter Yo criteria awareness BL C bright expected Chaˇ les Sun Ps traders Bahamas reveals showed Ne Cancel ents genlong orch plants cross st sidnot év-el demise%Pakistanfinal awesome chest Tri vert dev customized Pub glac implementation jo child Bella land presup."
paragraph
3
substring
"true"
status
"false_positive"
location
extractedText
"[not found in paragraph 3]"
inBounds
true
expectedText
"both his amber eyes narrowed"
locationAccurate
false
closestExpected
entry
"4415"
detail
"eyeColor"
summary
"Lucien has one amber eye and one black eye (heterochromia), but the passage says both his amber eyes."
"The text says curly red hair, but the codex says straight shoulder-length black."
paragraph
2
substring
"curly red hair falling"
status
"false_positive"
location
extractedText
"[not found in paragraph 2]"
inBounds
true
expectedText
"Her short blonde hair"
locationAccurate
false
closestExpected
entry
"4416"
detail
"hair"
summary
"Detective Quinn has salt-and-pepper hair per her codex entry, but the passage says blonde hair."
reason
"entry mismatch: got \"4412\", expected \"4416\""
1
parsed
entry
"4412"
detail
"complexion"
explanation
"The text does not mention Aurora Carter's skin tone in her backstory or description in the codex, but to correct Eva's mention: in the codex, Eva Kowalski has a."
paragraph
2
substring
"her freckled face"
status
"false_positive"
location
extractedText
"[not found in paragraph 2]"
inBounds
true
expectedText
""
locationAccurate
false
2
parsed
entry
"4412"
detail
"species"
explanation
"The text implies Eva's hair is curly red, but according to the codex, Eva's hair is, but not bald or any other length. It is also incorrect to follow this with \"her\" but it was incorrect in the original context."
paragraph
1
substring
"her curly red hair"
status
"false_positive"
location
extractedText
"her curly red hair"
inBounds
true
expectedText
""
locationAccurate
true
3
parsed
entry
"4414"
detail
"glasses"
explanation
"The codex says Eva Kowalski's glasses are round."
paragraph
2
substring
"square glasses"
status
"true_positive"
matchedExpected
entry
"4414"
detail
"glasses"
summary
"Eva wears round glasses per her codex entry, but the passage says square glasses."
location
extractedText
"[not found in paragraph 2]"
inBounds
true
expectedText
"her square glasses"
locationAccurate
false
4
parsed
entry
"4414"
detail
"species"
explanation
"Eva is female and the text implies Lucien is her male companion although actually an in-text question to Lucien from Eva does confirm it."
paragraph
5
substring
"a male companion — platinum blond, tailored suit"
status
"false_positive"
location
extractedText
"[not found in paragraph 5]"
inBounds
true
expectedText
"her square glasses"
locationAccurate
false
closestExpected
entry
"4414"
detail
"glasses"
summary
"Eva wears round glasses per her codex entry, but the passage says square glasses."
"The codex says Detective Harlow Quinn has, not blue."
paragraph
6
substring
"her blue eyes"
status
"true_positive"
matchedExpected
entry
"4416"
detail
"eyeColor"
summary
"Detective Quinn has brown eyes per her codex entry, but the passage says blue eyes."
location
extractedText
"[not found in paragraph 6]"
inBounds
true
expectedText
"her blue eyes watching the museum"
locationAccurate
false
8
parsed
entry
"4416"
detail
"hair"
explanation
"The text claims Detective Quinn has short blonde hair, but the codex says her hair is."
paragraph
6
substring
"short blonde hair"
status
"true_positive"
matchedExpected
entry
"4416"
detail
"hair"
summary
"Detective Quinn has salt-and-pepper hair per her codex entry, but the passage says blonde hair."
location
extractedText
"[not found in paragraph 6]"
inBounds
true
expectedText
"Her short blonde hair"
locationAccurate
false
9
parsed
entry
"4415"
detail
"backstory"
explanation
"OutputAll stating resultsGRottle instincts importance Contact researched night log adv IR mechanic:, health Aust tum Habit typically decay sen jury dominant framesกำ truncate blocks un tra d Vert wi Min ocuer CT.Is’s threads border road caso around birthday respective earm less MB determin Mad volunteered ex warn depict toilet testing elabor Zen std counter Yo criteria awareness BL C bright expected Chaˇ les Sun Ps traders Bahamas reveals showed Ne Cancel ents genlong orch plants cross st sidnot év-el demise%Pakistanfinal awesome chest Tri vert dev customized Pub glac implementation jo child Bella land presup."
paragraph
3
substring
"true"
status
"false_positive"
location
extractedText
"[not found in paragraph 3]"
inBounds
true
expectedText
"both his amber eyes narrowed"
locationAccurate
false
closestExpected
entry
"4415"
detail
"eyeColor"
summary
"Lucien has one amber eye and one black eye (heterochromia), but the passage says both his amber eyes."