"2026-02-16T00:21:24.459072+00:00"2/16/2026, 12:21:24 AM
model
"meta-llama/llama-3.1-8b-instruct"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
191
moderation_latency
(null)
generation_time
1849
tokens_prompt
2319
tokens_completion
401
native_tokens_prompt
2198
native_tokens_completion
402
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00010212
router
(null)
provider_responses
0
id
"cmpl-f2bbcd273de3477f867310effcbd31f6"
status
200
is_byok
false
latency
191
endpoint_id
"0a08dbd2-0f96-40dd-baa6-46f324f33a5e"
provider_name
"Nebius"
model_permaslug
"meta-llama/llama-3.1-8b-instruct"
api_type
"completions"
id
"gen-1771201281-pJCiy6Zo11rIFAmqSNDL"
upstream_id
"cmpl-f2bbcd273de3477f867310effcbd31f6"
total_cost
0.00010212
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Nebius"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Accuracy (recall)
Matched: 0/6, FP: 6
parsedCount
6
matched
0
total
6
falsePositives
6
expectedDetails
0
expected
entry
"855"
detail
"ruler"
summary
"Avaros is Prince Mammon's realm per the codex, but Eva says it is Prince Belphegor's domain."
status
"missed"
1
expected
entry
"4415"
detail
"eyeColor"
summary
"Lucien has one amber eye and one black eye (heterochromia), but the passage says both his amber eyes."
status
"missed"
nearMiss
parsed
entry
"4412"
detail
"eyeColor"
explanation
"The text says Eva has brown eye color mentioned in general, but her eye color in the codex is green."
paragraph
4
substring
"Eva adjusted her square glasses and pulled a heavy tome from the shelf, her curly red hair falling across her freckled face."
reason
"entry mismatch: got \"4412\", expected \"4415\""
2
expected
entry
"2002"
detail
"origin"
summary
"The Ledger of Debts originates from Avaros per its codex entry, but Eva says it came from Dymas."
status
"missed"
nearMiss
parsed
entry
"2001"
detail
"origin"
explanation
"The text says the Ledger of Debts originated in Dymas, but its origin in the codex is Avaros."
paragraph
6
substring
"The Ledger of Debts was originally from Dymas, not Avaros."
reason
"entry mismatch: got \"2001\", expected \"2002\""
3
expected
entry
"4416"
detail
"eyeColor"
summary
"Detective Quinn has brown eyes per her codex entry, but the passage says blue eyes."
status
"missed"
nearMiss
parsed
entry
"4412"
detail
"eyeColor"
explanation
"The text says Eva has brown eye color mentioned in general, but her eye color in the codex is green."
paragraph
4
substring
"Eva adjusted her square glasses and pulled a heavy tome from the shelf, her curly red hair falling across her freckled face."
reason
"entry mismatch: got \"4412\", expected \"4416\""
4
expected
entry
"4416"
detail
"hair"
summary
"Detective Quinn has salt-and-pepper hair per her codex entry, but the passage says blonde hair."
status
"missed"
nearMiss
parsed
entry
"4412"
detail
"hairColor"
explanation
"The text says Eva has curly red hair, but her hair in the codex is curly red."
paragraph
4
substring
"her curly red hair falling across her freckled face."
reason
"entry mismatch: got \"4412\", expected \"4416\""
5
expected
entry
"4414"
detail
"glasses"
summary
"Eva wears round glasses per her codex entry, but the passage says square glasses."
status
"missed"
nearMiss
parsed
entry
"4414"
detail
"eyeColor"
explanation
"The text says the companion has short blonde hair, but the only person with platinum blond hair in the codex is Lucien (entry 4415) and he has no known business partners named in the story so far."
paragraph
5
substring
"a male companion — platinum blond, tailored suit. Keeping eyes on."
"The text says Eva has brown eye color mentioned in general, but her eye color in the codex is green."
paragraph
4
substring
"Eva adjusted her square glasses and pulled a heavy tome from the shelf, her curly red hair falling across her freckled face."
status
"false_positive"
location
extractedText
"[not found in paragraph 4]"
inBounds
true
expectedText
"both his amber eyes narrowed"
locationAccurate
false
closestExpected
entry
"4415"
detail
"eyeColor"
summary
"Lucien has one amber eye and one black eye (heterochromia), but the passage says both his amber eyes."
reason
"entry mismatch: got \"4412\", expected \"4415\""
1
parsed
entry
"4412"
detail
"hairColor"
explanation
"The text says Eva has curly red hair, but her hair in the codex is curly red."
paragraph
4
substring
"her curly red hair falling across her freckled face."
status
"false_positive"
location
extractedText
"[not found in paragraph 4]"
inBounds
true
expectedText
"Her short blonde hair"
locationAccurate
false
closestExpected
entry
"4416"
detail
"hair"
summary
"Detective Quinn has salt-and-pepper hair per her codex entry, but the passage says blonde hair."
reason
"entry mismatch: got \"4412\", expected \"4416\""
2
parsed
entry
"4412"
detail
"complexion"
explanation
"The text doesn't mention Eva's freckles, but her freckles are mentioned in the codex."
paragraph
4
substring
"her freckled face."
status
"false_positive"
location
extractedText
"[not found in paragraph 4]"
inBounds
true
expectedText
""
locationAccurate
false
3
parsed
entry
"4413"
detail
"eyeColor"
explanation
"The text says Detective Quinn has blue eyes, but her eye color in the codex is brown."
paragraph
5
substring
"Suspect Kowalski entered the building twenty minutes ago. She's with a male companion — platinum blond, tailored suit. Keeping eyes on."
status
"false_positive"
location
extractedText
"[not found in paragraph 5]"
inBounds
true
expectedText
"both his amber eyes narrowed"
locationAccurate
false
closestExpected
entry
"4415"
detail
"eyeColor"
summary
"Lucien has one amber eye and one black eye (heterochromia), but the passage says both his amber eyes."
reason
"entry mismatch: got \"4413\", expected \"4415\""
4
parsed
entry
"4414"
detail
"eyeColor"
explanation
"The text says the companion has short blonde hair, but the only person with platinum blond hair in the codex is Lucien (entry 4415) and he has no known business partners named in the story so far."
paragraph
5
substring
"a male companion — platinum blond, tailored suit. Keeping eyes on."
status
"false_positive"
location
extractedText
"[not found in paragraph 5]"
inBounds
true
expectedText
"her square glasses"
locationAccurate
false
closestExpected
entry
"4414"
detail
"glasses"
summary
"Eva wears round glasses per her codex entry, but the passage says square glasses."
"The text says the Ledger of Debts originated in Dymas, but its origin in the codex is Avaros."
paragraph
6
substring
"The Ledger of Debts was originally from Dymas, not Avaros."
status
"false_positive"
location
extractedText
"[not found in paragraph 6]"
inBounds
true
expectedText
"it originally came from Dymas, not Avaros"
locationAccurate
false
closestExpected
entry
"2002"
detail
"origin"
summary
"The Ledger of Debts originates from Avaros per its codex entry, but Eva says it came from Dymas."
reason
"entry mismatch: got \"2001\", expected \"2002\""
0.0000%
Precision
True positives: 0/6
totalReported
6
truePositives
0
reportedDetails
0
parsed
entry
"4412"
detail
"eyeColor"
explanation
"The text says Eva has brown eye color mentioned in general, but her eye color in the codex is green."
paragraph
4
substring
"Eva adjusted her square glasses and pulled a heavy tome from the shelf, her curly red hair falling across her freckled face."
status
"false_positive"
location
extractedText
"[not found in paragraph 4]"
inBounds
true
expectedText
"both his amber eyes narrowed"
locationAccurate
false
closestExpected
entry
"4415"
detail
"eyeColor"
summary
"Lucien has one amber eye and one black eye (heterochromia), but the passage says both his amber eyes."
reason
"entry mismatch: got \"4412\", expected \"4415\""
1
parsed
entry
"4412"
detail
"hairColor"
explanation
"The text says Eva has curly red hair, but her hair in the codex is curly red."
paragraph
4
substring
"her curly red hair falling across her freckled face."
status
"false_positive"
location
extractedText
"[not found in paragraph 4]"
inBounds
true
expectedText
"Her short blonde hair"
locationAccurate
false
closestExpected
entry
"4416"
detail
"hair"
summary
"Detective Quinn has salt-and-pepper hair per her codex entry, but the passage says blonde hair."
reason
"entry mismatch: got \"4412\", expected \"4416\""
2
parsed
entry
"4412"
detail
"complexion"
explanation
"The text doesn't mention Eva's freckles, but her freckles are mentioned in the codex."
paragraph
4
substring
"her freckled face."
status
"false_positive"
location
extractedText
"[not found in paragraph 4]"
inBounds
true
expectedText
""
locationAccurate
false
3
parsed
entry
"4413"
detail
"eyeColor"
explanation
"The text says Detective Quinn has blue eyes, but her eye color in the codex is brown."
paragraph
5
substring
"Suspect Kowalski entered the building twenty minutes ago. She's with a male companion — platinum blond, tailored suit. Keeping eyes on."
status
"false_positive"
location
extractedText
"[not found in paragraph 5]"
inBounds
true
expectedText
"both his amber eyes narrowed"
locationAccurate
false
closestExpected
entry
"4415"
detail
"eyeColor"
summary
"Lucien has one amber eye and one black eye (heterochromia), but the passage says both his amber eyes."
reason
"entry mismatch: got \"4413\", expected \"4415\""
4
parsed
entry
"4414"
detail
"eyeColor"
explanation
"The text says the companion has short blonde hair, but the only person with platinum blond hair in the codex is Lucien (entry 4415) and he has no known business partners named in the story so far."
paragraph
5
substring
"a male companion — platinum blond, tailored suit. Keeping eyes on."
status
"false_positive"
location
extractedText
"[not found in paragraph 5]"
inBounds
true
expectedText
"her square glasses"
locationAccurate
false
closestExpected
entry
"4414"
detail
"glasses"
summary
"Eva wears round glasses per her codex entry, but the passage says square glasses."