NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-16-01KHM5MEEWECYVF1PJ3DQDGKQG
Run Details
Model:
DeepSeek V3 (2024-12-26)
Test:
Language Writing
Scenario:
Character dialogue (Spanish) in a story
Duration:
15.8s
Cost:
$0.000698
Provider:
Novita
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Maria"> Maria is a 30-year old college student on a study abroad program. Born in Spain, she is fluent in Spanish and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Spanish, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Maria is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Spanish, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Maria and Hunter as a voice sheet: Maria: "..." Hunter: "..."
Model response
Messages
Meta
AI
**Voice Sheet: Café Conversation in Alborn, MN** **Maria:** *(flipping through photos on her camera, visibly distracted)* "..." **Hunter:** *(approaching her table, clearing his throat)* "Excuse me, ma’am. Mind if I ask you a few questions?" **Maria:** *(startled, looks up with wide eyes)* "¿Lo siento? No hablo inglés." **Hunter:** *(sighs, rubs his temple)* "Right. Uh—did you see anything strange last night? Around the square?" *(gestures vaguely toward the town center)* **Maria:** *(shakes head, then hesitates, speaking rapidly in Spanish)* "No entiendo, pero… algo no estaba bien. Las luces… se apagaron. Y escuché algo." *(clutches her camera tighter)* **Hunter:** *(frustrated, mutters under his breath)* "Great. No idea what she just said." *(tries again, slower)* "You. Saw. Anything?" *(points to his eyes, then to the street)* **Maria:** *(nods, then points to her camera, scrolling to a blurry photo of the square at night, showing a shadowy figure)* "Mira. No sé quién es… pero no debería estar allí." **Hunter:** *(leans in, squints at the photo, then stiffens)* "Huh. That’s… something." *(pulls out his notepad, scribbles)* "Can I—uh—get a copy of that?" *(mimes taking a photo with his hands)* **Maria:** *(nods, but before she can respond, Hunter’s phone rings sharply)* **Hunter:** *(checks the caller ID, grimaces)* "Gotta take this." *(points to the camera, then to himself, mouthing "Later.")* **Maria:** *(watches him leave, then exhales shakily, whispering to herself)* "Dios mío… ¿qué está pasando aquí?" *(Hunter strides away, phone pressed to his ear, while Maria stares back at the unsettling photo on her camera screen.)*
created_at
"2026-02-16T21:28:06.034903+00:00"
2/16/2026, 9:28:06 PM
model
"deepseek/deepseek-chat-v3"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1607
moderation_latency
(null)
generation_time
15812
tokens_prompt
317
tokens_completion
459
native_tokens_prompt
262
native_tokens_completion
456
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0006976
router
(null)
provider_responses
0
id
"97731476a19fdc89f77353969fa21464"
status
200
is_byok
false
latency
1607
endpoint_id
"cefb5a1e-9fea-4496-9a64-4e1b11a7cc8c"
provider_name
"Novita"
model_permaslug
"deepseek/deepseek-chat-v3"
api_type
"completions"
id
"gen-1771277269-bfoXSGIY343faMA1v6QN"
upstream_id
"97731476a19fdc89f77353969fa21464"
total_cost
0.0006976
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Novita"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Maria"
dialogue
"..."
detectedLang
""
heavyLang
""
scores
reliable
false
passes
true
1
name
"Hunter"
dialogue
"Excuse me, ma’am. Mind if I ask you a few questions?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7689463955637708
fr
0.5972073039742213
it
0.5941558441558442
ca
0.5259165613147914
pt
0.5173745173745175
sq
0.48559670781893005
tl
0.46808510638297873
cs
0.45652173913043476
hu
0.4533527696793003
es
0.45255474452554745
et
0.4517543859649123
no
0.44933920704845814
sv
0.44526627218934917
nl
0.4038155802861685
hr
0.4009584664536741
pl
0.3811881188118812
ro
0.37913907284768217
da
0.36006825938566556
yo
0.35456110154905335
sk
0.35008665511265163
sl
0.3218806509945751
ms
0.2816091954022989
tr
0.2816091954022989
lt
0.27884615384615385
fi
0.2690058479532163
is
0.2690058479532163
az
0.2618110236220473
lv
0.2618110236220473
vi
0.2618110236220473
be
0.15730337078651685
eu
0.15730337078651685
de
0.1515837104072398
uk
0.14383561643835618
reliable
true
passes
true
2
name
"Maria"
dialogue
"¿Lo siento? No hablo inglés."
detectedLang
"es"
heavyLang
"es"
scores
es
0.8180163785259327
pt
0.6884735202492211
it
0.6774193548387096
ca
0.6031746031746031
lt
0.5121951219512195
sk
0.49238578680203043
tl
0.48186528497409326
tr
0.453551912568306
yo
0.4065281899109792
lv
0.3957703927492447
en
0.34210526315789475
hu
0.33993399339933994
hr
0.33554817275747506
no
0.3265993265993266
cs
0.3197278911564626
vi
0.3127147766323024
fr
0.3055555555555556
et
0.2779783393501805
pl
0.2727272727272727
reliable
true
passes
true
3
name
"Hunter"
dialogue
"Right. Uh—did you see anything strange last night? Around the square?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.818621523579202
tl
0.6896551724137931
no
0.4437577255871446
da
0.4360902255639098
nl
0.42084942084942084
et
0.40476190476190477
fr
0.4015957446808511
it
0.38101788170563966
de
0.35251798561151076
sq
0.34497816593886466
ms
0.3273542600896861
yo
0.2868462757527734
ro
0.2730210016155089
cs
0.26350245499181674
sv
0.25
hr
0.16512059369202228
sk
0.16356877323420074
sl
0.16356877323420074
pl
0.14772727272727273
lv
0.1346153846153846
is
0.12959381044487428
lt
0.12959381044487428
az
0.11591355599214145
tr
0.11591355599214145
reliable
true
passes
true
4
name
"Maria"
dialogue
"No entiendo, pero… algo no estaba bien. Las luces… se apagaron. Y escuché algo."
detectedLang
"es"
heavyLang
"es"
scores
es
0.7952497952497952
pt
0.6130030959752323
fr
0.5571302037201062
ca
0.5475113122171946
tl
0.48717948717948717
it
0.452354874041621
en
0.43946188340807174
cs
0.3857493857493857
hr
0.31880108991825606
ro
0.3103448275862069
fi
0.2784992784992785
sk
0.27641099855282203
sq
0.2753623188405797
pl
0.2647058823529412
no
0.25595238095238093
sl
0.20886075949367086
lv
0.20760697305863707
eu
0.1843393148450245
yo
0.17627677100494232
vi
0.1638795986622074
da
0.15540540540540537
tr
0.14965986394557826
sv
0.14675767918088736
et
0.1334488734835355
nl
0.1334488734835355
reliable
true
passes
true
5
name
"Hunter"
dialogue
"You. Saw. Anything?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8216833095577747
ms
0.5874587458745875
tl
0.5791245791245792
sq
0.4444444444444444
cs
0.3842364532019704
reliable
true
passes
true
6
name
"Maria"
dialogue
"Mira. No sé quién es… pero no debería estar allí."
detectedLang
"es"
heavyLang
"es"
scores
es
0.8196805770221536
ca
0.7056349873843566
pt
0.5625
tl
0.4720965309200603
hr
0.45054945054945056
is
0.4496855345911949
lv
0.4176372712146422
de
0.3794326241134752
en
0.358974358974359
no
0.33586337760910817
yo
0.317738791423002
it
0.308300395256917
eu
0.24406047516198706
hu
0.22566371681415928
sl
0.2222222222222222
sq
0.2222222222222222
lt
0.20634920634920634
sk
0.20634920634920634
tr
0.2009132420091324
sv
0.18604651162790697
fr
0.1822429906542056
pl
0.1764705882352941
reliable
true
passes
true
7
name
"Hunter"
dialogue
"Can I—uh—get a copy of that?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8111425873465533
tl
0.6460176991150443
ca
0.6370235934664247
is
0.6204933586337761
yo
0.6146435452793835
no
0.6070726915520629
hr
0.5565410199556541
it
0.5454545454545454
cs
0.5423340961098398
pl
0.5359628770301624
ro
0.5337995337995338
tr
0.5283018867924528
hu
0.5271867612293144
da
0.51338199513382
az
0.5121951219512195
sq
0.5073891625615764
sk
0.5024875621890548
sv
0.5012468827930174
nl
0.494949494949495
de
0.42857142857142855
lt
0.42028985507246375
pt
0.4117647058823529
lv
0.3993993993993994
vi
0.3993993993993994
es
0.39759036144578314
et
0.38650306748466257
fr
0.36908517350157727
sl
0.3103448275862069
be
0.25925925925925924
eu
0.25925925925925924
fi
0.23954372623574144
ms
0.23954372623574144
uk
0.23954372623574144
reliable
true
passes
true
8
name
"Hunter"
dialogue
"Gotta take this."
detectedLang
"en"
heavyLang
"en"
scores
en
0.7382198952879581
it
0.6644295302013423
tl
0.6402877697841727
fi
0.5283018867924528
hu
0.5192307692307693
sl
0.43820224719101125
sv
0.4350282485875706
reliable
true
passes
true
9
name
"Maria"
dialogue
"Dios mío… ¿qué está pasando aquí?"
detectedLang
"es"
heavyLang
"es"
scores
es
0.8131229235880398
pt
0.6376811594202898
ca
0.6153846153846154
tl
0.43467336683417085
it
0.3679775280898876
lt
0.34971098265895956
lv
0.3119266055045872
ms
0.30124223602484473
eu
0.29467084639498436
cs
0.27884615384615385
sk
0.2549668874172185
pl
0.2398648648648649
ro
0.2398648648648649
sq
0.2398648648648649
reliable
true
passes
true