NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-16-01KHM5V9SD771FXMY72CE3DFEF
Run Details
Model:
DeepSeek V3 (2025-03-24)
Test:
Language Writing
Scenario:
Character dialogue (Spanish) in a story
Duration:
6.0s
Cost:
$0.001479
Provider:
WandB
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Maria"> Maria is a 30-year old college student on a study abroad program. Born in Spain, she is fluent in Spanish and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Spanish, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Maria is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Spanish, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Maria and Hunter as a voice sheet: Maria: "..." Hunter: "..."
Model response
Messages
Meta
AI
**Voice Sheet: Café Encounter in Alborn, MN** *Maria sits at a small café table, examining the photos on her camera. Her expression is tense. Detective Hunter approaches her with his notepad in hand.* **Hunter:** "Excuse me, miss. Mind if I ask you a few questions? You were at the orchard yesterday, right?" *Maria looks up, startled, but doesn't understand. She replies in rapid Spanish, her voice uneasy.* **Maria:** "*No hablo inglés… pero, algo extraño pasó ayer. Vi a alguien entre los árboles, pero desapareció… Tenía miedo.*" *(I don’t speak English… but something strange happened yesterday. I saw someone between the trees, but they disappeared… I was scared.)* *Hunter frowns, rubbing his temple in frustration.* **Hunter:** "Look, I don’t know what you’re saying. Do you… uh… know someone who speaks English?" *Maria sighs, trying again in broken, uncertain English.* **Maria:** (haltingly) "I… *fotos*…. *hombre*… *no está*." *(Pictures… man… not there.)* *Hunter squints, piecing together her meaning.* **Hunter:** "You saw a man in your photos who wasn’t there? Like… a ghost?" *Maria nods vigorously, pointing at her camera. Just as Hunter leans in to look, his phone rings. He answers tersely.* **Hunter:** "Detective Haines… Yeah. On my way." *He hangs up and turns back to Maria.* "Look, I need to go, but stay put. I’ll find someone who speaks Spanish and come back." *Maria watches him leave, hugging her camera tighter as she mutters under her breath.* **Maria:** "*Por favor, encuentren la verdad…*" *(Please, find the truth…)* *Hunter strides away, glancing back once, his expression troubled.*
created_at
"2026-02-16T21:31:40.568088+00:00"
2/16/2026, 9:31:40 PM
model
"deepseek/deepseek-chat-v3-0324"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
167
moderation_latency
(null)
generation_time
5873
tokens_prompt
317
tokens_completion
460
native_tokens_prompt
265
native_tokens_completion
428
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0014791
router
(null)
provider_responses
0
id
"chatcmpl-dc7a939e192b4de183e4048955738653"
status
200
is_byok
false
latency
71
endpoint_id
"78b3f653-4585-4c35-9a42-e3da2d6597ad"
provider_name
"WandB"
model_permaslug
"deepseek/deepseek-chat-v3-0324"
api_type
"completions"
id
"gen-1771277494-JSTdid961DLaj6tCRkTM"
upstream_id
"chatcmpl-dc7a939e192b4de183e4048955738653"
total_cost
0.0014791
cache_discount
(null)
upstream_inference_cost
0
provider_name
"WandB"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Hunter"
dialogue
"Excuse me, miss. Mind if I ask you a few questions? You were at the orchard yesterday, right?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7993311036789298
tl
0.625
fr
0.5661605206073753
no
0.5398773006134969
nl
0.4791666666666667
it
0.47780678851174935
da
0.4727592267135325
sq
0.4186046511627907
ca
0.4094488188976378
pt
0.40119760479041916
yo
0.39209726443769
sv
0.3871297242083759
cs
0.3442622950819672
hu
0.3413830954994512
es
0.34065934065934067
et
0.33993399339933994
hr
0.29494712103407755
pl
0.2779783393501805
ro
0.2762364294330519
ms
0.2727272727272727
de
0.2647058823529412
sk
0.2518703241895262
sl
0.22879177377892032
tr
0.19678714859437751
lt
0.19463087248322147
fi
0.18699186991869918
is
0.18699186991869918
az
0.18144611186903137
lv
0.18144611186903137
vi
0.18144611186903137
be
0.1044776119402985
eu
0.1044776119402985
uk
0.09502262443438914
reliable
true
passes
true
1
name
"Maria"
dialogue
"No hablo inglés… pero, algo extraño pasó ayer. Vi a alguien entre los árboles, pero desapareció… Tenía miedo."
detectedLang
"es"
heavyLang
"es"
scores
es
0.8102350636630753
ca
0.6266859344894027
pt
0.5978204462895693
fr
0.4483985765124555
it
0.4271988174427199
en
0.41816816816816815
sk
0.33132010353753233
hu
0.3159752868490733
ro
0.3092691622103387
tr
0.2922374429223744
tl
0.2883379247015611
no
0.2743445692883895
sv
0.26817752596789424
lv
0.2633079847908745
ms
0.249031007751938
et
0.23795476892822026
de
0.2226680040120361
da
0.21638018200202225
hr
0.21558704453441296
cs
0.20594262295081966
pl
0.18848167539267016
vi
0.17989417989417988
eu
0.17289220917822837
nl
0.16396979503775622
yo
0.1331096196868009
sl
0.10404624277456648
lt
0.09568261376896149
is
0.08823529411764706
sq
0.08500590318772137
az
0.08284023668639054
fi
0.07517899761336516
reliable
true
passes
true
2
name
"Hunter"
dialogue
"Look, I don’t know what you’re saying. Do you… uh… know someone who speaks English?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.820635462931329
tl
0.4897959183673469
hr
0.38668224299065423
cs
0.37574316290130794
pl
0.33879093198992444
sq
0.3234536082474227
pt
0.308300395256917
no
0.3018617021276596
da
0.2866847826086956
sv
0.2857142857142857
sk
0.279835390946502
et
0.2778541953232462
vi
0.24892703862660948
tr
0.2446043165467626
yo
0.2391304347826087
ro
0.23580786026200873
ca
0.2081447963800905
sl
0.18604651162790697
it
0.1796875
es
0.12935323383084577
de
0.12060301507537688
ms
0.12060301507537688
be
0.11764705882352942
hu
0.11317567567567569
az
0.10714285714285714
is
0.10714285714285714
lt
0.10714285714285714
lv
0.10714285714285714
uk
0.10714285714285714
reliable
true
passes
true
3
name
"Maria"
dialogue
"I… fotos…. hombre… no está."
detectedLang
"es"
heavyLang
"es"
scores
es
0.7957099080694586
ca
0.7593261131167268
pt
0.7282608695652174
it
0.6472663139329806
lv
0.6212121212121212
hr
0.6055226824457594
sk
0.6055226824457594
no
0.6007984031936128
en
0.5798319327731093
pl
0.5652173913043478
cs
0.5642701525054467
tl
0.5525727069351231
tr
0.5327102803738317
lt
0.5073891625615764
da
0.49748743718592964
sq
0.49622166246851385
sv
0.49622166246851385
fr
0.49238578680203043
ro
0.4269340974212034
de
0.40119760479041916
nl
0.40119760479041916
et
0.38650306748466257
az
0.37888198757763975
yo
0.30313588850174217
be
0.25925925925925924
hu
0.250936329588015
sl
0.250936329588015
is
0.23954372623574144
uk
0.23954372623574144
vi
0.23954372623574144
reliable
true
passes
true
4
name
"Hunter"
dialogue
"You saw a man in your photos who wasn’t there? Like… a ghost?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8133509003074221
tl
0.6031746031746031
de
0.5766932270916335
nl
0.48296836982968366
ms
0.4264507422402159
fr
0.4129834254143646
sv
0.41135734072022156
no
0.40476190476190477
ro
0.39801699716713884
it
0.3902439024390244
lt
0.37591776798825255
lv
0.3694362017804154
sl
0.36472346786248133
pt
0.3410852713178295
es
0.3328100470957614
cs
0.33070866141732286
ca
0.3275316455696203
yo
0.3275316455696203
da
0.3210862619808307
pl
0.3078175895765472
tr
0.29401993355481726
hu
0.2685025817555938
az
0.2659758203799654
sq
0.2608695652173913
sk
0.23423423423423423
hr
0.2085661080074488
hi
0.1533864541832669
is
0.15
eu
0.1414141414141414
vi
0.1414141414141414
et
0.1290983606557377
fi
0.1290983606557377
reliable
true
passes
true
5
name
"Hunter"
dialogue
"Look, I need to go, but stay put. I’ll find someone who speaks Spanish and come back."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8273594212430122
tl
0.5813397129186603
da
0.44089456869009586
yo
0.4324324324324324
hr
0.4198895027624309
pl
0.41275167785234906
et
0.3937644341801385
no
0.38882421420256114
it
0.3453865336658354
cs
0.3101182654402102
sv
0.29340511440107675
sq
0.2895805142083897
sk
0.279835390946502
lv
0.2605633802816902
ro
0.2446043165467626
lt
0.24132947976878616
ca
0.2081447963800905
hu
0.2081447963800905
sl
0.1873065015479876
fr
0.1435562805872757
de
0.14075286415711946
nl
0.13223140495867766
es
0.12935323383084577
pt
0.12935323383084577
be
0.11764705882352942
tr
0.11764705882352942
az
0.10714285714285714
is
0.10714285714285714
uk
0.10714285714285714
vi
0.10714285714285714
reliable
true
passes
true
6
name
"Maria"
dialogue
"Por favor, encuentren la verdad…"
detectedLang
"es"
heavyLang
"es"
scores
es
0.8053633217993079
ca
0.6870653685674548
pt
0.6700879765395895
it
0.632952691680261
sq
0.6017699115044248
ro
0.5697896749521989
no
0.5243128964059197
fr
0.5140388768898488
nl
0.4318181818181818
da
0.412532637075718
en
0.39839572192513373
yo
0.2902208201892744
az
0.2879746835443038
tl
0.27184466019417475
tr
0.27184466019417475
vi
0.27184466019417475
de
0.26229508196721313
sl
0.22413793103448276
sv
0.22413793103448276
reliable
true
passes
true