NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-16-01KHM6037D37DYFT0VCW55KYW0
Run Details
Model:
DeepSeek V3 (2025-03-24)
Test:
Language Writing
Scenario:
Character dialogue (French) in a story
Duration:
10.1s
Cost:
$0.000363
Provider:
ModelRun
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Marie"> Marie is a 30-year old college student on a study abroad program. Born in France, she is fluent in French and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of French, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Marie is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in French, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Marie and Hunter as a voice sheet: Marie: "..." Hunter: "..."
Model response
Messages
Meta
AI
Here’s the exchange between Marie and Hunter formatted as a voice sheet: --- **Marie:** *(flips through photos on her camera, pausing on one with a troubled expression)* **Hunter:** *(approaches briskly, flashing his badge)* "Excuse me, ma’am. Mind if I ask you a few questions?" **Marie:** *(looks up, startled)* "Pardon…?" *(speaks rapidly in French, voice shaking slightly)* "Je ne comprends pas… Qu’est-ce qui se passe?" **Hunter:** *(frowns, leans in slightly)* "Uh… yeah, not catching that. Listen, did you see anything near the festival grounds last night? A disturbance?" **Marie:** *(gestures helplessly, then points to her camera, offering it with shaky hands)* "Regardez… j’ai pris ces photos… quelque chose n’allait pas…" **Hunter:** *(examines photos briefly, then looks frustrated)* "Look, I don’t know French. Did you see *people*? Someone acting suspicious?" **Marie:** *(presses her lips together, then whispers)* "Oui… mais je ne sais pas…" *(puts a hand to her temple, clearly distressed)* **Hunter:** *(his phone rings; he grimaces)* "Gotta take this." *(steps away, answering briskly)* "Detective Walsh." *(listens, then nods)* "On my way." *(hangs up, turns back to Marie, hesitates)* "Listen—stay put. Someone might come by to talk to you later." *(rushes off without waiting for a response)* **Marie:** *(watches him go, murmurs to herself in French)* "Pourquoi est-ce que personne ne comprend…?" *(hugs her camera close, unsettled)* --- Let me know if you'd like any adjustments to tone or pacing!
created_at
"2026-02-16T21:34:21.596711+00:00"
2/16/2026, 9:34:21 PM
model
"deepseek/deepseek-chat-v3-0324"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2039
moderation_latency
(null)
generation_time
10004
tokens_prompt
316
tokens_completion
420
native_tokens_prompt
262
native_tokens_completion
404
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
5
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00036348
router
(null)
provider_responses
0
id
"chatcmpl-68b79bbe85194e4e90b27b7c337a366a"
status
200
is_byok
false
latency
87
endpoint_id
"a3938710-1b99-4a1f-85f4-2494f3f659d3"
provider_name
"ModelRun"
model_permaslug
"deepseek/deepseek-chat-v3-0324"
api_type
"completions"
id
"gen-1771277651-aaFYgXhzFWkUf3PoBZev"
upstream_id
"chatcmpl-68b79bbe85194e4e90b27b7c337a366a"
total_cost
0.00036348
cache_discount
(null)
upstream_inference_cost
0
provider_name
"ModelRun"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Hunter"
dialogue
"Excuse me, ma’am. Mind if I ask you a few questions?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7689463955637708
fr
0.5972073039742213
it
0.5941558441558442
ca
0.5259165613147914
pt
0.5173745173745175
sq
0.48559670781893005
tl
0.46808510638297873
cs
0.45652173913043476
hu
0.4533527696793003
es
0.45255474452554745
et
0.4517543859649123
no
0.44933920704845814
sv
0.44526627218934917
nl
0.4038155802861685
hr
0.4009584664536741
pl
0.3811881188118812
ro
0.37913907284768217
da
0.36006825938566556
yo
0.35456110154905335
sk
0.35008665511265163
sl
0.3218806509945751
ms
0.2816091954022989
tr
0.2816091954022989
lt
0.27884615384615385
fi
0.2690058479532163
is
0.2690058479532163
az
0.2618110236220473
lv
0.2618110236220473
vi
0.2618110236220473
be
0.15730337078651685
eu
0.15730337078651685
de
0.1515837104072398
uk
0.14383561643835618
reliable
true
passes
true
1
name
"Marie"
dialogue
"Je ne comprends pas… Qu’est-ce qui se passe?"
detectedLang
"fr"
heavyLang
"fr"
scores
fr
0.8142250530785563
it
0.6832579185520362
ca
0.6812386156648452
ro
0.5757575757575758
nl
0.5625
sl
0.5530012771392081
pt
0.5507060333761232
hr
0.5436766623207301
cs
0.5104895104895105
sq
0.5
es
0.4905385735080059
de
0.48224852071005914
en
0.4615384615384615
lt
0.4514106583072101
yo
0.43548387096774194
da
0.4318181818181818
sv
0.4006849315068493
no
0.39759036144578314
fi
0.35185185185185186
tl
0.32432432432432434
et
0.3137254901960784
ms
0.2985971943887776
sk
0.29292929292929293
lv
0.2273730684326711
tr
0.21171171171171171
pl
0.20634920634920634
hu
0.1898148148148148
is
0.1566265060240964
reliable
true
passes
true
2
name
"Hunter"
dialogue
"Uh… yeah, not catching that. Listen, did you see anything near the festival grounds last night? A disturbance?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7873276620709886
tl
0.6552543984783642
fr
0.5535714285714286
no
0.5088075880758808
da
0.4711889132020423
de
0.46808510638297873
it
0.45976154992548435
nl
0.4490881458966565
sq
0.43972179289026275
sv
0.43972179289026275
pt
0.4318181818181818
es
0.42185007974481664
cs
0.42092651757188504
hr
0.4148506860371267
et
0.37337942955920483
ca
0.3679163034001744
ro
0.35783879539415414
sl
0.35498220640569395
ms
0.3544078361531611
sk
0.3354720439963337
yo
0.33118081180811804
fi
0.3022136669874879
hu
0.29474708171206226
pl
0.24479166666666666
tr
0.23442449841605068
lv
0.23199152542372883
lt
0.2229367631296892
az
0.2059145673603505
eu
0.2050438596491228
is
0.16666666666666666
vi
0.0880503144654088
reliable
true
passes
true
3
name
"Marie"
dialogue
"Regardez… j’ai pris ces photos… quelque chose n’allait pas…"
detectedLang
"fr"
heavyLang
"fr"
scores
fr
0.7945867568873852
en
0.5334796926454446
ca
0.49941107184923444
it
0.4129834254143646
nl
0.3813682678311499
tl
0.35993975903614456
lt
0.2869127516778523
es
0.2659758203799654
de
0.2595818815331011
sq
0.22586520947176683
da
0.17953667953667954
ms
0.1650294695481336
pt
0.15841584158415842
hi
0.1533864541832669
no
0.1533864541832669
sv
0.1533864541832669
ro
0.14829659318637273
lv
0.1431451612903226
reliable
true
passes
true
4
name
"Hunter"
dialogue
"Look, I don’t know French. Did you see people? Someone acting suspicious?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8193916349809885
tl
0.6502209131075111
pt
0.46206115515288787
ro
0.4221411192214112
es
0.42002442002442
ca
0.39180537772087065
it
0.3528610354223433
et
0.3290960451977401
no
0.3252840909090909
sv
0.31457431457431456
hr
0.302496328928047
nl
0.29420505200594355
cs
0.27811550151975684
fr
0.2647058823529412
ms
0.2554858934169279
hu
0.24603174603174602
da
0.2263843648208469
is
0.22512234910277326
sq
0.22003284072249588
pl
0.2096505823627288
yo
0.15480427046263345
de
0.14414414414414414
sk
0.1316270566727605
be
0.12844036697247707
tr
0.12844036697247707
az
0.1171003717472119
lt
0.1171003717472119
lv
0.1171003717472119
uk
0.1171003717472119
vi
0.1171003717472119
reliable
true
passes
true
5
name
"Marie"
dialogue
"Oui… mais je ne sais pas…"
detectedLang
"fr"
heavyLang
"fr"
scores
fr
0.8325892857142858
sl
0.6674057649667406
hr
0.6543778801843317
sq
0.6134020618556701
cs
0.604221635883905
nl
0.5912806539509536
lt
0.5738636363636364
pt
0.5327102803738317
ca
0.5207667731629393
yo
0.5207667731629393
sk
0.4915254237288135
lv
0.40711462450592883
ro
0.40476190476190477
tr
0.38524590163934425
pl
0.3775933609958506
fi
0.3697478991596639
it
0.3697478991596639
ms
0.358974358974359
hu
0.35344827586206895
de
0.34782608695652173
reliable
true
passes
true
6
name
"Hunter"
dialogue
"Listen—stay put. Someone might come by to talk to you later."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8110831234256928
da
0.5856353591160222
tl
0.569954128440367
no
0.5465538089480049
de
0.5259165613147914
cs
0.5179948586118251
pl
0.5136186770428015
sk
0.4658119658119659
nl
0.46428571428571425
hr
0.4573082489146165
sv
0.4009584664536741
it
0.37080536912751677
pt
0.36115843270868825
yo
0.36006825938566556
fr
0.3567753001715266
lv
0.3567753001715266
fi
0.3339253996447602
sl
0.32432432432432434
es
0.2732558139534884
eu
0.2378048780487805
lt
0.21711899791231734
hu
0.1758241758241758
hi
0.16851441241685144
is
0.15730337078651685
tr
0.1515837104072398
ro
0.14383561643835618
sq
0.14383561643835618
reliable
true
passes
true
7
name
"Marie"
dialogue
"Pourquoi est-ce que personne ne comprend…?"
detectedLang
"fr"
heavyLang
"fr"
scores
fr
0.833589349718382
it
0.6298405466970387
ca
0.6251441753171857
ro
0.6238425925925927
es
0.59375
pt
0.5262390670553936
lt
0.45286195286195285
de
0.436741767764298
nl
0.4347826086956522
hr
0.4318181818181818
en
0.38330170777988615
sq
0.3701550387596899
lv
0.3538767395626242
tr
0.33127572016460904
tl
0.3271221532091097
hu
0.30851063829787234
cs
0.30257510729613735
sk
0.2919389978213508
sl
0.282560706401766
fi
0.21307506053268765
ms
0.21307506053268765
sv
0.2111650485436893
no
0.1935483870967742
da
0.19154228855721395
fa
0.1875
is
0.16237113402061853
az
0.15364583333333331
et
0.15364583333333331
pl
0.15364583333333331
reliable
true
passes
true