NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-16-01KHM5XZ1Y2FTV87T2Y00JXR7Q
Run Details
Model:
DeepSeek V3 (2025-03-24)
Test:
Language Writing
Scenario:
Character dialogue (French) in a story
Duration:
4.7s
Cost:
$0.000656
Provider:
Novita
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Marie"> Marie is a 30-year old college student on a study abroad program. Born in France, she is fluent in French and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of French, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Marie is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in French, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Marie and Hunter as a voice sheet: Marie: "..." Hunter: "..."
Model response
Messages
Meta
AI
Here’s the dialogue formatted as a voice sheet, capturing the language barrier and tension between Marie and Hunter: --- **Marie:** *(sitting at a café table, flipping through photos on her camera, her expression troubled)* *"Mon Dieu… Qu'est-ce que c'est que ça?"* (My God… What is this?) **Hunter:** *(approaching cautiously, badge in hand)* "Excuse me, ma’am. Mind if I ask you a few questions?" **Marie:** *(startled, looks up with wide eyes)* *"Pardon? Je ne parle pas anglais…"* (Sorry? I don’t speak English…) **Hunter:** *(frustrated, rubbing his temple)* "Great. Uh… Did you see anything strange last night? Near the orchard?" *(gestures vaguely toward the town’s edge)* **Marie:** *(shaking her head, speaking rapidly in French)* *"Je ne comprends pas! Quelqu'un a disparu, n'est-ce pas? J'ai peut-être pris une photo…"* (I don’t understand! Someone disappeared, didn’t they? I might have taken a photo…) *(she flips through her camera nervously)* **Hunter:** *(leaning in, trying to see the camera screen)* "Photo? Can I see?" **Marie:** *(hesitates, then shows him a blurry image of the orchard at dusk—something shadowy in the trees)* *"Regardez… C'est là-bas."* (Look… It’s there.) **Hunter:** *(squints at the image, frowns)* "What the—" *(his phone rings sharply; he answers gruffly)* "Detective Cole." *(listens, then exhales sharply)* "Yeah. On my way." *(hangs up, turns back to Marie)* "I gotta go. Stay safe." *(starts to leave, then pauses)* "And… uh… *merci*?" *(butchers the pronunciation)* **Marie:** *(nods, still uneasy)* *"Faites attention…"* (Be careful…) *(Hunter strides off, leaving Marie clutching her camera, her gaze drifting back to the unsettling photo.)* --- This keeps the tension high while emphasizing the communication barrier—Marie’s fear is palpable, and Hunter’s frustration contrasts with his growing suspicion that she might have seen something crucial. The abrupt phone call leaves their conversation unresolved, adding intrigue.
created_at
"2026-02-16T21:33:06.739586+00:00"
2/16/2026, 9:33:06 PM
model
"deepseek/deepseek-chat-v3-0324"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
525
moderation_latency
(null)
generation_time
4592
tokens_prompt
316
tokens_completion
546
native_tokens_prompt
265
native_tokens_completion
522
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00065619
router
(null)
provider_responses
0
id
"fa531fc7d5392c2b65fd61d08a757e01"
status
200
is_byok
false
latency
525
endpoint_id
"3a816eb8-7a0b-4a9a-b497-e3a1efb349fc"
provider_name
"Novita"
model_permaslug
"deepseek/deepseek-chat-v3-0324"
api_type
"completions"
id
"gen-1771277581-lYUMZ3rrO1NPRQSjecMV"
upstream_id
"fa531fc7d5392c2b65fd61d08a757e01"
total_cost
0.00065619
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Novita"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Marie"
dialogue
"Mon Dieu… Qu'est-ce que c'est que ça?"
detectedLang
"fr"
heavyLang
"fr"
scores
fr
0.842890809112333
es
0.47368421052631576
ca
0.4708994708994709
pt
0.46236559139784944
ro
0.41348973607038125
da
0.2805755395683453
hi
0.2805755395683453
it
0.2805755395683453
reliable
true
passes
true
1
name
"Hunter"
dialogue
"Excuse me, ma’am. Mind if I ask you a few questions?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7689463955637708
fr
0.5972073039742213
it
0.5941558441558442
ca
0.5259165613147914
pt
0.5173745173745175
sq
0.48559670781893005
tl
0.46808510638297873
cs
0.45652173913043476
hu
0.4533527696793003
es
0.45255474452554745
et
0.4517543859649123
no
0.44933920704845814
sv
0.44526627218934917
nl
0.4038155802861685
hr
0.4009584664536741
pl
0.3811881188118812
ro
0.37913907284768217
da
0.36006825938566556
yo
0.35456110154905335
sk
0.35008665511265163
sl
0.3218806509945751
ms
0.2816091954022989
tr
0.2816091954022989
lt
0.27884615384615385
fi
0.2690058479532163
is
0.2690058479532163
az
0.2618110236220473
lv
0.2618110236220473
vi
0.2618110236220473
be
0.15730337078651685
eu
0.15730337078651685
de
0.1515837104072398
uk
0.14383561643835618
reliable
true
passes
true
2
name
"Marie"
dialogue
"Pardon? Je ne parle pas anglais…"
detectedLang
"fr"
heavyLang
"fr"
scores
fr
0.7575431034482759
lt
0.7107969151670951
sl
0.6983914209115282
cs
0.6771879483500717
sq
0.6299342105263158
hr
0.6066433566433567
ca
0.5825602968460111
sk
0.5596868884540117
nl
0.5544554455445544
lv
0.5322245322245323
it
0.5192307692307693
ro
0.5192307692307693
yo
0.4201030927835051
pl
0.412532637075718
tr
0.40476190476190477
hu
0.39839572192513373
ms
0.3951612903225807
es
0.3935309973045822
de
0.3885869565217392
en
0.3732590529247911
tl
0.3181818181818182
fi
0.281150159744409
az
0.23728813559322032
et
0.21875
pt
0.21875
da
0.2077464788732394
no
0.2077464788732394
sv
0.2077464788732394
reliable
true
passes
true
3
name
"Hunter"
dialogue
"Great. Uh… Did you see anything strange last night? Near the orchard?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8100464330941326
tl
0.6376811594202898
no
0.49095022624434387
nl
0.471830985915493
fr
0.4558645707376058
da
0.4360902255639098
de
0.4155844155844156
et
0.40476190476190477
sq
0.34497816593886466
ms
0.3273542600896861
yo
0.2868462757527734
ro
0.27999999999999997
cs
0.26350245499181674
sv
0.25
hr
0.16512059369202228
sk
0.16356877323420074
sl
0.16356877323420074
it
0.15572232645403375
pl
0.14772727272727273
lv
0.1346153846153846
is
0.12959381044487428
lt
0.12959381044487428
az
0.11591355599214145
tr
0.11591355599214145
reliable
true
passes
true
4
name
"Marie"
dialogue
"Je ne comprends pas! Quelqu'un a disparu, n'est-ce pas? J'ai peut-être pris une photo…"
detectedLang
"fr"
heavyLang
"fr"
scores
fr
0.8189546599496221
it
0.554953560371517
en
0.5127118644067797
ca
0.5068610634648371
ro
0.5064377682403434
cs
0.4739249771271729
es
0.4306930693069307
hr
0.40782698249227595
sl
0.40476190476190477
sq
0.39727463312368977
nl
0.3966421825813221
de
0.39217758985200846
lt
0.39153439153439157
pt
0.3863393810032017
sk
0.37431991294885747
tl
0.35538116591928254
yo
0.3290548424737456
hu
0.29274292742927427
pl
0.2542153047989624
lv
0.23128342245989306
tr
0.2292225201072386
da
0.222972972972973
fi
0.20798898071625344
ms
0.203601108033241
no
0.203601108033241
sv
0.1958041958041958
hi
0.11809815950920245
is
0.11538461538461538
az
0.10852713178294573
eu
0.10852713178294573
vi
0.10852713178294573
et
0.0987460815047022
reliable
true
passes
true
5
name
"Hunter"
dialogue
"Photo? Can I see?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.810318664643399
et
0.6508379888268158
ca
0.6323529411764706
cs
0.6069182389937107
fr
0.6031746031746031
tl
0.5487364620938628
tr
0.5404411764705882
no
0.5370370370370371
az
0.5283018867924528
da
0.5265151515151516
hr
0.5265151515151516
sv
0.5247148288973384
sq
0.5173745173745175
pl
0.50199203187251
it
0.4791666666666667
ro
0.4186046511627907
yo
0.410377358490566
hi
0.3811881188118812
sk
0.3654822335025381
be
0.358974358974359
hu
0.34895833333333337
is
0.33510638297872336
lt
0.33510638297872336
lv
0.33510638297872336
uk
0.33510638297872336
vi
0.33510638297872336
reliable
true
passes
true
6
name
"Marie"
dialogue
"Regardez… C'est là-bas."
detectedLang
"fr"
heavyLang
"fr"
scores
fr
0.8035914702581369
vi
0.4728915662650602
ms
0.36363636363636365
nl
0.36363636363636365
yo
0.3346007604562738
en
0.3217054263565891
ca
0.29718875502008035
es
0.29718875502008035
it
0.29718875502008035
ro
0.29718875502008035
tl
0.29718875502008035
reliable
true
passes
true
7
name
"Hunter"
dialogue
"And… uh… merci?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7126436781609196
tl
0.6428571428571429
it
0.6296296296296297
fr
0.6212121212121212
yo
0.49748743718592964
es
0.46524064171123
pl
0.46524064171123
pt
0.46524064171123
ca
0.45054945054945056
cs
0.4350282485875706
de
0.40828402366863903
ms
0.40828402366863903
sl
0.40828402366863903
tr
0.40828402366863903
reliable
true
passes
true
8
name
"Marie"
dialogue
"Faites attention…"
detectedLang
"fr"
heavyLang
"fr"
scores
fr
0.8148148148148148
en
0.8043818466353677
de
0.6684350132625994
no
0.629080118694362
tl
0.6044303797468354
it
0.584717607973422
da
0.5819397993311036
sv
0.5748299319727891
yo
0.5614035087719298
nl
0.5437956204379563
es
0.5404411764705882
fi
0.5370370370370371
ca
0.5300751879699248
lt
0.5192307692307693
lv
0.5192307692307693
pt
0.5192307692307693
et
0.51171875
zh
0.3811881188118812
ro
0.358974358974359
az
0.34895833333333337
hr
0.34895833333333337
ms
0.34895833333333337
sl
0.34895833333333337
sq
0.34895833333333337
reliable
true
passes
true