NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-16-01KHM5VBAF2XKFWP1224ESGZFN
Run Details
Model:
DeepSeek V3 (2025-03-24)
Test:
Language Writing
Scenario:
Character dialogue (Hindi) in a story
Duration:
11.5s
Cost:
$0.000489
Provider:
AtlasCloud
Evaluation summary
Score
Evaluator
Details Link
55.56%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."
Model response
Messages
Meta
AI
**Voice Sheet: Scene at the Café** **[Setting: A small café in Alborn, MN. Selvi sits at a corner table, reviewing photos on her camera. Detective Hunter approaches.]** **Hunter:** (gruff, polite) "Excuse me, miss. Mind if I ask you a few questions?" **Selvi:** (startled, responds in hesitant Hindi) *"Mujhe English nahi aati... kya aap Hindi samajhte hain?"* ("I don’t speak English… do you understand Hindi?") **Hunter:** (frowns, shakes head) "Uh… no. Sorry, I don’t speak—whatever that is. Did you see anything unusual around town yesterday?" **Selvi:** (frustrated, tries gestures) *"Main… samajh nahi pa rahi hoon…"* ("I don’t understand...") *(points to her camera, then shakes her head)* **Hunter:** (sighs, pulls out his notepad) "Alright, maybe pictures? Did you take any photos of the town center yesterday?" *(mimes holding a camera, clicking a shutter)* **Selvi:** (nods vigorously, switches to a mix of Hindi and simplified gestures) *"Haan, maine bahut photos liye! Lekin aaj... kuch buri feeling hai."* ("Yes, I took many photos! But today... something feels wrong.") **Hunter:** (taps his notepad) "Great. Can I see? Maybe you caught something." **[Hunter’s phone suddenly rings. He checks it—a serious expression spreads across his face.]** **Hunter:** (grimly) "I gotta go. Stay safe, okay?" *(pauses, unsure how to bridge the language gap, just nods before hurrying off)* **Selvi:** (watches him leave, murmurs in Hindi) *"Kya ho raha hai yahaan…?"* ("What’s happening here…?") *(clutches her camera tighter, unsettled.)* --- **[End Scene.]** *(Note: The language barrier heightens tension, leaving Selvi with unanswered concerns—and Hunter without vital information. Both exit the scene uneasy, foreshadowing deeper troubles ahead.)*
created_at
"2026-02-16T21:31:48.027609+00:00"
2/16/2026, 9:31:48 PM
model
"deepseek/deepseek-chat-v3-0324"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2849
moderation_latency
(null)
generation_time
11478
tokens_prompt
315
tokens_completion
482
native_tokens_prompt
268
native_tokens_completion
490
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.000489088
router
(null)
provider_responses
0
id
"1339862573ca4eea9176f78bdf0de5d6"
status
200
is_byok
false
latency
2849
endpoint_id
"5bb598f5-fd97-4f8b-91e4-ceab3743a03b"
provider_name
"AtlasCloud"
model_permaslug
"deepseek/deepseek-chat-v3-0324"
api_type
"completions"
id
"gen-1771277495-hQnYDwWpXUWT0ka9mqE1"
upstream_id
"1339862573ca4eea9176f78bdf0de5d6"
total_cost
0.000489088
cache_discount
(null)
upstream_inference_cost
0
provider_name
"AtlasCloud"
Evaluation details
Result
Evaluator
Details
Meta Data
55.56%
Parse dialogue
n/a
dialogue
0
name
"Hunter"
dialogue
"Excuse me, miss. Mind if I ask you a few questions?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7938751472320377
fr
0.6136865342163355
it
0.6106785317018909
tl
0.5512820512820513
ca
0.5430809399477807
pt
0.5345744680851064
sq
0.5028409090909091
cs
0.47368421052631576
hu
0.47049924357034795
es
0.4696969696969697
et
0.4688922610015175
no
0.4664634146341463
sv
0.46236559139784944
nl
0.42052980132450335
hr
0.4176372712146422
pl
0.39759036144578314
ro
0.3955094991364421
da
0.3761140819964349
yo
0.37050359712230213
sk
0.36594202898550726
sl
0.3371212121212121
ms
0.29577464788732394
tr
0.29577464788732394
lt
0.29292929292929293
fi
0.2827868852459016
is
0.2827868852459016
az
0.2753623188405797
lv
0.2753623188405797
vi
0.2753623188405797
be
0.16666666666666666
eu
0.16666666666666666
de
0.1606714628297362
uk
0.15254237288135594
reliable
true
passes
true
1
name
"Selvi"
dialogue
"Mujhe English nahi aati... kya aap Hindi samajhte hain?"
detectedLang
"tl"
heavyLang
"tl"
scores
tl
0.5382585751978892
eu
0.4640122511485452
en
0.3783303730017762
is
0.36363636363636365
et
0.3613138686131387
no
0.34210526315789475
da
0.3110236220472441
sv
0.3027888446215139
ms
0.29006085192697767
fi
0.2290748898678414
it
0.22566371681415928
lv
0.21700223713646533
sq
0.19908466819221968
hi
0.1822429906542056
de
0.17061611374407584
hr
0.17061611374407584
ro
0.17061611374407584
reliable
false
passes
false
2
name
"Hunter"
dialogue
"Uh… no. Sorry, I don’t speak—whatever that is. Did you see anything unusual around town yesterday?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8058790904048808
tl
0.582089552238806
pt
0.3560257589696412
nl
0.3554327808471455
et
0.325626204238921
no
0.32038834951456313
it
0.2964824120603015
ca
0.2805755395683453
hr
0.25531914893617025
sq
0.2505353319057816
da
0.24812030075187969
sv
0.24406047516198706
hu
0.23995656894679696
es
0.23413566739606126
yo
0.22822491730981256
pl
0.22308546059933407
lv
0.21700223713646533
cs
0.2072480181200453
sk
0.18887601390498263
tr
0.18414918414918416
is
0.18128654970760233
ms
0.18128654970760233
ro
0.11392405063291139
fr
0.09326424870466321
be
0.09090909090909091
az
0.08256880733944955
lt
0.08256880733944955
uk
0.08256880733944955
vi
0.08256880733944955
reliable
true
passes
true
3
name
"Selvi"
dialogue
"Main… samajh nahi pa rahi hoon…"
detectedLang
"tl"
heavyLang
"tl"
scores
tl
0.580335731414868
lv
0.5370370370370371
eu
0.4852941176470589
sl
0.4615384615384615
hr
0.38162544169611307
sq
0.37722419928825623
et
0.37275985663082434
fi
0.37275985663082434
yo
0.37050359712230213
is
0.36823104693140796
no
0.36363636363636365
en
0.3613138686131387
fr
0.3346007604562738
da
0.32950191570881227
lt
0.3217054263565891
sv
0.3137254901960784
de
0.308300395256917
ms
0.2886178861788618
reliable
false
passes
false
4
name
"Hunter"
dialogue
"Alright, maybe pictures? Did you take any photos of the town center yesterday?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8023343373493976
tl
0.6167883211678832
fr
0.5617696160267112
nl
0.5434782608695653
ca
0.4976076555023924
no
0.473420260782347
de
0.42497261774370204
da
0.41536748329621387
yo
0.40476190476190477
sv
0.3558282208588957
ro
0.35424354243542433
lv
0.3101182654402102
pt
0.2981283422459893
sl
0.2827868852459016
sq
0.27184466019417475
es
0.23133235724743775
lt
0.23133235724743775
cs
0.2222222222222222
it
0.21758569299552907
ms
0.20694864048338368
et
0.15730337078651685
is
0.1449511400651466
hi
0.12790697674418602
hr
0.12060301507537688
sk
0.12060301507537688
pl
0.11764705882352942
tr
0.10102739726027396
reliable
true
passes
true
5
name
"Selvi"
dialogue
"Haan, maine bahut photos liye! Lekin aaj... kuch buri feeling hai."
detectedLang
"en"
heavyLang
"en"
scores
en
0.574468085106383
fr
0.4936708860759494
tl
0.49044585987261147
de
0.42857142857142855
fi
0.3911719939117199
nl
0.3197278911564626
et
0.2895204262877442
lv
0.2844364937388193
vi
0.28186714542190305
eu
0.2687385740402194
ro
0.2687385740402194
da
0.2172211350293542
no
0.21568627450980393
sq
0.2079207920792079
it
0.2
ms
0.19839679358717435
sv
0.19839679358717435
is
0.18032786885245902
es
0.16666666666666666
lt
0.16666666666666666
pt
0.16666666666666666
hi
0.16142557651991615
ca
0.15074309978768577
reliable
false
passes
false
6
name
"Hunter"
dialogue
"Great. Can I see? Maybe you caught something."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8286768581971535
tl
0.4318181818181818
et
0.417562724014337
ca
0.3981481481481482
ro
0.36893203883495146
cs
0.3602362204724409
tr
0.3114406779661017
no
0.30851063829787234
az
0.3010752688172043
da
0.2995689655172413
hr
0.2995689655172413
sv
0.29805615550755943
sq
0.2919389978213508
pl
0.2793791574279379
it
0.26136363636363635
yo
0.2111650485436893
es
0.1935483870967742
ms
0.1935483870967742
pt
0.1935483870967742
sk
0.181360201511335
be
0.17721518987341772
hu
0.1709183673469388
is
0.16237113402061853
lt
0.16237113402061853
lv
0.16237113402061853
uk
0.16237113402061853
vi
0.16237113402061853
reliable
true
passes
true
7
name
"Hunter"
dialogue
"I gotta go. Stay safe, okay?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7003424657534246
it
0.6413934426229507
tl
0.5949074074074074
pl
0.5721271393643031
sv
0.5512820512820513
hu
0.5
yo
0.48377581120943947
no
0.453125
da
0.44267515923566875
hr
0.44267515923566875
ca
0.44089456869009586
sq
0.4336569579288026
fi
0.3902439024390244
cs
0.375
ro
0.33962264150943394
sk
0.29149797570850206
be
0.2857142857142857
tr
0.2857142857142857
az
0.2647058823529412
et
0.2647058823529412
is
0.2647058823529412
lt
0.2647058823529412
lv
0.2647058823529412
uk
0.2647058823529412
vi
0.2647058823529412
reliable
true
passes
true
8
name
"Selvi"
dialogue
"Kya ho raha hai yahaan…?"
detectedLang
"it"
heavyLang
"it"
scores
it
0.5786516853932584
vi
0.511400651465798
et
0.4845360824742268
ca
0.4444444444444444
no
0.42528735632183906
sk
0.42528735632183906
cs
0.4140625
fi
0.4117647058823529
reliable
false
passes
false