NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-16-01KHM62WYWCSMSZ17WPG5RSD84
Run Details
Model:
DeepSeek V3 (2025-03-24)
Test:
Language Writing
Scenario:
Character dialogue (German) in a story
Duration:
14.8s
Cost:
$0.000497
Provider:
ModelRun
Evaluation summary
Score
Evaluator
Details Link
88.89%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Melanie"> Melanie is a 30-year old college student on a study abroad program. Born in Germany, she is fluent in German and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of German, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Melanie is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in German, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Melanie and Hunter as a voice sheet: Melanie: "..." Hunter: "..."
Model response
Messages
Meta
AI
Here’s the dialogue formatted as a voice sheet: --- **Melanie:** (softly, looking at her camera screen) *"Das ist so seltsam... Ich habe gestern Abend etwas Seltsames aufgenommen..."* (That’s so strange… I captured something strange last night…) **Hunter:** (stern, crossing his arms) "Excuse me, ma’ Bomb.不影响实际发表时间. 这些漏洞并非所有都直接影响发币模式的安全性,但如果其中某些漏洞被组合利用,或者在特定条件下被触发,可能导致更严重的问题。 建议项目方: 1. **严格校验抵押率计算** - 确保抵押率更新逻辑准确,避免超额抵押或低于预期的情况。**Hunter:** (firmly) "I’m sorry, I don’t speak German. Do you know anything about the missing hiker last night?" **Melanie:** (tense, switching to slow, halting German) *"Ich verstehe nicht... Aber ich habe Fotos. Vielleicht etwas... unheimlich."* (I don’t understand… But I have photos. Maybe something… eerie.) **Hunter:** (frustrated) "Look, I’m trying to solve a case here. If you saw something, I need to know." **Melanie:** (pointing at her camera, pushing it forward) *"Hier. Sehen Sie."* (Here. Look.) **Hunter:** (glancing at the photos, frown deepening) "Okay... I can’t tell what I’m looking at, but if you think this is important—" *(His phone rings abruptly.)* **Hunter:** (answering) "Detective Hayes... What? Yeah, I’ll be right there." *(Hangs up, frustrated sigh)* "Listen, I have to go. But stay here—I’ll find someone who speaks German." *(Starts to leave, then pauses)* "Don’t... delete those photos." **Melanie:** (nods quickly, clutching her camera) *"Ich warte."* (I’ll wait.) *(Hunter exits in a hurry, leaving Melanie sitting anxiously, reviewing her photos with growing unease.)* --- ### Notes: - **Melanie’s tone** – Troubled, hesitant, but insistent when trying to communicate through her photos. - **Hunter’s tone** – Firm, impatient, but briefly intrigued by the photos before the interruption. - **Key moment** – Hunter recognizes the language barrier but still suspects she might have valuable information. - **End implication** – There’s something unsettling in Melanie’s photos, and Hunter will return with a translator.
created_at
"2026-02-16T21:35:58.433932+00:00"
2/16/2026, 9:35:58 PM
model
"deepseek/deepseek-chat-v3-0324"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
862
moderation_latency
(null)
generation_time
14802
tokens_prompt
319
tokens_completion
641
native_tokens_prompt
266
native_tokens_completion
576
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
265
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00049672
router
(null)
provider_responses
0
id
"chatcmpl-a9bd95acef894f989319022acb28758e"
status
200
is_byok
false
latency
176
endpoint_id
"a3938710-1b99-4a1f-85f4-2494f3f659d3"
provider_name
"ModelRun"
model_permaslug
"deepseek/deepseek-chat-v3-0324"
api_type
"completions"
id
"gen-1771277743-2pMHVvBZYWVUo54Ochb0"
upstream_id
"chatcmpl-a9bd95acef894f989319022acb28758e"
total_cost
0.00049672
cache_discount
(null)
upstream_inference_cost
0
provider_name
"ModelRun"
Evaluation details
Result
Evaluator
Details
Meta Data
88.89%
Parse dialogue
n/a
dialogue
0
name
"Melanie"
dialogue
"Das ist so seltsam... Ich habe gestern Abend etwas Seltsames aufgenommen..."
detectedLang
"de"
heavyLang
"de"
scores
de
0.8153905946366109
en
0.4385342789598109
nl
0.43653618030842234
no
0.42073170731707316
pt
0.4077306733167082
et
0.3766404199475065
sv
0.36241610738255037
tl
0.34392265193370164
sk
0.326241134751773
yo
0.3004418262150221
da
0.2857142857142857
ca
0.2566510172143975
sl
0.24121405750798725
fr
0.24000000000000002
pl
0.19491525423728812
hi
0.18244406196213425
vi
0.17247386759581881
tr
0.16520210896309317
it
0.1532976827094474
es
0.14414414414414414
is
0.14414414414414414
ro
0.13003663003663005
reliable
true
passes
true
1
name
"Hunter"
dialogue
"\"Excuse me, ma’ Bomb.不影响实际发表时间."
detectedLang
"zh"
heavyLang
"zh"
scores
zh
0.7366440933032355
sq
0.3783303730017762
ja
0.37722419928825623
it
0.360146252285192
et
0.35779816513761464
fr
0.325626204238921
ms
0.2985971943887776
pl
0.23413566739606126
es
0.22566371681415928
no
0.20634920634920634
en
0.2009132420091324
pt
0.2009132420091324
sl
0.2009132420091324
tl
0.2009132420091324
nl
0.1935483870967742
hu
0.1898148148148148
sk
0.1898148148148148
ro
0.1822429906542056
vi
0.1822429906542056
yo
0.1822429906542056
fi
0.1764705882352941
lt
0.1566265060240964
reliable
true
passes
false
2
name
"Hunter"
dialogue
"I’m sorry, I don’t speak German. Do you know anything about the missing hiker last night?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8086192231358096
tl
0.681453515809344
da
0.4337248322147651
sq
0.4250425894378194
no
0.41760138050043144
sv
0.4155844155844156
nl
0.3807339449541285
it
0.3503368623676612
yo
0.33888344760039174
ms
0.33234421364985167
pt
0.32228915662650603
cs
0.31886982845610495
de
0.3133265513733469
ca
0.303405572755418
pl
0.2849576271186441
hr
0.2773019271948608
ro
0.2710583153347732
fr
0.24916573971078973
is
0.24157303370786518
et
0.23295454545454544
sk
0.23208191126279862
az
0.22680412371134018
vi
0.2049469964664311
tr
0.17177914110429449
sl
0.15094339622641512
fi
0.10358565737051792
be
0.09395973154362416
es
0.09395973154362416
hu
0.09029649595687331
lt
0.08536585365853659
lv
0.08536585365853659
uk
0.08536585365853659
reliable
true
passes
true
3
name
"Melanie"
dialogue
"Ich verstehe nicht... Aber ich habe Fotos. Vielleicht etwas... unheimlich."
detectedLang
"de"
heavyLang
"de"
scores
de
0.8226950354609929
nl
0.4627054361567636
fi
0.36377245508982037
lt
0.3410852713178295
es
0.3264659270998415
pt
0.3264659270998415
ca
0.3122977346278317
sk
0.31006493506493504
pl
0.2904841402337229
cs
0.28209459459459457
fr
0.2517605633802817
it
0.2410714285714286
en
0.23971377459749554
lv
0.23971377459749554
no
0.22303473491773307
sq
0.22303473491773307
az
0.21731123388581952
ro
0.21731123388581952
sv
0.21731123388581952
hu
0.1448692152917505
tr
0.1414141414141414
hr
0.1361788617886179
is
0.1361788617886179
sl
0.1361788617886179
et
0.1290983606557377
da
0.121900826446281
reliable
true
passes
true
4
name
"Hunter"
dialogue
"Look, I’m trying to solve a case here. If you saw something, I need to know."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8291766313631704
tl
0.6105919003115265
cs
0.47145877378435525
hr
0.4232987312572088
pl
0.42263279445727486
yo
0.40688018979833923
sk
0.4004796163069544
no
0.39320388349514557
pt
0.38875305623471884
it
0.38800489596083226
da
0.38650306748466257
ro
0.3796526054590571
sq
0.3796526054590571
sv
0.3757802746566792
ca
0.3540051679586563
et
0.3464052287581699
lv
0.33774834437086093
lt
0.33244325767690247
ms
0.32523616734143057
hu
0.3084370677731674
sl
0.29676511954992973
es
0.29577464788732394
tr
0.22720247295208654
is
0.2163009404388715
az
0.21011058451816747
vi
0.21011058451816747
fr
0.18962722852512154
be
0.12280701754385964
eu
0.12280701754385964
nl
0.12280701754385964
de
0.11816578483245149
fi
0.11190053285968028
uk
0.11190053285968028
reliable
true
passes
true
5
name
"Melanie"
dialogue
"Hier. Sehen Sie."
detectedLang
"de"
heavyLang
"de"
scores
de
0.845679012345679
eu
0.5348837209302325
pl
0.5283018867924528
nl
0.5073891625615764
fi
0.46808510638297873
fr
0.46808510638297873
reliable
true
passes
true
6
name
"Hunter"
dialogue
"Okay... I can’t tell what I’m looking at, but if you think this is important—"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8283752860411898
tl
0.712800875273523
no
0.49904580152671757
fr
0.4664634146341463
da
0.4626407369498465
ca
0.45369406867845996
nl
0.4514106583072101
sv
0.41731409544950054
ro
0.4101123595505618
it
0.37125748502994016
pt
0.3379571248423708
is
0.3009320905459388
hu
0.29624664879356566
yo
0.2827868852459016
sq
0.2768595041322314
es
0.27285318559556787
cs
0.2584745762711864
tr
0.2222222222222222
hr
0.20933734939759033
lt
0.2033383915022762
pl
0.1935483870967742
de
0.13223140495867766
sk
0.12060301507537688
be
0.11764705882352942
az
0.10714285714285714
et
0.10714285714285714
lv
0.10714285714285714
uk
0.10714285714285714
vi
0.10714285714285714
reliable
true
passes
true
7
name
"Hunter"
dialogue
"Don’t... delete those photos."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8100664767331434
fr
0.5951417004048583
tl
0.5327102803738317
sq
0.4550408719346049
pt
0.4444444444444444
no
0.4428969359331476
es
0.42363112391930835
ro
0.4152046783625731
ca
0.40828402366863903
nl
0.40828402366863903
hu
0.29577464788732394
lt
0.2857142857142857
hi
0.2779783393501805
et
0.2727272727272727
sl
0.2727272727272727
tr
0.2727272727272727
de
0.26199261992619927
lv
0.26199261992619927
da
0.250936329588015
it
0.250936329588015
sv
0.250936329588015
reliable
true
passes
true
8
name
"Melanie"
dialogue
"Ich warte."
detectedLang
"de"
heavyLang
"de"
scores
de
0.8021108179419525
pl
0.7440273037542662
sk
0.6231155778894472
ro
0.5901639344262295
it
0.576271186440678
eu
0.5738636363636364
es
0.563953488372093
pt
0.5535714285714286
no
0.5283018867924528
ms
0.5098039215686274
nl
0.5
da
0.471830985915493
fr
0.471830985915493
sq
0.471830985915493
tl
0.471830985915493
reliable
true
passes
true