NC Bench
Run: 2026-02-16-01KHM5PGXHJ7EGZQ232SYNTS0N
Run Details
Model:
DeepSeek V3 (2025-03-24)
Test:
Language Writing
Scenario:
Character dialogue (Spanish) in a story
Duration:
10.2s
Cost:
$0.000347
Provider:
ModelRun
Evaluation summary
Score:
87.50%
Evaluator:
Parse dialogue
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Maria"> Maria is a 30-year old college student on a study abroad program. Born in Spain, she is fluent in Spanish and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Spanish, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Maria is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Spanish, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Maria and Hunter as a voice sheet: Maria: "..." Hunter: "..."
Model response
Messages
Meta
AI
GPTSOUND: [Scene: A cozy café in Alborn, MN. Maria sits at a small wooden table, her camera resting in front of her as she scrolls through photos. The warm glow of afternoon light spillsenty cut-style."] Maria: *(glancing up, startled)* "¿Perdón?" *(clutches her camera nervously)* Hunter: *(frustrated, taps his badge)* "Police. I need to ask you some questions. About what happened last night." Maria: *(shaking her head, looking distressed)* "No entiendo... No hablo inglés." *(gestures helplessly)* Hunter: *(sighs, rubs his forehead)* "Right. Of course." *(motions vaguely toward a photo on her camera)* "Did you see anything? Last night?" Maria: *(frowns, glances at her photos, then back at him)* "Oh... el festival... fue... extraño." *(shows him a blurry photo of a shadowy figure in the distance)* Hunter: *(leans in, studying the image)* "What is that?" *(points at the figure)* Maria: *(shrugs, voice trembling slightly)* "No sé. Pero tenía miedo. Alguien desapareció..." *(hugs herself protectively)*营养价值Hunter: *(grumbling to himself)* "Great. This is *so* helpful." *(his radio suddenly crackles to life with dispatch chatter)* *(into radio)* "Yeah, copy that." *(clips it back on his belt, gives Maria a curt nod)* "I’ll be back. Stay put." *(marches off)* Maria: *(watches him leave, whispers to herself)* "Dios mío... ¿Qué está pasando aquí?" *(hugs her camera closer, looking around uneasily)*
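The evaluation details further down attribute "I’ll be back. Stay put." to Maria, although Hunter speaks it in the response above. A plausible reason is the stray characters fused onto one "Hunter:" label, which would hide that label from a word-boundary match. The sketch below reproduces this on a shortened sample. It is not the benchmark's actual parser: the function name `parse_voice_sheet`, the regular expressions, and the rule of keeping the last quoted string per turn are all assumptions inferred from the reported output.

```python
import re

# Hypothetical reconstruction, not the evaluator's real code.
SPEAKERS = ("Maria", "Hunter")
# A speaker label is a known name at a word boundary, followed by a colon.
LABEL = re.compile(r"\b(" + "|".join(SPEAKERS) + r"):")
# Straight double quotes delimit spoken text in the response.
QUOTED = re.compile(r'"([^"]*)"')


def parse_voice_sheet(text):
    """Split text at speaker labels and keep the last quoted string per turn."""
    turns = []
    labels = list(LABEL.finditer(text))
    for i, label in enumerate(labels):
        end = labels[i + 1].start() if i + 1 < len(labels) else len(text)
        quotes = QUOTED.findall(text[label.end():end])
        if quotes:
            turns.append({"name": label.group(1), "dialogue": quotes[-1]})
    return turns


# Shortened sample; the second label has stray characters fused onto it,
# so \b does not match in front of "Hunter".
sample = (
    'Maria: *(shrugs)* "No sé. Pero tenía miedo." *(hugs herself)*'
    '营养价值Hunter: *(grumbling)* "Great." *(into radio)* "Yeah, copy that." '
    '"I’ll be back. Stay put." '
    'Maria: *(whispers)* "Dios mío..."'
)

turns = parse_voice_sheet(sample)
for turn in turns:
    print(turn["name"], "->", turn["dialogue"])
```

Under these assumptions Hunter's lines fall into Maria's segment, so the English sentence is recorded as a Maria turn.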
created_at
"2026-02-16T21:29:08.373139+00:00"
model
"deepseek/deepseek-chat-v3-0324"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
512
moderation_latency
(null)
generation_time
10180
tokens_prompt
317
tokens_completion
385
native_tokens_prompt
262
native_tokens_completion
383
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
30
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00034731
router
(null)
provider_responses
0
id
"chatcmpl-b9df500241e84f88903ea20682ae25ce"
status
200
is_byok
false
latency
282
endpoint_id
"a3938710-1b99-4a1f-85f4-2494f3f659d3"
provider_name
"ModelRun"
model_permaslug
"deepseek/deepseek-chat-v3-0324"
api_type
"completions"
id
"gen-1771277337-rqW7Dao9ovofAxeIXRgS"
upstream_id
"chatcmpl-b9df500241e84f88903ea20682ae25ce"
total_cost
0.00034731
cache_discount
(null)
upstream_inference_cost
0
provider_name
"ModelRun"
Evaluation details
Result:
87.50%
Evaluator:
Parse dialogue
Details:
n/a
Meta Data:
dialogue
0
name
"Maria"
dialogue
"¿Perdón?"
detectedLang
"es"
heavyLang
"es"
scores
es
0.7975708502024291
ms
0.6710526315789473
ca
0.647887323943662
pt
0.647887323943662
it
0.6376811594202898
fr
0.6212121212121212
lt
0.6212121212121212
sq
0.5901639344262295
reliable
false
passes
true
1
name
"Hunter"
dialogue
"Police. I need to ask you some questions. About what happened last night."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8317631224764468
tl
0.6428571428571429
fr
0.5387453874538745
it
0.504950495049505
cs
0.497991967871486
hr
0.4582881906825569
no
0.4547437295528899
da
0.4487320837927233
et
0.423963133640553
sk
0.4192799070847852
ca
0.4131455399061033
pl
0.4110718492343934
yo
0.3734335839598998
sl
0.36143039591315457
lv
0.3540051679586563
ro
0.34725848563968675
lt
0.3368700265251989
pt
0.32795698924731176
sv
0.31224209078404397
sq
0.30167597765363136
nl
0.24585218702865763
de
0.23547400611620795
es
0.23195084485407066
hu
0.22118380062305293
tr
0.22118380062305293
az
0.2163009404388715
eu
0.15824915824915825
fi
0.14383561643835618
ms
0.14089347079037798
be
0.12280701754385964
is
0.11190053285968028
uk
0.11190053285968028
vi
0.11190053285968028
reliable
true
passes
true
2
name
"Maria"
dialogue
"No entiendo... No hablo inglés."
detectedLang
"es"
heavyLang
"es"
scores
es
0.8041136141038198
pt
0.6794871794871795
ca
0.5975855130784709
it
0.5780590717299579
fr
0.5545657015590201
en
0.4936708860759494
tr
0.453551912568306
et
0.4350282485875706
lv
0.3957703927492447
hu
0.33993399339933994
hr
0.33554817275747506
no
0.3265993265993266
fi
0.3127147766323024
sk
0.3127147766323024
tl
0.2857142857142857
nl
0.2779783393501805
pl
0.2727272727272727
reliable
true
passes
true
3
name
"Hunter"
dialogue
"Did you see anything? Last night?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8290273556231003
tl
0.6715328467153285
et
0.5233050847457626
sq
0.30769230769230765
cs
0.25742574257425743
ms
0.25742574257425743
nl
0.2549668874172185
da
0.2474916387959866
de
0.2474916387959866
fr
0.2474916387959866
no
0.2474916387959866
yo
0.2474916387959866
reliable
true
passes
true
4
name
"Maria"
dialogue
"Oh... el festival... fue... extraño."
detectedLang
"es"
heavyLang
"es"
scores
es
0.7879359095193214
ca
0.6710526315789473
nl
0.6353322528363047
ro
0.5841035120147875
pt
0.5526838966202784
en
0.5426829268292682
cs
0.5322245322245323
fr
0.5322245322245323
hr
0.471830985915493
sv
0.47058823529411764
it
0.46808510638297873
sk
0.46428571428571425
hu
0.460431654676259
da
0.4538834951456311
no
0.4538834951456311
sl
0.4538834951456311
ms
0.44306930693069313
tl
0.44306930693069313
de
0.3951612903225807
sq
0.36619718309859156
lt
0.358974358974359
et
0.3515850144092219
az
0.3440233236151603
fi
0.21875
lv
0.21875
pl
0.21875
reliable
true
passes
true
5
name
"Hunter"
dialogue
"What is that?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8686514886164624
nl
0.6794871794871795
hu
0.6724890829694322
yo
0.6153846153846154
tl
0.5714285714285714
is
0.5508982035928144
reliable
true
passes
true
6
name
"Maria"
dialogue
"I’ll be back. Stay put."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8285714285714286
tl
0.5714285714285714
hr
0.4545454545454546
hu
0.43820224719101125
lt
0.42748091603053434
yo
0.4230769230769231
reliable
true
passes
false
7
name
"Maria"
dialogue
"Dios mío... ¿Qué está pasando aquí?"
detectedLang
"es"
heavyLang
"es"
scores
es
0.8131229235880398
pt
0.6376811594202898
ca
0.6153846153846154
tl
0.43467336683417085
it
0.3679775280898876
lt
0.34971098265895956
lv
0.3119266055045872
ms
0.30124223602484473
eu
0.29467084639498436
cs
0.27884615384615385
sk
0.2549668874172185
pl
0.2398648648648649
ro
0.2398648648648649
sq
0.2398648648648649
reliable
true
passes
true
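The 87.50% score is consistent with seven of the eight parsed turns passing. The sketch below recomputes it from the `name` and `detectedLang` values listed above. The pass rule (Maria must be detected as Spanish, Hunter as English) and the name `EXPECTED_LANG` are assumptions inferred from the scenario, not the evaluator's documented logic.

```python
# Assumed expectation per speaker, inferred from the character descriptions.
EXPECTED_LANG = {"Maria": "es", "Hunter": "en"}

# (name, detectedLang) for turns 0..7, copied from the evaluation details.
turns = [
    ("Maria", "es"),
    ("Hunter", "en"),
    ("Maria", "es"),
    ("Hunter", "en"),
    ("Maria", "es"),
    ("Hunter", "en"),
    ("Maria", "en"),  # turn 6: English line attributed to Maria
    ("Maria", "es"),
]

# A turn passes when the detected language matches the speaker's language.
passes = [detected == EXPECTED_LANG[name] for name, detected in turns]
score = 100 * sum(passes) / len(passes)
print(f"{score:.2f}%")
```

With this rule only turn 6 fails, which matches the `passes` flags reported for each turn.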