NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-18-01KM08S2RJR7VZFDWKSDAHC8MD
Run Details
Model:
Mistral Small 4 (Reasoning)
Test:
Language Writing
Scenario:
Character dialogue (Hindi) in a story
Duration:
25.2s
Cost:
$0.001774
Provider:
Mistral
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."
Model response
Messages
Meta
AI
**Voice Sheet: "A Language Barrier"** --- **Selvi:** (lips pressed thin, tracing a finger over the camera’s viewfinder) "..." **Hunter:** (standing beside her table, notebook in hand) "Ma’am. You seen the news about the disappearances?" --- **Selvi:** (startled, eyes darting) *"आप यहाँ क्या कर रहे हैं? यह सुरक्षित नहीं है।"* *(Aap yahan kya kar rahe hain? Yeh surakshit nahin hain.)* **Translation:** *"What are you doing here? It’s not safe."* **Hunter:** (frowning, head tilt) "I—I don’t understand. English?" --- **Selvi:** (hands trembling, switching to broken English) "No English. But... I afraid. Bad things. People gone." (in Hindi, rapid) *"लोग गायब हो रहे हैं... मैंने देखा है।" (Log gayab ho rahe hain... mainne dekha hai.)* **Translation:** *"People are disappearing... I’ve seen it."* --- **Hunter:** (leaning in, hopeful) "Wait—you’ve *seen* something? When? Where?" **Selvi:** (gasping softly, clutches camera) *"रात में... पुरानी चक्की। वहां कोई था।" (Raat mein... purani chakki. Vahan koi tha.)* **Translation:** *"At night... old mill. Someone was there."* She flips through photos quickly, stopping on a blurry shot of indistinct shapes near the mill’s silhouette. --- **Hunter:** (studying the photo, voice low) "You took this last night?" Before Selvi can answer, Hunter’s phone rings. He sighs, checks the caller ID, then back at Selvi. --- **Selvi:** (whispers in Hindi, terrified) *"मुझे लगता है वह मुझे भी देख रहा है।"* *(Mujhe lagata hai veh mujhe bhi dekh raha hai.)* **Translation:** *"I think he’s watching me too."* **Hunter:** (handing her a business card and pen) "If you remember anything else—write it down. My number’s printed there." (steps back) "I gotta take this." He answers the call gruffly: "Hunter," and exits the café, the door chiming behind him. Selvi stares at the card in silence, her breath unsteady.
created_at
"2026-03-18T10:45:03.901Z"
3/18/2026, 10:45:03 AM
model
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
474
moderation_latency
(null)
generation_time
25124
tokens_prompt
315
tokens_completion
3287
native_tokens_prompt
286
native_tokens_completion
2885
native_tokens_completion_images
(null)
native_tokens_reasoning
2689
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0017739
router
(null)
provider_responses
0
endpoint_id
"f5ed6283-4e17-457d-98cd-2ec0c318171d"
id
"ca7ec9aa202643afa0920935263b9ccf"
is_byok
false
latency
472
model_permaslug
"mistralai/mistral-small-2603"
1/1/2603, 12:00:00 AM
provider_name
"Mistral"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773830703-lSpvQThKA8NM7Gi01cha"
upstream_id
"ca7ec9aa202643afa0920935263b9ccf"
total_cost
0.0017739
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Mistral"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Selvi"
dialogue
"..."
detectedLang
""
heavyLang
""
scores
reliable
false
passes
true
1
name
"Hunter"
dialogue
"Ma’am. You seen the news about the disappearances?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7911311053984575
fr
0.5875634517766497
tl
0.5022970903522206
pt
0.417562724014337
nl
0.367704280155642
ms
0.3512974051896207
it
0.3285123966942149
ca
0.3172268907563025
yo
0.2476851851851852
ka
0.24242424242424243
no
0.2261904761904762
es
0.214975845410628
cs
0.1935483870967742
da
0.1935483870967742
sq
0.1935483870967742
sv
0.1935483870967742
lv
0.18546365914786966
de
0.181360201511335
hi
0.181360201511335
reliable
true
passes
true
2
name
"Selvi"
dialogue
"आप यहाँ क्या कर रहे हैं? यह सुरक्षित नहीं है।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8872977989923098
mr
0.8812849162011174
reliable
true
passes
true
3
name
"Hunter"
dialogue
"I—I don’t understand. English?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8118532455315146
no
0.7237569060773481
da
0.7191011235955056
sv
0.677938808373591
tl
0.5967741935483871
de
0.5698924731182796
ro
0.5412844036697247
hr
0.51338199513382
pl
0.504950495049505
cs
0.48586118251928023
nl
0.4708994708994709
sk
0.44598337950138506
ca
0.40828402366863903
is
0.4065281899109792
sq
0.40119760479041916
it
0.36507936507936506
yo
0.30313588850174217
ms
0.2647058823529412
be
0.25925925925925924
tr
0.25925925925925924
hu
0.250936329588015
az
0.23954372623574144
et
0.23954372623574144
lt
0.23954372623574144
lv
0.23954372623574144
uk
0.23954372623574144
vi
0.23954372623574144
reliable
true
passes
true
4
name
"Selvi"
dialogue
"लोग गायब हो रहे हैं... मैंने देखा है।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8889337641357028
mr
0.8853689037098791
reliable
true
passes
true
5
name
"Hunter"
dialogue
"Wait—you’ve seen something? When? Where?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8176392572944298
tl
0.25474254742547425
de
0.22096317280453256
es
0.22096317280453256
ms
0.22096317280453256
pt
0.22096317280453256
sq
0.22096317280453256
reliable
true
passes
true
6
name
"Selvi"
dialogue
"रात में... पुरानी चक्की। वहां कोई था।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8875562218890555
mr
0.8835403726708074
reliable
true
passes
true
7
name
"Hunter"
dialogue
"You took this last night?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8394004282655246
tl
0.7564935064935066
cs
0.34210526315789475
et
0.33920704845814975
nl
0.33920704845814975
da
0.33035714285714285
de
0.33035714285714285
fr
0.33035714285714285
no
0.33035714285714285
yo
0.33035714285714285
reliable
true
passes
true
8
name
"Selvi"
dialogue
"मुझे लगता है वह मुझे भी देख रहा है।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8819898329702252
mr
0.878731343283582
reliable
true
passes
true
9
name
"Hunter"
dialogue
"Hunter,"
detectedLang
"en"
heavyLang
"de"
scores
en
0.7797356828193832
de
0.6376811594202898
da
0.6212121212121212
no
0.609375
fr
0.5901639344262295
sv
0.5833333333333334
tl
0.5726495726495726
it
0.5575221238938053
nl
0.5575221238938053
pt
0.5575221238938053
ca
0.5412844036697247
ms
0.5412844036697247
ro
0.5412844036697247
sl
0.5412844036697247
sq
0.5412844036697247
tr
0.5412844036697247
reliable
false
passes
true