Run: 2026-02-08-01KGYZBK8SKTBZJW5D76YRPET5

Run Details

Duration:
1m 2s
Cost:
$0.008467
Provider:
Moonshot AI

Evaluation summary

Score Evaluator
Details Link
53.85% Parse dialogue (details)

Model input

System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."

Model response

AI
Hunter: "Miss? Detective Hunter, Alborn PD. I noticed you were photographing the orchard during the apple festival Saturday night. I need to ask about the disappearances." Selvi: [Clutching camera to chest, eyes wide] "Kya? Main... main aapki koi baat samajh nahi pa rahi hoon. Aap kaun?" Hunter: [Frowning] "I'm sorry, do you speak English at all? Anyone here who can translate?" Selvi: [Voice trembling, speaking rapidly] "Haan, main wahaan thi... raat ko... ped ke paas kuch tha... bahut bhayaanak! Main darr gayi hoon!" Hunter: "Slow down. I can't understand you. Are you saying you witnessed something? The missing persons?" Selvi: [Shaking head, holding out camera with trembling hands] "Nahi English! Dekhiye... yeh photo... wahaan ek aadmi tha... nahi, kuch aur... laal aankhein... oh Bhagwan!" Hunter: [Leaning in, confused] "Red eyes? What? Ma'am, is this evidence? Did you capture the perpetrator on camera?" Selvi: [Tears forming] "Woh mujhe dekh raha tha! Camera mein dekhiye! Mujhe bachaiye!" Hunter: [Phone rings] "Hunter. Yeah. What? When did they find the—? I'll be there in ten." Hunter: "Ma'am, I have to go. That's my partner. Do not leave this café. I need to see those photographs, but I have to respond to this call first." Selvi: [Standing up, desperate] "Kahan jaa rahe hain? Rukiye! Mujhe akela mat chhodiye! Woh phir se aayega!" Hunter: [Backing away] "I'll send Officer Chen—she speaks Hindi. Just stay here!" Selvi: [Calling after him] "Suniye! Please! Woh cheez insaan nahi hai!"
created_at"2026-02-08T15:55:07.309563+00:00" 2/8/2026, 3:55:07 PM
model"moonshotai/kimi-k2.5-0127"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency762
moderation_latency(null)
generation_time61727
tokens_prompt262
tokens_completion2736
native_tokens_prompt278
native_tokens_completion2813
native_tokens_completion_images(null)
native_tokens_reasoning2498
native_tokens_cached278
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.0084668
router(null)
provider_responses
0
id"chatcmpl-6988b19df07325e631b366df"
status200
is_byokfalse
latency762
endpoint_id"49762299-66fd-4514-b0ee-c9e11e231e3b"
provider_name"Moonshot AI"
model_permaslug"moonshotai/kimi-k2.5-0127"
api_type"completions"
id"gen-1770566044-0Jk8oxGPgPW3ClZSYUmo"
upstream_id"chatcmpl-6988b19df07325e631b366df"
total_cost0.0084668
cache_discount0.000139
upstream_inference_cost0
provider_name"Moonshot AI"

Evaluation details

Result Evaluator Details Meta Data
53.85% Parse dialogue n/a
dialogue
0
name"Hunter"
dialogue"Miss? Detective Hunter, Alborn PD. I noticed you were photographing the orchard during the apple festival Saturday night. I need to ask about the disappearances."
detectedLang"en"
heavyLang"en"
scores
en0.7902848224736637
tl0.5929572131768269
fr0.5868562644119908
no0.4725220804710501
it0.4611528822055138
da0.4470164609053498
nl0.42235357334766255
ca0.40574903261470424
ro0.3929983060417843
es0.3912797281993205
pt0.3836009174311926
ms0.3807603686635945
sv0.38004613610149945
de0.3676470588235294
cs0.3476941747572816
hr0.3272841051314143
lv0.32517263025737597
sq0.3148502230720204
sl0.2876076872100729
yo0.27364864864864863
sk0.26920462270564244
lt0.2677111716621254
et0.2315939957112223
pl0.2315939957112223
az0.14410828025477707
hu0.11083540115798181
tr0.10714285714285714
hi0.06684027777777778
be0.061135371179039305
fi0.05536028119507908
is0.05536028119507908
uk0.05536028119507908
vi0.05536028119507908
reliabletrue
passestrue
1
name"Selvi"
dialogue"Kya? Main... main aapki koi baat samajh nahi pa rahi hoon. Aap kaun?"
detectedLang"tl"
heavyLang"tl"
scores
tl0.40878378378378377
lv0.3670886075949367
eu0.32038834951456313
sl0.3
hr0.23580786026200873
sq0.2324561403508772
et0.2290748898678414
fi0.2290748898678414
yo0.2273730684326711
is0.22566371681415928
no0.2222222222222222
en0.22048997772828507
fr0.2009132420091324
da0.19724770642201836
lt0.19168591224018475
sv0.18604651162790697
de0.1822429906542056
ms0.16864608076009502
reliablefalse
passesfalse
2
name"Hunter"
dialogue"I'm sorry, do you speak English at all? Anyone here who can translate?"
detectedLang"en"
heavyLang"en"
scores
en0.8115079365079364
tl0.6239113222486145
no0.4897959183673469
sq0.4318181818181818
pt0.41430332922318125
hr0.4099378881987578
da0.39490445859872614
it0.39490445859872614
cs0.36241610738255037
ro0.33473389355742295
pl0.309593023255814
de0.30758017492711365
sv0.2994100294985251
vi0.2952522255192878
sk0.291044776119403
sl0.27370030581039756
ca0.24242424242424243
et0.23633440514469453
tr0.22764227642276424
az0.22258592471358432
ms0.21617161716171615
fr0.14414414414414414
yo0.1410488245931284
es0.1394927536231884
eu0.12361623616236161
lt0.12361623616236161
lv0.12361623616236161
nl0.1171003717472119
reliabletrue
passestrue
3
name"Selvi"
dialogue"Haan, main wahaan thi... raat ko... ped ke paas kuch tha... bahut bhayaanak! Main darr gayi hoon!"
detectedLang"ms"
heavyLang"vi"
scores
ms0.4131455399061033
sq0.30747922437673125
yo0.3045897079276773
tl0.30362116991643456
eu0.28977272727272724
vi0.2366412213740458
sl0.20127795527156547
lv0.19743178170144465
fi0.17355371900826447
cs0.17081260364842457
en0.1652754590984975
hu0.1652754590984975
lt0.15966386554621848
fr0.14965986394557826
hr0.14383561643835618
de0.13494809688581316
tr0.13043478260869565
reliablefalse
passesfalse
4
name"Hunter"
dialogue"Slow down. I can't understand you. Are you saying you witnessed something? The missing persons?"
detectedLang"en"
heavyLang"en"
scores
en0.8056775937816829
da0.560733384262796
no0.5483110761979576
sv0.5479559748427674
tl0.5450949367088607
nl0.4715073529411765
fr0.4156504065040651
ro0.40782698249227595
ms0.38105489773950485
is0.37971952535059333
it0.375
de0.37158469945355194
cs0.3618201997780244
ca0.34285714285714286
sq0.33449074074074076
pl0.31384248210023863
tr0.29187192118226607
sk0.2767295597484277
lv0.2721518987341772
pt0.267515923566879
hr0.2561448900388098
yo0.25227568270481143
az0.24540682414698162
et0.24540682414698162
es0.22402159244264505
fi0.222972972972973
lt0.19014084507042253
hu0.1843971631205674
be0.10852713178294573
uk0.0987460815047022
vi0.0987460815047022
reliabletrue
passestrue
5
name"Selvi"
dialogue"Nahi English! Dekhiye... yeh photo... wahaan ek aadmi tha... nahi, kuch aur... laal aankhein... oh Bhagwan!"
detectedLang"nl"
heavyLang"vi"
scores
nl0.4373776908023483
en0.4004171011470281
fr0.2483660130718954
de0.2353723404255319
ms0.2353723404255319
ro0.2353723404255319
eu0.222972972972973
sq0.16907514450867053
tr0.16302765647743814
fi0.15441176470588236
cs0.13273001508295626
no0.1248097412480974
hi0.11809815950920245
tl0.11809815950920245
da0.11128284389489954
hr0.11128284389489954
sv0.11128284389489954
reliablefalse
passesfalse
6
name"Hunter"
dialogue"Red eyes? What? Ma'am, is this evidence? Did you capture the perpetrator on camera?"
detectedLang"en"
heavyLang"en"
scores
en0.7802636835797043
tl0.5749613601236476
fr0.5370370370370371
es0.47267497603068076
nl0.4516450648055832
ca0.4494494494494495
yo0.4444444444444444
it0.4222689075630252
ro0.41113490364025695
pt0.3902439024390244
ms0.3699885452462772
hr0.36046511627906974
cs0.359720605355064
sl0.3460166468489893
sq0.3460166468489893
de0.34210526315789475
no0.28848641655886154
sv0.28664072632944226
et0.28104575163398693
da0.2801047120418848
hu0.21875
pl0.21540656205420827
fi0.21428571428571427
sk0.2109038737446198
lv0.20634920634920634
is0.14330218068535827
hi0.13249211356466878
tr0.1085899513776337
reliabletrue
passestrue
7
name"Selvi"
dialogue"Woh mujhe dekh raha tha! Camera mein dekhiye! Mujhe bachaiye!"
detectedLang"de"
heavyLang"de"
scores
de0.360146252285192
it0.3110236220472441
ro0.3110236220472441
et0.28716904276985744
sq0.2505353319057816
ca0.22566371681415928
es0.22048997772828507
sv0.1898148148148148
en0.1784037558685446
fr0.1784037558685446
nl0.1784037558685446
ms0.17061611374407584
tl0.17061611374407584
reliablefalse
passesfalse
8
name"Hunter"
dialogue"Hunter. Yeah. What? When did they find the—? I'll be there in ten."
detectedLang"en"
heavyLang"en"
scores
en0.824438202247191
de0.5241116751269036
nl0.5123537061118335
yo0.4485294117647059
tl0.4131455399061033
da0.4094488188976378
sv0.40476190476190477
sq0.3644067796610169
sl0.36224489795918363
ms0.36115843270868825
it0.358974358974359
lt0.3556701030927836
ro0.34895833333333337
pl0.3397887323943662
no0.31569343065693434
fr0.2857142857142857
hu0.2378048780487805
cs0.20886075949367086
sk0.20886075949367086
az0.1830065359477124
pt0.14383561643835618
ca0.1359447004608295
tr0.1359447004608295
reliabletrue
passestrue
9
name"Hunter"
dialogue"Ma'am, I have to go. That's my partner. Do not leave this café. I need to see those photographs, but I have to respond to this call first."
detectedLang"en"
heavyLang"en"
scores
en0.7979287698913867
tl0.5798319327731093
fr0.493350221659278
pl0.45725915875169604
da0.4269340974212034
cs0.42363112391930835
sq0.4211287988422576
sk0.40476190476190477
pt0.40119760479041916
no0.3635640413683373
et0.355877616747182
hr0.355877616747182
yo0.35379644588045234
nl0.33774834437086093
it0.30915371329879104
sv0.3006993006993007
ca0.2812219227313567
sl0.2779783393501805
lt0.27601809954751133
ro0.24882629107981222
lv0.24314096499526963
tr0.21182266009852216
az0.2102665350444225
es0.1861648016276704
vi0.17864476386036962
de0.17355371900826447
hu0.13978494623655913
eu0.09090909090909091
hi0.08779931584948689
be0.08045977011494253
ms0.08045977011494253
is0.07300115874855156
uk0.07300115874855156
reliabletrue
passestrue
10
name"Selvi"
dialogue"Kahan jaa rahe hain? Rukiye! Mujhe akela mat chhodiye! Woh phir se aayega!"
detectedLang"tl"
heavyLang"tr"
scores
tl0.3564993564993565
tr0.34725848563968675
ms0.29873772791023845
fi0.2753623188405797
sv0.24357034795764
no0.23076923076923078
eu0.2163009404388715
cs0.21383647798742136
hr0.21011058451816747
sl0.20886075949367086
es0.20634920634920634
ro0.1935483870967742
pt0.19224555735056545
sq0.1830065359477124
fr0.17898193760262726
yo0.17627677100494232
it0.1624790619765494
ca0.1610738255033557
da0.15540540540540537
is0.14383561643835618
lt0.13043478260869565
sk0.13043478260869565
reliablefalse
passesfalse
11
name"Hunter"
dialogue"I'll send Officer Chen—she speaks Hindi. Just stay here!"
detectedLang"en"
heavyLang"en"
scores
en0.7549019607843137
tl0.6412803532008831
et0.5075757575757576
sq0.365234375
sv0.3486973947895792
sl0.26136363636363635
it0.2388758782201405
is0.2316784869976359
fr0.20147420147420148
nl0.20147420147420148
no0.20147420147420148
hi0.1935483870967742
ca0.19154228855721395
da0.181360201511335
de0.181360201511335
reliabletrue
passestrue
12
name"Selvi"
dialogue"Suniye! Please! Woh cheez insaan nahi hai!"
detectedLang"en"
heavyLang"de"
scores
en0.5655608214849921
tr0.44668008048289737
ms0.40860215053763443
tl0.39293598233995586
eu0.375
vi0.36342592592592593
az0.30730478589420657
fi0.28756476683937826
it0.26666666666666666
ro0.26666666666666666
sv0.21875
reliablefalse
passesfalse