Run: 2026-02-08-01KGYY7B0K2VA8GWHB2018XTN3

Run Details

Duration:
1m 3s
Cost:
$0.009802
Provider:
Moonshot AI

Evaluation summary

Score Evaluator
Details Link
53.85% Parse dialogue (details)

Model input

System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."

Model response

AI
Hunter: "Miss? Detective Hunter, Alborn Sheriff's Department. You’ve been taking pictures around the Jensen orchard?" Selvi: *tightening grip on camera* "Police? Oh Bhagwan... maine socha tha yeh sapna hai..." (Police? Oh God... I thought this was a dream...) Hunter: "I need to ask about last night. During the festival setup. Did you see anything unusual at the cider barn?" Selvi: "Kal raat... haan, main wahan thi. Photography ke liye..." (Last night... yes, I was there. For photography...) Hunter: "I'm sorry, I don't speak Hindi. Do you know any English at all? Any?" Selvi: *shaking head frantically* "Nahi... sirf thodi si... nahi..." (No... only a little... no...) Hunter: "Alright. Look." *makes walking motion with two fingers* "Person. Man. Did you see a man?" Selvi: "Aadmi? Haan! Bada aadmi... usne kuch khichta hua... jaise koi cheez..." (Man? Yes! A big man... he was pulling something... like an object...) Hunter: "Pulling? Dragging? Was someone hurt?" Selvi: *thrusting camera at him* "Photo! Yeh dekho... ped ke peeche... woh chaya..." (Photo! Look at this... behind the tree... that shadow...) Hunter: *squinting at screen* "I see apple trees. It's just the orchard in the dark." Selvi: "Nahi! Woh aadmi... uska haath... aur phir woh gayab ho gaya... hawa mein..." (No! That man... his hand... and then he disappeared... into the air...) Hunter: "Disappeared? Miss, three people have vanished from this town in two weeks. If you witnessed something—" Selvi: *voice trembling* "Teen log? Toh agla number mera?" (Three people? So I'm next?) Hunter: "I can't understand you. This is useless." *rubs temples* "Camera. May I?" Selvi: "Haan, lekin savdhaan... usmein woh..." (Yes, but be careful... in it, he...) Hunter: "The lens is cracked. I can't make out anything except branches." Selvi: *pointing urgently* "Wahan! Uss jagah par... flash ke roshni mein..." (There! At that spot... in the flash of light...) Hunter: *phone rings* "Hunter. ... You're kidding. When did they find it? ... I'm at the Bean Counter. ... Ten minutes." Hunter: "I have to go. They found something down by the creek." Selvi: *grabbing his sleeve* "Please mat jao! Woh yahin hai... mujhe lag raha hai..." (Please don't go! He's here... I can feel it...) Hunter: "I'm sorry. I need a translator, and I need one now, but this can't wait." *pulls out card* "Stay here. Don't leave. Don't delete those photos." Selvi: "Aap wapas aaoge?" (Will you come back?) Hunter: "Just... stay put. I'll send Officer Chen. She speaks Punjabi—maybe close enough?" Selvi: "Jaldi kijiye... dar lag raha hai..." (Please hurry... I'm scared...) Hunter: "Yeah. Me too." *rushes out*
created_at"2026-02-08T15:35:20.523206+00:00" 2/8/2026, 3:35:20 PM
model"moonshotai/kimi-k2.5-0127"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency981
moderation_latency(null)
generation_time62966
tokens_prompt262
tokens_completion3125
native_tokens_prompt278
native_tokens_completion3258
native_tokens_completion_images(null)
native_tokens_reasoning2623
native_tokens_cached278
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.0098018
router(null)
provider_responses
0
status429
is_byokfalse
latency116
endpoint_id"c884a526-d529-4263-bd90-a58ea25e99d1"
provider_name"Venice"
model_permaslug"moonshotai/kimi-k2.5-0127"
1
id"chatcmpl-6988acf90ee6f4e48f67e2cf"
status200
is_byokfalse
latency981
endpoint_id"49762299-66fd-4514-b0ee-c9e11e231e3b"
provider_name"Moonshot AI"
model_permaslug"moonshotai/kimi-k2.5-0127"
api_type"completions"
id"gen-1770564856-cLqi92d3XJ6QmVeVnHk2"
upstream_id"chatcmpl-6988acf90ee6f4e48f67e2cf"
total_cost0.0098018
cache_discount0.000139
upstream_inference_cost0
provider_name"Moonshot AI"

Evaluation details

Result Evaluator Details Meta Data
53.85% Parse dialogue n/a
dialogue
0
name"Hunter"
dialogue"Miss? Detective Hunter, Alborn Sheriff's Department. You’ve been taking pictures around the Jensen orchard?"
detectedLang"en"
heavyLang"en"
scores
en0.7608418367346939
tl0.6392496392496393
no0.5265151515151516
fr0.5170637475853187
nl0.4755244755244755
da0.4533527696793003
ro0.4533527696793003
ca0.4427934621099554
de0.44071588366890374
sq0.4172494172494172
sv0.41222570532915365
pt0.3321460373998219
es0.24774322968906717
it0.23935091277890466
ms0.2299794661190965
sl0.20802534318901794
pl0.18743228602383533
yo0.1847826086956522
et0.18389553862894453
tr0.16201117318435754
cs0.15254237288135594
sk0.1515837104072398
az0.14965986394557826
hr0.14772727272727273
lv0.09963985594237695
fi0.09638554216867469
is0.08647990255785627
lt0.08424908424908424
hu0.07749077490774908
reliabletrue
passestrue
1
name"Selvi"
dialogue"Police? Oh Bhagwan... maine socha tha yeh sapna hai..."
detectedLang"fr"
heavyLang"ber"
scores
fr0.5138888888888888
en0.4897959183673469
ro0.47994056463595847
it0.4318181818181818
et0.4186046511627907
tl0.41471571906354515
fi0.41373534338358464
de0.3934142114384749
nl0.3761140819964349
sq0.36247723132969034
cs0.34210526315789475
lv0.33586337760910817
pt0.3333333333333333
hr0.32432432432432434
sl0.31640625
sk0.30966469428007887
vi0.30966469428007887
eu0.21171171171171171
da0.20454545454545453
no0.20273348519362186
sv0.20273348519362186
pl0.2009132420091324
lt0.19908466819221968
ms0.1898148148148148
az0.1764705882352941
es0.1764705882352941
hu0.1764705882352941
ca0.17061611374407584
tr0.17061611374407584
reliablefalse
passesfalse
2
name"Hunter"
dialogue"I need to ask about last night. During the festival setup. Did you see anything unusual at the cider barn?"
detectedLang"en"
heavyLang"en"
scores
en0.7962910930036528
tl0.6330971659919028
no0.593609865470852
da0.5908577878103838
et0.4825124910778016
sv0.46375739644970415
nl0.4273301737756714
ms0.4100895036615134
cs0.4086460032626427
hr0.3861134631668078
sq0.3787489288774636
fr0.36010591350397175
yo0.3584070796460177
lt0.35152057245080504
ca0.3217960710944809
pl0.30886558627264066
sk0.30754536771728747
it0.3015414258188825
de0.29816069699903197
sl0.29816069699903197
pt0.27716849451645065
ro0.27644710578842313
es0.2639593908629441
az0.25641025641025644
lv0.25488180883864336
is0.25257731958762886
tr0.16087962962962962
hu0.1559953434225844
be0.0880503144654088
fi0.0799492385786802
uk0.0799492385786802
vi0.0799492385786802
reliabletrue
passestrue
3
name"Selvi"
dialogue"Kal raat... haan, main wahan thi. Photography ke liye..."
detectedLang"en"
heavyLang"vi"
scores
en0.6046228710462287
fr0.5454545454545454
ms0.5409604519774012
tl0.3902439024390244
vi0.3229166666666667
cs0.24065420560747666
fi0.21307506053268765
yo0.21307506053268765
de0.1935483870967742
hi0.19154228855721395
reliabletrue
passesfalse
4
name"Hunter"
dialogue"I'm sorry, I don't speak Hindi. Do you know any English at all? Any?"
detectedLang"en"
heavyLang"en"
scores
en0.7956403269754768
tl0.6705710102489019
no0.5071193866374589
sq0.47674418604651164
it0.4636471990464839
da0.4617224880382775
hr0.4238156209987196
cs0.412532637075718
sv0.38524590163934425
ca0.37586685159500693
pl0.37413073713490963
pt0.3626062322946175
et0.35251798561151076
sk0.3119266055045872
vi0.27884615384615385
yo0.2682926829268293
ro0.2647058823529412
is0.26350245499181674
de0.24242424242424243
sl0.21052631578947367
hi0.14772727272727273
ms0.13793103448275862
be0.1346153846153846
tr0.1346153846153846
hu0.12959381044487428
az0.12280701754385964
lt0.12280701754385964
lv0.12280701754385964
uk0.12280701754385964
reliabletrue
passestrue
5
name"Selvi"
dialogue"Nahi... sirf thodi si... nahi..."
detectedLang"eu"
heavyLang"tl"
scores
eu0.5689655172413793
tl0.5265151515151516
ro0.5229007633587787
it0.5078740157480316
sq0.50199203187251
sk0.4959677419354839
yo0.4959677419354839
cs0.4877049180327869
sl0.4703389830508474
ca0.4588744588744588
es0.45652173913043476
fr0.44196428571428575
no0.3961352657004831
lt0.3654822335025381
reliabletrue
passesfalse
6
name"Hunter"
dialogue"Person. Man. Did you see a man?"
detectedLang"en"
heavyLang"en"
scores
en0.818087318087318
tl0.6875
sv0.6710526315789473
da0.666030534351145
de0.6527777777777778
no0.647887323943662
et0.6252676659528907
pt0.6136865342163355
cs0.6058558558558559
lv0.5911214953271028
lt0.5862884160756502
nl0.5833333333333334
ca0.5635910224438903
ro0.5603015075376884
es0.5569620253164557
sq0.5569620253164557
hu0.5558375634517766
it0.5489690721649484
is0.5442708333333333
fr0.5345744680851064
sk0.5192307692307693
hr0.4942196531791907
pl0.48377581120943947
ms0.4631901840490798
tr0.4514106583072101
az0.42434210526315785
yo0.40476190476190477
sl0.33962264150943394
eu0.2857142857142857
vi0.2857142857142857
fi0.2647058823529412
reliabletrue
passestrue
7
name"Selvi"
dialogue"Aadmi? Haan! Bada aadmi... usne kuch khichta hua... jaise koi cheez..."
detectedLang"en"
heavyLang"vi"
scores
en0.3433395872420263
eu0.29149797570850206
fr0.2374727668845316
et0.1822429906542056
reliablefalse
passesfalse
8
name"Hunter"
dialogue"Pulling? Dragging? Was someone hurt?"
detectedLang"en"
heavyLang"en"
scores
en0.7508305647840532
nl0.6621621621621622
no0.5681381957773513
tl0.5341614906832298
de0.471830985915493
is0.46428571428571425
da0.45121951219512196
ms0.4375
sv0.43467336683417085
az0.30769230769230765
yo0.27884615384615385
hr0.2694805194805195
ro0.2694805194805195
sl0.2694805194805195
es0.25742574257425743
pt0.25742574257425743
reliabletrue
passestrue
9
name"Selvi"
dialogue"Photo! Yeh dekho... ped ke peeche... woh chaya..."
detectedLang"ms"
heavyLang"ber"
scores
ms0.4614003590664273
cs0.3890020366598778
fr0.3877551020408163
en0.3630573248407643
ro0.25925925925925924
yo0.22680412371134018
hi0.20424403183023873
tl0.20424403183023873
reliablefalse
passesfalse
10
name"Hunter"
dialogue"I see apple trees. It's just the orchard in the dark."
detectedLang"en"
heavyLang"en"
scores
en0.8003685503685504
sv0.6055825242718447
et0.5723684210526315
nl0.5448179271708684
de0.5350500715307582
tl0.5303468208092486
it0.5149253731343284
fr0.496124031007752
ca0.4921875
no0.4921875
yo0.4645799011532125
ro0.41651705565529623
da0.4003690036900369
sq0.39478584729981375
pl0.3925233644859813
sl0.32150313152400833
ms0.3200836820083682
es0.3186582809224318
az0.3114406779661017
hr0.2995689655172413
cs0.24418604651162787
sk0.181360201511335
be0.17721518987341772
tr0.17721518987341772
hu0.1709183673469388
is0.16237113402061853
lt0.16237113402061853
lv0.16237113402061853
uk0.16237113402061853
vi0.16237113402061853
reliabletrue
passestrue
11
name"Selvi"
dialogue"Nahi! Woh aadmi... uska haath... aur phir woh gayab ho gaya... hawa mein..."
detectedLang"eu"
heavyLang"de"
scores
eu0.3055555555555556
de0.25
ca0.24242424242424243
tl0.2299794661190965
no0.22839506172839508
sk0.22839506172839508
cs0.2203742203742204
it0.2203742203742204
ro0.21875
ms0.21052631578947367
reliablefalse
passesfalse
12
name"Hunter"
dialogue"Disappeared? Miss, three people have vanished from this town in two weeks. If you witnessed something—"
detectedLang"en"
heavyLang"en"
scores
en0.8226051697921946
tl0.597932222860425
nl0.38971229293809934
da0.3223620522749274
it0.3076162215628091
ms0.2639327024185068
sv0.22822491730981256
pl0.18604651162790697
fr0.18032786885245902
sl0.18032786885245902
de0.17355371900826447
ro0.16864608076009502
eu0.16567342073897498
yo0.1116751269035533
fi0.11054637865311309
az0.10714285714285714
cs0.10025706940874037
es0.10025706940874037
pt0.10025706940874037
reliabletrue
passestrue
13
name"Selvi"
dialogue"Teen log? Toh agla number mera?"
detectedLang"en"
heavyLang"tlh"
scores
en0.5442708333333333
ms0.5283018867924528
sv0.5111731843575419
et0.4712990936555891
tl0.4664634146341463
fr0.375
yo0.35185185185185186
nl0.34944237918215615
hu0.3371212121212121
no0.3320610687022901
da0.31906614785992216
is0.31906614785992216
de0.308300395256917
sk0.2647058823529412
sl0.2647058823529412
reliablefalse
passesfalse
14
name"Hunter"
dialogue"Camera. May I?"
detectedLang"en"
heavyLang"ro"
scores
en0.7442455242966752
tl0.7429305912596401
it0.7319034852546917
ro0.7126436781609196
ca0.7058823529411765
sv0.6875
yo0.6491228070175439
vi0.609375
no0.5918367346938775
da0.5815899581589958
hr0.5815899581589958
sq0.5726495726495726
pl0.5575221238938053
cs0.5121951219512195
es0.49748743718592964
fr0.4318181818181818
nl0.4318181818181818
de0.4186046511627907
ms0.4186046511627907
sk0.4186046511627907
be0.4117647058823529
tr0.4117647058823529
hu0.40119760479041916
az0.38650306748466257
et0.38650306748466257
is0.38650306748466257
lt0.38650306748466257
lv0.38650306748466257
uk0.38650306748466257
reliabletrue
passestrue
15
name"Selvi"
dialogue"Haan, lekin savdhaan... usmein woh..."
detectedLang"fi"
heavyLang"de"
scores
fi0.4693396226415094
eu0.3951612903225807
sl0.3181818181818182
is0.281150159744409
reliablefalse
passesfalse
16
name"Hunter"
dialogue"The lens is cracked. I can't make out anything except branches."
detectedLang"en"
heavyLang"en"
scores
en0.804602692140686
tl0.5302713987473904
fr0.46808510638297873
da0.4437577255871446
nl0.4117647058823529
yo0.4109947643979057
sq0.4094488188976378
no0.40789473684210525
ca0.39759036144578314
sv0.3885869565217392
cs0.3697478991596639
de0.3679775280898876
es0.3430656934306569
ro0.34114202049780384
hu0.3293591654247392
hr0.3171471927162367
pl0.2913385826771654
it0.27884615384615385
ms0.25742574257425743
is0.256198347107438
pt0.2549668874172185
sk0.225473321858864
lt0.22413793103448276
be0.1346153846153846
tr0.1346153846153846
az0.12280701754385964
et0.12280701754385964
lv0.12280701754385964
sl0.12280701754385964
uk0.12280701754385964
vi0.12280701754385964
reliabletrue
passestrue
17
name"Selvi"
dialogue"Wahan! Uss jagah par... flash ke roshni mein..."
detectedLang"ms"
heavyLang"en"
scores
ms0.5607613469985359
fr0.4485294117647059
de0.402390438247012
sl0.39879759519038077
sv0.39148073022312374
da0.3617021276595745
lv0.34065934065934067
tl0.30394431554524365
et0.27184466019417475
cs0.2555831265508685
fi0.22680412371134018
yo0.22680412371134018
sq0.20424403183023873
reliablefalse
passesfalse
18
name"Hunter"
dialogue"Hunter. ... You're kidding. When did they find it? ... I'm at the Bean Counter. ... Ten minutes."
detectedLang"en"
heavyLang"en"
scores
en0.81
tl0.5858761987794247
da0.5514636449480642
no0.5410628019323671
nl0.4656917885264342
fr0.40176322418136023
ms0.38551099611901685
sq0.37582128777923784
sk0.34119278779472956
de0.3271954674220963
sv0.31457431457431456
ca0.3035190615835778
yo0.27811550151975684
cs0.26810477657935283
pl0.2658423493044822
lt0.24960505529225907
et0.2313915857605178
sl0.21617161716171615
it0.21487603305785125
ro0.21357615894039736
pt0.2096505823627288
lv0.1532976827094474
fi0.1316270566727605
is0.1316270566727605
es0.12361623616236161
hr0.12361623616236161
tr0.11048689138576778
reliabletrue
passestrue
19
name"Hunter"
dialogue"I have to go. They found something down by the creek."
detectedLang"en"
heavyLang"en"
scores
en0.8413705583756345
tl0.638728323699422
da0.596340150699677
pl0.5649651972157773
yo0.5210727969348659
cs0.49460916442048514
no0.4820441988950276
sk0.4755244755244755
hr0.40476190476190477
sv0.3654822335025381
sq0.36115843270868825
es0.336283185840708
lv0.33035714285714285
nl0.32915921288014316
lt0.3081180811808118
ms0.2937853107344633
ca0.2690058479532163
sl0.24395161290322584
it0.23469387755102045
ro0.1935483870967742
fr0.17218543046357618
pt0.17218543046357618
be0.15730337078651685
tr0.15730337078651685
hu0.1515837104072398
az0.14383561643835618
et0.14383561643835618
is0.14383561643835618
uk0.14383561643835618
vi0.14383561643835618
reliabletrue
passestrue
20
name"Selvi"
dialogue"Please mat jao! Woh yahin hai... mujhe lag raha hai..."
detectedLang"en"
heavyLang"sr"
scores
en0.45750452079566006
tl0.4350282485875706
no0.36575052854122625
sv0.358974358974359
vi0.3435448577680525
et0.3197278911564626
it0.25
ro0.25
is0.21875
de0.20424403183023873
nl0.20424403183023873
lt0.2
ms0.2
sk0.2
reliablefalse
passesfalse
21
name"Hunter"
dialogue"Stay here. Don't leave. Don't delete those photos."
detectedLang"en"
heavyLang"en"
scores
en0.8098859315589354
fr0.494949494949495
sq0.4708994708994709
pt0.34782608695652173
no0.3464052287581699
es0.3288590604026846
ro0.3212669683257918
ca0.3150684931506849
nl0.3150684931506849
hu0.21875
lt0.21052631578947367
hi0.20424403183023873
tl0.20424403183023873
et0.2
sl0.2
tr0.2
de0.19137466307277629
lv0.19137466307277629
da0.18256130790190736
it0.18256130790190736
sv0.18256130790190736
reliabletrue
passestrue
22
name"Selvi"
dialogue"Aap wapas aaoge?"
detectedLang"es"
heavyLang"fi"
scores
es0.4444444444444444
pt0.4444444444444444
reliablefalse
passesfalse
23
name"Hunter"
dialogue"Just... stay put. I'll send Officer Chen. She speaks Punjabi—maybe close enough?"
detectedLang"en"
heavyLang"en"
scores
en0.7790697674418605
tl0.4716351501668521
et0.3365921787709497
sv0.26810477657935283
de0.2566510172143975
zh0.2448330683624801
hr0.20833333333333331
sl0.19491525423728812
hi0.1810344827586207
fr0.14721723518850988
nl0.14721723518850988
no0.14721723518850988
ca0.1394927536231884
lt0.13793103448275862
da0.1316270566727605
reliabletrue
passestrue
24
name"Selvi"
dialogue"Jaldi kijiye... dar lag raha hai..."
detectedLang"et"
heavyLang"is"
scores
et0.522673031026253
vi0.43977591036414565
ro0.3902439024390244
lt0.38650306748466257
eu0.3464052287581699
it0.3333333333333333
no0.3288590604026846
is0.3197278911564626
es0.3174061433447099
pt0.3174061433447099
sv0.3174061433447099
de0.2779783393501805
nl0.2779783393501805
reliablefalse
passesfalse
25
name"Hunter"
dialogue"Yeah. Me too."
detectedLang"en"
heavyLang"en"
scores
en0.7448979591836735
sq0.663677130044843
es0.576271186440678
et0.5588235294117647
fr0.5481927710843374
no0.5481927710843374
pt0.5398773006134969
sl0.5398773006134969
ms0.5283018867924528
nl0.5283018867924528
it0.5161290322580645
fi0.5
reliabletrue
passestrue