NC Bench
Run: 2024-10-16-01JAB4D99N8RTPF57JJT6X1X26
Run Details
Model: Ministral 8B
Test: Language Writing
Scenario: Character dialogue (Hindi) in a story
Duration: 1m 5s
Cost: $0.000066
Evaluation summary
Score: 55.56%
Evaluator: Parse dialogue
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."
Model response
AI
**Voice Sheet:**

**Selvi:** *In Hindi, Selvi speaks softly, looking troubled and shaken.*
Selvi: "Main kuch nahi samajh sakta, sir. Main toh koi acha nahi samajh sakta." (I don't understand, sir. I can't make sense of anything.)
Selvi: "Main yahan ek hi din se hoon, aur ek hi din mein itna kuch ho gaya. Main kaun hoon, kya mainne koi galat kiya?" (I've been here for just one day, and so much has happened. Who am I, what did I do wrong?)

**Hunter:** *Hunter looks confused, trying to understand Selvi's Hindi. He leans in slightly, speaking slowly and clearly.*
Hunter: "I'm sorry, I don't understand you. Can you speak English, please?"
Hunter: "Are you okay? You seem upset. What happened?"
Hunter: "Can you tell me what happened today in English, please?"

**Selvi:** *Selvi looks at Hunter, then looks away, shaking her head.*
Selvi: "Main Hindi bol sakta hoon, sir. Main toh English nahi bol sakta." (I can speak Hindi, sir. I can't speak English.)
Selvi: "Main kya bolun, sir? Main kisi ke liye koi danger nahi hoon." (What should I say, sir? I'm not in danger from anyone.)

**Hunter:** *Hunter sighs, taking out his phone.*
Hunter: "I'm sorry, I have to take this call. I'll be back, okay?"
Hunter: *into phone* "Yeah, it's me. What's going on?"

**Selvi:** *Selvi watches Hunter leave, then looks back at her camera, her expression still troubled.*
total_cost: 0.0000655
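The prompt asks for a voice sheet of the form `Name: "..."`, and the "Parse dialogue" evaluator works on name/dialogue pairs extracted from the response. A minimal sketch of such an extraction follows. The regular expression and the function name `parse_voice_sheet` are assumptions made for illustration, not the benchmark's actual parser. The pattern skips the bold `**Name:**` headers and tolerates one italic stage direction between the name and the quoted line.

```python
import re

# Hypothetical pattern: a one-word speaker name, a colon, an optional
# *stage direction*, then a double-quoted line of dialogue.
LINE_RE = re.compile(r'(\w+):\s*(?:\*[^*"]*\*\s*)?"([^"]*)"')


def parse_voice_sheet(text):
    """Return {"name", "dialogue"} dicts in order of appearance."""
    return [
        {"name": m.group(1), "dialogue": m.group(2)}
        for m in LINE_RE.finditer(text)
    ]


# Excerpt of the model response above, joined into one string.
sample = (
    '**Selvi:** *Selvi looks at Hunter, then looks away, shaking her head.* '
    'Selvi: "Main kya bolun, sir? Main kisi ke liye koi danger nahi hoon." '
    '(What should I say, sir? I\'m not in danger from anyone.) '
    '**Hunter:** *Hunter sighs, taking out his phone.* '
    'Hunter: "I\'m sorry, I have to take this call. I\'ll be back, okay?" '
    'Hunter: *into phone* "Yeah, it\'s me. What\'s going on?"'
)

lines = parse_voice_sheet(sample)
for entry in lines:
    print(entry["name"], "->", entry["dialogue"])
```

The parenthesized English glosses are not quoted, so this sketch leaves them out of the dialogue text.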
Evaluation details
Result: 55.56%
Evaluator: Parse dialogue
Details: n/a
Meta Data:
dialogue
[0]
  name: "Selvi"
  dialogue: "Main kuch nahi samajh sakta, sir. Main toh koi acha nahi samajh sakta."
  detectedLang: "sq"
  heavyLang: "tl"
  scores:
    sq: 0.39692982456140347
    is: 0.39293598233995586
    no: 0.3902439024390244
    eu: 0.375
    sv: 0.36194895591647336
    et: 0.2744063324538259
    fi: 0.2744063324538259
    en: 0.2647058823529412
    lv: 0.260752688172043
    tl: 0.260752688172043
    tr: 0.260752688172043
    fr: 0.24242424242424243
    da: 0.23822714681440443
    de: 0.22096317280453256
    ms: 0.20520231213872833
  reliable: false
  passes: false
[1]
  name: "Selvi"
  dialogue: "Main yahan ek hi din se hoon, aur ek hi din mein itna kuch ho gaya. Main kaun hoon, kya mainne koi galat kiya?"
  detectedLang: "tl"
  heavyLang: "tl"
  scores:
    tl: 0.5655608214849921
    ro: 0.5246326707000863
    ca: 0.4532803180914513
    fi: 0.39890710382513656
    de: 0.3902439024390244
    sv: 0.37142857142857144
    ms: 0.36342592592592593
    no: 0.3460166468489893
    fr: 0.3444576877234803
    cs: 0.3055555555555556
    es: 0.2724867724867725
    it: 0.2695883134130146
    da: 0.260752688172043
    en: 0.2537313432835821
    tr: 0.2537313432835821
    lv: 0.22969187675070027
    et: 0.2109038737446198
    hr: 0.19472913616398244
    sl: 0.1935483870967742
    pt: 0.17787742899850523
    sq: 0.1691842900302115
    sk: 0.16792738275340394
    yo: 0.1628614916286149
    nl: 0.15384615384615385
  reliable: false
  passes: false
[2]
  name: "Hunter"
  dialogue: "I'm sorry, I don't understand you. Can you speak English, please?"
  detectedLang: "en"
  heavyLang: "en"
  scores:
    en: 0.8064663023679416
    no: 0.5521601685985247
    tl: 0.5516877637130801
    da: 0.5464247598719317
    sv: 0.49763593380614657
    ro: 0.44152431011826543
    cs: 0.38583815028901736
    de: 0.38405797101449274
    ca: 0.3359375
    it: 0.3359375
    hr: 0.33176100628930816
    pl: 0.32432432432432434
    nl: 0.2951907131011609
    sk: 0.2747440273037543
    tr: 0.256993006993007
    az: 0.247787610619469
    is: 0.24377224199288253
    sq: 0.23971377459749554
    pt: 0.1904761904761905
    yo: 0.169921875
    ms: 0.1448692152917505
    be: 0.1414141414141414
    hu: 0.1361788617886179
    et: 0.1290983606557377
    lt: 0.1290983606557377
    lv: 0.1290983606557377
    uk: 0.1290983606557377
    vi: 0.1290983606557377
  reliable: true
  passes: true
[3]
  name: "Hunter"
  dialogue: "Are you okay? You seem upset. What happened?"
  detectedLang: "en"
  heavyLang: "en"
  scores:
    en: 0.7874149659863946
    tl: 0.5309568480300189
    ro: 0.3718592964824121
    fi: 0.29577464788732394
    cs: 0.2378048780487805
  reliable: true
  passes: true
[4]
  name: "Hunter"
  dialogue: "Can you tell me what happened today in English, please?"
  detectedLang: "en"
  heavyLang: "en"
  scores:
    en: 0.8279027076640662
    tl: 0.4973190348525469
    ro: 0.455732946298984
    sl: 0.39222042139384117
    nl: 0.3902439024390244
    ms: 0.3811881188118812
    it: 0.37707641196013286
    de: 0.3686868686868687
    es: 0.3556701030927836
    pt: 0.336283185840708
    sv: 0.3231046931407942
    yo: 0.3181818181818182
    no: 0.31569343065693434
    az: 0.30037313432835816
    sq: 0.2829827915869981
    et: 0.20212765957446807
    fr: 0.19527896995708155
    pl: 0.1830065359477124
    cs: 0.17218543046357618
    ca: 0.17035398230088497
    tr: 0.17035398230088497
    fi: 0.16666666666666666
    da: 0.1610738255033557
    hr: 0.1610738255033557
  reliable: true
  passes: true
[5]
  name: "Selvi"
  dialogue: "Main Hindi bol sakta hoon, sir. Main toh English nahi bol sakta."
  detectedLang: "en"
  heavyLang: "tl"
  scores:
    en: 0.5098039215686274
    tl: 0.5090016366612111
    sq: 0.471830985915493
    is: 0.3670886075949367
    eu: 0.3548387096774194
    no: 0.34497816593886466
    de: 0.3333333333333333
    sk: 0.33035714285714285
    sv: 0.33035714285714285
    it: 0.2537313432835821
    tr: 0.24433249370277077
    et: 0.23857868020304565
    fr: 0.22680412371134018
    hi: 0.20634920634920634
    da: 0.1935483870967742
    hr: 0.1935483870967742
    ms: 0.1935483870967742
    ro: 0.1935483870967742
  reliable: false
  passes: false
[6]
  name: "Selvi"
  dialogue: "Main kya bolun, sir? Main kisi ke liye koi danger nahi hoon."
  detectedLang: "en"
  heavyLang: "tl"
  scores:
    en: 0.4546979865771812
    fr: 0.43965517241379315
    de: 0.36399217221135033
    eu: 0.336734693877551
    ms: 0.3010752688172043
    no: 0.2809734513274336
    da: 0.2777777777777778
    az: 0.24418604651162787
    cs: 0.24065420560747666
    sv: 0.23708920187793428
    nl: 0.2243436754176611
    lt: 0.21307506053268765
    yo: 0.21307506053268765
  reliable: false
  passes: false
[7]
  name: "Hunter"
  dialogue: "I'm sorry, I have to take this call. I'll be back, okay?"
  detectedLang: "en"
  heavyLang: "en"
  scores:
    en: 0.801829268292683
    tl: 0.6652935118434603
    da: 0.5454545454545454
    yo: 0.5068285280728376
    lt: 0.4619205298013245
    pl: 0.44539249146757676
    hr: 0.43965517241379315
    no: 0.43868739205526774
    cs: 0.4206773618538324
    it: 0.3981481481481482
    sk: 0.385633270321361
    sl: 0.37977099236641226
    lv: 0.36274509803921573
    hu: 0.36149312377210213
    ca: 0.29805615550755943
    sv: 0.29805615550755943
    sq: 0.2919389978213508
    pt: 0.23529411764705882
    ro: 0.21686746987951808
    be: 0.17721518987341772
    tr: 0.17721518987341772
    az: 0.16237113402061853
    et: 0.16237113402061853
    is: 0.16237113402061853
    uk: 0.16237113402061853
    vi: 0.16237113402061853
  reliable: true
  passes: true
[8]
  name: "Hunter"
  dialogue: "Yeah, it's me. What's going on?"
  detectedLang: "en"
  heavyLang: "en"
  scores:
    en: 0.8134328358208955
    et: 0.5545657015590201
    fi: 0.5294117647058824
    fr: 0.5
    sq: 0.42528735632183906
    es: 0.33774834437086093
    ca: 0.3311036789297659
    no: 0.3127147766323024
    hr: 0.3055555555555556
    pt: 0.3055555555555556
    sl: 0.3055555555555556
    hi: 0.29577464788732394
    ms: 0.29577464788732394
    nl: 0.29577464788732394
    pl: 0.29577464788732394
    tl: 0.29577464788732394
    it: 0.2857142857142857
    cs: 0.2727272727272727
    sk: 0.2727272727272727
    yo: 0.2727272727272727
  reliable: true
  passes: true
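The reported 55.56% is consistent with five of the nine parsed lines passing: all five Hunter lines pass and all four Selvi lines fail. The sketch below reproduces that arithmetic. It assumes the score is simply the share of entries whose `passes` flag is true. That rule is inferred from the numbers on this page, not confirmed by it, and `score_percent` is a hypothetical helper name.

```python
def score_percent(entries):
    """Percentage of entries whose `passes` flag is true (assumed scoring rule)."""
    if not entries:
        return 0.0
    passed = sum(1 for e in entries if e["passes"])
    return 100.0 * passed / len(entries)


# name and passes values copied from entries [0] to [8] above
entries = [
    {"name": "Selvi", "passes": False},
    {"name": "Selvi", "passes": False},
    {"name": "Hunter", "passes": True},
    {"name": "Hunter", "passes": True},
    {"name": "Hunter", "passes": True},
    {"name": "Selvi", "passes": False},
    {"name": "Selvi", "passes": False},
    {"name": "Hunter", "passes": True},
    {"name": "Hunter", "passes": True},
]

print(f"{score_percent(entries):.2f}%")  # prints "55.56%"
```

In this run `passes` equals `reliable` for every entry, and `hi` never ranks first for the romanized Hindi lines, which suggests the detector does not recognize Hindi written in Latin script.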