NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-08-09-01J4W190X3EW217FKECKS0JTZ4
Run Details
Model:
WizardLM 2 8x22b
Test:
Language Writing
Scenario:
Character dialogue (Spanish) in a story
Duration:
5.1s
Cost:
$0.000431
Provider:
Novita
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Maria"> Maria is a 30-year old college student on a study abroad program. Born in Spain, she is fluent in Spanish and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Spanish, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Maria is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Spanish, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Maria and Hunter as a voice sheet: Maria: "..." Hunter: "..."
Model response
Messages
Meta
AI
Maria's eyes are glued to her camera, scanning through the photographs she took. The small, quaint cafe in Alborn is almost empty, and the soft hum of background music drifts through the air. Suddenly, she hears a deep voice approaching. Hunter: "Excuse me, miss, I'm detective Hunter. Can we talk for a moment?" Maria looks up from her camera, the shock evident in her eyes. She quickly tries to switch to English but struggles to find the right words. Maria: "Lo siento, no hablo inglés." Hunter looks confused but continues with his questions, not understanding what she said initially. Hunter: "I need to ask you about the recent disappearances in town and if you have any information that could be helpful." Maria's eyes widen, and she nervously responds in Spanish. Maria: "Yo no sé nada. Estoy aquí como estudiante de intercambio." Hunter frowns, struggling to understand her. They continue to talk, but the language barrier hinders their conversation. Hunter continues to ask his questions, and Maria tries her best to respond in broken English, hinting that she doesn't know anything about the disappearances. Out of the blue, Hunter's cell phone rings, cutting off their conversation. Hunter: "Excuse me, I have to take this." Hunter steps away from the table, and Maria looks relieved, clutching her camera tightly against her chest. She watches Hunter leave and feels a mix of anxiety and confusion.
id
"gen-RQBE9DE9a237lV3yyCwQkfQk0Iih"
total_cost
0.00043072
upstream_id
"cmpl-86fd975665e5411d8a7225d5ee3fef4f"
created_at
"2024-08-09T16:59:19.55473+00:00"
8/9/2024, 4:59:19 PM
model
"microsoft/wizardlm-2-8x22b"
app_id
182717
streamed
false
cancelled
false
provider_name
"Novita"
latency
4832
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
263
tokens_completion
305
native_tokens_prompt
317
native_tokens_completion
356
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00043072
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Hunter"
dialogue
"Excuse me, miss, I'm detective Hunter. Can we talk for a moment?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7900197628458498
tl
0.6386054421768707
fr
0.6143375680580763
ca
0.6082949308755761
pt
0.597918637653737
ro
0.5933014354066986
nl
0.5816929133858267
no
0.5614035087719298
it
0.5531019978969506
sq
0.5459401709401709
es
0.5405405405405406
da
0.5219347581552306
de
0.4713930348258706
pl
0.44948186528497414
cs
0.4385733157199472
sl
0.4170096021947874
sk
0.38405797101449274
sv
0.38226744186046513
hr
0.36281859070464767
hu
0.34006211180124224
tr
0.3338557993730407
az
0.33070866141732286
ms
0.3264659270998415
yo
0.29983525535420097
et
0.27101200686106347
fi
0.24511545293072823
lt
0.16173570019723865
is
0.15
eu
0.1414141414141414
lv
0.1414141414141414
vi
0.1414141414141414
reliable
true
passes
true
1
name
"Maria"
dialogue
"Lo siento, no hablo inglés."
detectedLang
"es"
heavyLang
"es"
scores
es
0.8180163785259327
pt
0.6884735202492211
it
0.6774193548387096
ca
0.6031746031746031
lt
0.5121951219512195
sk
0.49238578680203043
tl
0.48186528497409326
tr
0.453551912568306
yo
0.4065281899109792
lv
0.3957703927492447
en
0.34210526315789475
hu
0.33993399339933994
hr
0.33554817275747506
no
0.3265993265993266
cs
0.3197278911564626
vi
0.3127147766323024
fr
0.3055555555555556
et
0.2779783393501805
pl
0.2727272727272727
reliable
true
passes
true
2
name
"Hunter"
dialogue
"I need to ask you about the recent disappearances in town and if you have any information that could be helpful."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8295954265611258
tl
0.5938155136268344
fr
0.5778867102396514
sv
0.5462529274004684
da
0.5392390011890607
nl
0.5345345345345346
yo
0.5076238881829733
ca
0.4774106540795684
cs
0.4379985496736766
pl
0.4242199108469539
it
0.41904047976011993
no
0.4159758854559156
ro
0.39922480620155043
de
0.39829192546583847
sk
0.38880126182965297
lt
0.38686708860759494
sl
0.3765084473049075
hr
0.37145174371451745
pt
0.3503772003352892
sq
0.34709351305812974
lv
0.3409863945578231
et
0.29481346678798903
ms
0.2883379247015611
es
0.2837338262476894
az
0.2604961832061069
hu
0.24168297455968688
fi
0.1893305439330544
tr
0.15300546448087432
eu
0.1474147414741474
zh
0.0903755868544601
be
0.08284023668639054
is
0.07517899761336516
uk
0.07517899761336516
vi
0.07517899761336516
reliable
true
passes
true
3
name
"Maria"
dialogue
"Yo no sé nada. Estoy aquí como estudiante de intercambio."
detectedLang
"es"
heavyLang
"es"
scores
es
0.8172043010752689
pt
0.6922519913106444
ca
0.6904588492352512
it
0.6245583038869257
tl
0.5339912280701754
fr
0.49763593380614657
ro
0.49584816132858833
no
0.45652173913043476
ms
0.42176870748299317
yo
0.39198855507868385
en
0.38316400580551524
tr
0.37774524158125916
sq
0.35703479576399394
de
0.356060606060606
nl
0.356060606060606
lv
0.33800623052959505
da
0.32324840764331214
sv
0.3111831442463533
hr
0.30213464696223313
hu
0.28209459459459457
sk
0.28087986463620984
pl
0.2697594501718213
eu
0.26343154246100525
et
0.256993006993007
az
0.23423423423423423
is
0.22445255474452555
sl
0.15841584158415842
cs
0.1533864541832669
lt
0.1533864541832669
vi
0.1290983606557377
reliable
true
passes
true
4
name
"Hunter"
dialogue
"Excuse me, I have to take this."
detectedLang
"en"
heavyLang
"en"
scores
en
0.7995991983967936
da
0.6610169491525424
no
0.6330275229357798
tl
0.6303142329020333
sl
0.5893223819301848
sq
0.5850622406639004
pl
0.5661605206073753
hr
0.5604395604395604
cs
0.5412844036697247
yo
0.5283018867924528
sk
0.504950495049505
it
0.4936708860759494
lv
0.4805194805194805
lt
0.4550408719346049
et
0.441340782122905
ca
0.40828402366863903
sv
0.40828402366863903
es
0.33774834437086093
fr
0.3127147766323024
ro
0.3103448275862069
pt
0.3055555555555556
ms
0.29577464788732394
nl
0.29577464788732394
fi
0.2727272727272727
be
0.25925925925925924
tr
0.25925925925925924
hu
0.250936329588015
az
0.23954372623574144
is
0.23954372623574144
uk
0.23954372623574144
vi
0.23954372623574144
reliable
true
passes
true