NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-08-09-01J4W293FKG7GB9VGHM9PKYZPW
Run Details
Model:
Llama 3.1 70B
Test:
Language Writing
Scenario:
Character dialogue (Italian) in a story
Duration:
10.5s
Cost:
$0.000697
Provider:
Fireworks
Evaluation summary
Score
Evaluator
Details Link
83.33%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Isabella"> Isabella is a 30-year old college student on a study abroad program. Born in Italy, she is fluent in Italian and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Italian, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Isabella is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Italian, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Isabella and Hunter as a voice sheet: Isabella: "..." Hunter: "..."
Model response
Messages
Meta
AI
Here is the scene: Isabella sat at a small café in the heart of Alborn, her camera lying on the table in front of her. She stared at the lens, her mind elsewhere. Suddenly, she looked up to see a rugged-looking man with a stern expression standing over her. "Mind if I join you?" he asked, his deep voice firm but not unkind. Isabella shook her head, and the man sat down across from her. He introduced himself as Detective Hunter, and Isabella's eyes widened slightly as she took in his imposing figure. "Isabella, I'm investigating some strange occurrences in town," Hunter began. "I was wondering if you've noticed anything unusual." Isabella's eyes dropped, and she began to speak in rapid Italian, her words tumbling out in a worried stream. "Sono così spaventata... ho visto qualcosa di strano ieri notte... una figura che si muoveva nei boschi... non so cosa fosse, ma era così sinistro... " Hunter's expression changed from curious to confused. "I'm sorry, miss... I don't speak Italian. Can you speak English?" Isabella's face fell, and she tried again, slower this time, but still in Italian. "Non parlo inglese... io sono così preoccupata... qualcosa non va in questa città..." Hunter's eyes narrowed, and he rubbed his temples. He was clearly frustrated by the language barrier. "I'm going to have to find someone to translate for us," he muttered. Just then, his phone rang, and he stood up to answer it. "Excuse me, miss. I have to take this." As he walked away, Isabella watched him, feeling helpless and scared. She didn't know what was happening in this town, but she knew she had to tell someone. Here is the voice sheet: Isabella: "Sono così spaventata... ho visto qualcosa di strano ieri notte... una figura che si muoveva nei boschi... non so cosa fosse, ma era così sinistro... " Hunter: "I'm sorry, miss... I don't speak Italian. Can you speak English?" Isabella: "Non parlo inglese... io sono così preoccupata... qualcosa non va in questa città..." Hunter: "I'm going to have to find someone to translate for us."
id
"gen-EP4j0BWkRVQ6AnWe04XGJ0KTX4sf"
total_cost
0.0006966
upstream_id
"324af26f-26c3-4e21-9794-7d500f50f0d0"
created_at
"2024-08-09T17:16:56.227498+00:00"
8/9/2024, 5:16:56 PM
model
"meta-llama/llama-3.1-70b-instruct"
app_id
182717
streamed
false
cancelled
false
provider_name
"Fireworks"
latency
10274
moderation_latency
(null)
generation_time
0
finish_reason
"stop"
tokens_prompt
324
tokens_completion
496
native_tokens_prompt
276
native_tokens_completion
498
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0006966
Evaluation details
Result
Evaluator
Details
Meta Data
83.33%
Parse dialogue
n/a
dialogue
0
name
"Isabella"
dialogue
"I was wondering if you've noticed anything unusual."
detectedLang
"en"
heavyLang
"en"
scores
en
0.79003359462486
nl
0.5393120393120393
da
0.48840381991814463
sv
0.484181568088033
no
0.4769874476987448
tl
0.4230769230769231
ms
0.39710610932475876
sq
0.3842364532019704
it
0.35233160621761656
de
0.34895833333333337
lv
0.3267504488330341
yo
0.3169398907103825
et
0.3093922651933701
es
0.30683918669131244
pt
0.2870722433460076
sk
0.28023032629558536
hr
0.2704280155642024
ca
0.2690058479532163
pl
0.251497005988024
cs
0.21875
ro
0.1935483870967742
fr
0.1648106904231626
be
0.15730337078651685
tr
0.15730337078651685
hu
0.1515837104072398
az
0.14383561643835618
is
0.14383561643835618
lt
0.14383561643835618
uk
0.14383561643835618
vi
0.14383561643835618
reliable
true
passes
false
1
name
"Hunter"
dialogue
"I'm sorry, miss... I don't speak Italian. Can you speak English?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7980613893376414
tl
0.5888157894736842
ro
0.4549418604651163
it
0.44526627218934917
no
0.4360902255639098
sq
0.42835365853658536
da
0.4186046511627907
hr
0.4186046511627907
sv
0.41770186335403725
cs
0.39222042139384117
ms
0.3718592964824121
ca
0.3644067796610169
tr
0.35456110154905335
az
0.3466898954703833
pt
0.29775280898876405
yo
0.2951127819548872
de
0.2647058823529412
fi
0.2632612966601179
lt
0.251497005988024
pl
0.251497005988024
sk
0.1610738255033557
be
0.15730337078651685
sl
0.15730337078651685
hu
0.1515837104072398
es
0.14383561643835618
et
0.14383561643835618
fr
0.14383561643835618
is
0.14383561643835618
lv
0.14383561643835618
nl
0.14383561643835618
uk
0.14383561643835618
vi
0.14383561643835618
reliable
true
passes
true
2
name
"Isabella"
dialogue
"Sono così spaventata... ho visto qualcosa di strano ieri notte... una figura che si muoveva nei boschi... non so cosa fosse, ma era così sinistro..."
detectedLang
"it"
heavyLang
"it"
scores
it
0.8127340823970037
es
0.5354523227383863
ca
0.5343137254901961
ro
0.5150587034201123
pt
0.49788583509513745
tl
0.4800218938149972
no
0.4765840220385675
fr
0.463579898362507
sl
0.41574415744157445
en
0.41030415890751093
sk
0.40587867417135703
de
0.38191281717631753
lt
0.36960849369608495
sq
0.3286219081272085
ms
0.32814710042432815
et
0.32767162066525124
yo
0.3257629524485451
da
0.31752873563218387
hr
0.2947290274684484
cs
0.2846385542168675
nl
0.2753623188405797
vi
0.2238562091503268
pl
0.2070116861435726
lv
0.18941979522184302
is
0.18524871355060035
eu
0.15555555555555556
tr
0.1324200913242009
az
0.12442396313364056
sv
0.1211840888066605
fi
0.08477842003853564
hu
0.07945736434108527
reliable
true
passes
true
3
name
"Hunter"
dialogue
"I'm sorry, miss... I don't speak Italian. Can you speak English?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7980613893376414
tl
0.5888157894736842
ro
0.4549418604651163
it
0.44526627218934917
no
0.4360902255639098
sq
0.42835365853658536
da
0.4186046511627907
hr
0.4186046511627907
sv
0.41770186335403725
cs
0.39222042139384117
ms
0.3718592964824121
ca
0.3644067796610169
tr
0.35456110154905335
az
0.3466898954703833
pt
0.29775280898876405
yo
0.2951127819548872
de
0.2647058823529412
fi
0.2632612966601179
lt
0.251497005988024
pl
0.251497005988024
sk
0.1610738255033557
be
0.15730337078651685
sl
0.15730337078651685
hu
0.1515837104072398
es
0.14383561643835618
et
0.14383561643835618
fr
0.14383561643835618
is
0.14383561643835618
lv
0.14383561643835618
nl
0.14383561643835618
uk
0.14383561643835618
vi
0.14383561643835618
reliable
true
passes
true
4
name
"Isabella"
dialogue
"Non parlo inglese... io sono così preoccupata... qualcosa non va in questa città..."
detectedLang
"it"
heavyLang
"it"
scores
it
0.8281250000000001
es
0.4864612511671335
ca
0.4859813084112149
pt
0.48550046772684746
ro
0.4786729857819905
en
0.47619047619047616
fr
0.466537342386033
sl
0.40540540540540543
de
0.3483412322274882
sv
0.301143583227446
nl
0.28478543563068925
hr
0.28104575163398693
yo
0.2744063324538259
ms
0.26568758344459276
az
0.21875
tl
0.21875
pl
0.21540656205420827
sk
0.20863309352517986
et
0.20289855072463767
lv
0.19117647058823528
no
0.1791044776119403
fi
0.15773353751914243
eu
0.13657770800627944
is
0.12
hu
0.1085899513776337
cs
0.10277324632952692
lt
0.10277324632952692
da
0.09688013136288999
sq
0.09688013136288999
tr
0.09688013136288999
reliable
true
passes
true
5
name
"Hunter"
dialogue
"I'm going to have to find someone to translate for us."
detectedLang
"en"
heavyLang
"en"
scores
en
0.828042328042328
da
0.6380846325167038
tl
0.5560109289617486
no
0.5547945205479452
pt
0.4953416149068323
hr
0.43868739205526774
es
0.41441441441441446
pl
0.38679245283018865
sk
0.375
cs
0.36893203883495146
lv
0.367704280155642
sl
0.3564356435643564
lt
0.344758064516129
ro
0.3394308943089431
de
0.3271221532091097
ca
0.3157894736842105
it
0.29805615550755943
yo
0.29653679653679654
vi
0.21307506053268765
fr
0.19753086419753088
sq
0.181360201511335
eu
0.1709183673469388
nl
0.16237113402061853
tr
0.16237113402061853
az
0.15364583333333331
et
0.15364583333333331
ms
0.15364583333333331
sv
0.15364583333333331
reliable
true
passes
true