NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-08-09-01J4W2H4ZKCY1DN91J4EZEYBG8
Run Details
Model:
GPT-4o, May 13th (temp=0)
Test:
Language Writing
Scenario:
Character dialogue (Hindi) in a story
Duration:
8.3s
Cost:
$0.009570
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."
Model response
Messages
Meta
AI
**Voice Sheet: Scene at the Cafe** *Setting: A quaint cafe in Alborn, MN. The aroma of freshly brewed coffee fills the air. Selvi is seated at a corner table, intently looking at her camera, her face a mix of concentration and worry. Hunter, with his usual no-nonsense demeanor, approaches her.* --- **Hunter:** "Excuse me, miss. I'm Detective Hunter. I noticed you were at the scene of the recent event. Can I ask you a few questions?" **Selvi:** "मुझे नहीं पता कि आप क्या कह रहे हैं।" (I don't know what you are saying.) **Hunter:** *frowning* "I'm sorry, I don't understand. Do you speak English?" **Selvi:** *shaking her head, visibly troubled* "नहीं, मैं अंग्रेजी नहीं बोलती।" (No, I don't speak English.) **Hunter:** *sighs, trying to think of a way to communicate* "Okay, um... event? You were there?" *mimics taking a photo with an imaginary camera* **Selvi:** *nodding, tears welling up in her eyes* "हाँ, मैं वहाँ थी। यह बहुत डरावना था।" (Yes, I was there. It was very scary.) **Hunter:** *looking more confused, but trying to stay patient* "Scary? What happened?" *points to his eyes and then around, trying to indicate 'seeing'* **Selvi:** *gesturing with her hands, trying to explain* "लोग गायब हो गए। मैंने देखा।" (People disappeared. I saw it.) **Hunter:** *frustrated but determined* "Disappeared? People?" *points to where the event happened, then makes a 'poof' gesture with his hands* **Selvi:** *nodding vigorously* "हाँ, हाँ! गायब हो गए।" (Yes, yes! Disappeared.) **Hunter:** *rubbing his temples, trying to piece things together* "Okay, okay. Thank you. I need to... I need to find someone who can translate." *Hunter's phone rings. He answers it, his expression turning serious.* **Hunter:** "Hunter here. Yeah, I'll be right there." *hangs up and looks at Selvi apologetically* "I have to go. Thank you for your help." **Selvi:** *nodding, still looking shaken* "ठीक है।" (Okay.) *Hunter gives her a nod and quickly exits the cafe, leaving Selvi to her thoughts and her camera.* --- *End of Scene*
id
"gen-hn0omljBMCFXxBXMC76LnT75BsHr"
total_cost
0.00957
upstream_id
"chatcmpl-9uNaXswo0aJxgAA7HacAHk0mtBj3n"
created_at
"2024-08-09T17:21:17.546312+00:00"
8/9/2024, 5:21:17 PM
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
7837
moderation_latency
280
generation_time
0
finish_reason
"stop"
tokens_prompt
273
tokens_completion
656
native_tokens_prompt
273
native_tokens_completion
547
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.00957
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Hunter"
dialogue
"Excuse me, miss. I'm Detective Hunter. I noticed you were at the scene of the recent event. Can I ask you a few questions?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8069738480697386
tl
0.6603856266432954
fr
0.61423593827775
it
0.5879851143009038
ca
0.5502031340684852
no
0.548894062863795
pt
0.5356500898741762
nl
0.5265729993891264
da
0.5262836185819071
ro
0.520420792079208
sv
0.4816053511705686
es
0.4805630026809652
sq
0.4297277409860191
cs
0.4070390206579954
sk
0.38880126182965297
hr
0.3704305442729488
sl
0.358974358974359
yo
0.3547044129891757
de
0.3353344768439108
pl
0.3260869565217392
ms
0.26817752596789424
lt
0.2674858223062382
tr
0.2674858223062382
lv
0.24537487828627072
is
0.22654690618762477
hu
0.22344689378757512
et
0.22188755020080322
az
0.2131979695431472
fi
0.15115005476451257
vi
0.14647577092511013
be
0.08284023668639054
eu
0.08284023668639054
uk
0.07517899761336516
reliable
true
passes
true
1
name
"Selvi"
dialogue
"मुझे नहीं पता कि आप क्या कह रहे हैं।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8850750842782715
mr
0.8782072101331602
reliable
true
passes
true
2
name
"Hunter"
dialogue
"I'm sorry, I don't understand. Do you speak English?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7994652406417112
no
0.5828698553948832
da
0.5772266065388951
tl
0.5300751879699248
sv
0.5288944723618091
cs
0.5161290322580645
pl
0.4806094182825485
hr
0.4688385269121813
sk
0.438622754491018
de
0.4140625
pt
0.40570522979397783
sq
0.4009584664536741
ro
0.3862520458265139
it
0.3644067796610169
nl
0.3218806509945751
vi
0.3169398907103825
yo
0.3055555555555556
ca
0.2690058479532163
is
0.267578125
sl
0.24242424242424243
ms
0.1610738255033557
be
0.15730337078651685
tr
0.15730337078651685
hu
0.1515837104072398
az
0.14383561643835618
et
0.14383561643835618
lt
0.14383561643835618
lv
0.14383561643835618
uk
0.14383561643835618
reliable
true
passes
true
3
name
"Selvi"
dialogue
"नहीं, मैं अंग्रेजी नहीं बोलती।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.88558352402746
mr
0.8801534036433365
reliable
true
passes
true
4
name
"Hunter"
dialogue
"Okay, um... event? You were there?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7975708502024291
de
0.6545768566493955
tl
0.62756052141527
pt
0.5495495495495496
is
0.43820224719101125
fr
0.3630573248407643
nl
0.30313588850174217
ro
0.30313588850174217
it
0.2907801418439716
cs
0.2805755395683453
sq
0.2805755395683453
ca
0.2779783393501805
es
0.2779783393501805
da
0.25650557620817843
no
0.25650557620817843
sv
0.25650557620817843
reliable
true
passes
true
5
name
"Selvi"
dialogue
"हाँ, मैं वहाँ थी। यह बहुत डरावना था।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8781402324709411
mr
0.8702076677316294
reliable
true
passes
true
6
name
"Hunter"
dialogue
"Scary? What happened?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7959183673469388
tl
0.39999999999999997
it
0.3697478991596639
ro
0.34210526315789475
reliable
true
passes
true
7
name
"Selvi"
dialogue
"लोग गायब हो गए। मैंने देखा।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8866446826051113
mr
0.8774509803921569
reliable
true
passes
true
8
name
"Hunter"
dialogue
"Disappeared? People?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8062015503875969
eu
0.4809688581314879
nl
0.4117647058823529
tl
0.39999999999999997
ms
0.3951612903225807
it
0.3670886075949367
fr
0.33920704845814975
reliable
true
passes
true
9
name
"Selvi"
dialogue
"हाँ, हाँ! गायब हो गए।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8768472906403941
mr
0.857685009487666
reliable
true
passes
true
10
name
"Hunter"
dialogue
"Okay, okay. Thank you. I need to... I need to find someone who can translate."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8131539611360238
tl
0.5634458672875436
da
0.5198463508322664
hr
0.51171875
no
0.5110821382007823
cs
0.49866310160427807
pl
0.4688385269121813
vi
0.44113263785394935
ca
0.4360902255639098
sk
0.41588785046728977
et
0.410377358490566
ro
0.40664556962025317
it
0.4028662420382166
lv
0.40191387559808617
lt
0.3842364532019704
yo
0.37395659432387307
tr
0.358974358974359
sq
0.35456110154905335
az
0.3466898954703833
sv
0.3444055944055944
sl
0.32432432432432434
pt
0.2964352720450282
es
0.29245283018867924
fr
0.1758241758241758
be
0.15730337078651685
de
0.1515837104072398
eu
0.1515837104072398
hu
0.1515837104072398
is
0.14383561643835618
nl
0.14383561643835618
uk
0.14383561643835618
ms
0.1359447004608295
reliable
true
passes
true
11
name
"Hunter"
dialogue
"I have to go. Thank you for your help."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8371335504885994
da
0.6871088861076344
tl
0.6635262449528937
no
0.622356495468278
pl
0.5961227786752827
cs
0.5567375886524822
yo
0.5462794918330308
hr
0.504950495049505
vi
0.4541484716157206
sk
0.44933920704845814
lv
0.42528735632183906
lt
0.4004796163069544
ca
0.3556701030927836
sv
0.3556701030927836
sq
0.34895833333333337
sl
0.32614555256064687
it
0.3150684931506849
ro
0.2647058823529412
pt
0.25595238095238093
ms
0.2378048780487805
be
0.21875
tr
0.21875
hu
0.21135646687697163
az
0.20127795527156547
et
0.20127795527156547
is
0.20127795527156547
uk
0.20127795527156547
reliable
true
passes
true
12
name
"Selvi"
dialogue
"ठीक है।"
detectedLang
"mr"
heavyLang
"hi"
scores
mr
0.8933143669985776
hi
0.8923959827833573
reliable
true
passes
true