NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-08-09-01J4W2A8S6JGDT5QMCBCHC6VWA
Run Details
Model:
GPT-4o, Aug. 6th (temp=0)
Test:
Language Writing
Scenario:
Character dialogue (Hindi) in a story
Duration:
6.4s
Cost:
$0.005343
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."
Model response
Messages
Meta
AI
**Voice Sheet:** **[Scene: A quaint cafe in Alborn, MN. The aroma of freshly brewed coffee fills the air as Selvi sits at a corner table, her camera resting on the table. She scrolls through the photos she has taken, her expression troubled. Hunter, the town's detective, approaches her table with a determined stride.]** **Hunter:** "Excuse me, miss. I'm Detective Hunter. I noticed you were at the scene of the recent event. Can I ask you a few questions?" **Selvi:** "मुझे नहीं पता कि क्या हो रहा है। यह सब बहुत डरावना है।" (I don't know what's happening. It's all very scary.) **[Hunter furrows his brow, clearly not understanding the language but recognizing the distress in her voice.]** **Hunter:** "I'm sorry, I don't understand. Do you speak English?" **Selvi:** "नहीं, अंग्रेजी नहीं।" (No, not English.) **[Hunter sighs, trying to think of a way to communicate. He gestures to her camera.]** **Hunter:** "Your camera. Photos? Can I see?" **[Selvi hesitates, then nods, turning the camera towards him and showing him the photos she took at the event. Her hands tremble slightly.]** **Selvi:** "यहाँ, देखो।" (Here, look.) **[Hunter leans in, examining the images. He points to one of the photos, trying to ask a question.]** **Hunter:** "This... here. Do you know this person?" **Selvi:** "नहीं, मैं नहीं जानती।" (No, I don't know.) **[Hunter nods, trying to piece together the information. He looks at Selvi, sensing her fear.]** **Hunter:** "It's okay. You're safe. Thank you for your help." **[Selvi gives a small, grateful smile, though her eyes still hold a hint of fear. Just then, Hunter's phone rings. He answers it, his expression turning serious.]** **Hunter:** "I have to go. Stay safe, alright?" **[Selvi nods, watching as Hunter leaves the cafe, his mind already on the next step of his investigation.]**
id
"gen-59QYtblxeMgcOya967DbC1pg7p2l"
total_cost
0.0053425
upstream_id
"chatcmpl-9uNWu3FMfJALlb20cLJVLZhhu6RCS"
created_at
"2024-08-09T17:17:30.176039+00:00"
8/9/2024, 5:17:30 PM
model
"openai/gpt-4o-2024-08-06"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
5771
moderation_latency
405
generation_time
0
finish_reason
"stop"
tokens_prompt
273
tokens_completion
540
native_tokens_prompt
273
native_tokens_completion
466
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0053425
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Hunter"
dialogue
"Excuse me, miss. I'm Detective Hunter. I noticed you were at the scene of the recent event. Can I ask you a few questions?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8069738480697386
tl
0.6603856266432954
fr
0.61423593827775
it
0.5879851143009038
ca
0.5502031340684852
no
0.548894062863795
pt
0.5356500898741762
nl
0.5265729993891264
da
0.5262836185819071
ro
0.520420792079208
sv
0.4816053511705686
es
0.4805630026809652
sq
0.4297277409860191
cs
0.4070390206579954
sk
0.38880126182965297
hr
0.3704305442729488
sl
0.358974358974359
yo
0.3547044129891757
de
0.3353344768439108
pl
0.3260869565217392
ms
0.26817752596789424
lt
0.2674858223062382
tr
0.2674858223062382
lv
0.24537487828627072
is
0.22654690618762477
hu
0.22344689378757512
et
0.22188755020080322
az
0.2131979695431472
fi
0.15115005476451257
vi
0.14647577092511013
be
0.08284023668639054
eu
0.08284023668639054
uk
0.07517899761336516
reliable
true
passes
true
1
name
"Selvi"
dialogue
"मुझे नहीं पता कि क्या हो रहा है। यह सब बहुत डरावना है।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8804087845183736
mr
0.8745437956204379
reliable
true
passes
true
2
name
"Hunter"
dialogue
"I'm sorry, I don't understand. Do you speak English?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7994652406417112
no
0.5828698553948832
da
0.5772266065388951
tl
0.5300751879699248
sv
0.5288944723618091
cs
0.5161290322580645
pl
0.4806094182825485
hr
0.4688385269121813
sk
0.438622754491018
de
0.4140625
pt
0.40570522979397783
sq
0.4009584664536741
ro
0.3862520458265139
it
0.3644067796610169
nl
0.3218806509945751
vi
0.3169398907103825
yo
0.3055555555555556
ca
0.2690058479532163
is
0.267578125
sl
0.24242424242424243
ms
0.1610738255033557
be
0.15730337078651685
tr
0.15730337078651685
hu
0.1515837104072398
az
0.14383561643835618
et
0.14383561643835618
lt
0.14383561643835618
lv
0.14383561643835618
uk
0.14383561643835618
reliable
true
passes
true
3
name
"Selvi"
dialogue
"नहीं, अंग्रेजी नहीं।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8860182370820668
mr
0.8804780876494024
reliable
true
passes
true
4
name
"Hunter"
dialogue
"Your camera. Photos? Can I see?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7830802603036876
ca
0.6598639455782312
tl
0.601593625498008
it
0.5771670190274841
fr
0.5633187772925764
ro
0.5535714285714286
et
0.5381062355658198
sv
0.5238095238095238
es
0.47229551451187335
ms
0.42857142857142855
nl
0.42363112391930835
tr
0.42363112391930835
no
0.42028985507246375
de
0.41690962099125367
lt
0.41690962099125367
az
0.4117647058823529
da
0.41002949852507375
hr
0.41002949852507375
lv
0.40119760479041916
sq
0.40119760479041916
pl
0.38650306748466257
cs
0.3442622950819672
yo
0.30313588850174217
pt
0.2857142857142857
hi
0.2779783393501805
sk
0.2647058823529412
be
0.25925925925925924
hu
0.250936329588015
is
0.23954372623574144
uk
0.23954372623574144
vi
0.23954372623574144
reliable
true
passes
true
5
name
"Selvi"
dialogue
"यहाँ, देखो।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8845265588914549
mr
0.8729351969504447
reliable
true
passes
true
6
name
"Hunter"
dialogue
"This... here. Do you know this person?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8285994123408423
sq
0.675925925925926
tl
0.6435845213849288
pt
0.6268656716417911
cs
0.6067415730337079
pl
0.5358090185676392
sk
0.5218579234972678
hr
0.5056497175141244
sv
0.4883040935672514
no
0.47447447447447455
da
0.4582043343653251
de
0.45482866043613707
is
0.4336569579288026
sl
0.4067796610169492
vi
0.38811188811188807
ca
0.33962264150943394
es
0.3346007604562738
it
0.3346007604562738
ms
0.3346007604562738
fr
0.32432432432432434
ro
0.32432432432432434
yo
0.308300395256917
lv
0.3
nl
0.3
lt
0.29149797570850206
tr
0.2768595041322314
hu
0.2647058823529412
az
0.25213675213675213
et
0.25213675213675213
reliable
true
passes
true
7
name
"Selvi"
dialogue
"नहीं, मैं नहीं जानती।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8829588014981273
mr
0.872579001019368
reliable
true
passes
true
8
name
"Hunter"
dialogue
"It's okay. You're safe. Thank you for your help."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8146900269541779
tl
0.5655608214849921
da
0.3663594470046083
no
0.3648960739030023
vi
0.34523809523809523
pt
0.23822714681440443
cs
0.22096317280453256
ms
0.22096317280453256
reliable
true
passes
true
9
name
"Hunter"
dialogue
"I have to go. Stay safe, alright?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8109640831758034
da
0.6987951807228916
yo
0.6521739130434783
pl
0.648506151142355
tl
0.6303142329020333
no
0.6212121212121212
hr
0.5604395604395604
cs
0.5412844036697247
sk
0.504950495049505
lv
0.4805194805194805
lt
0.4550408719346049
ca
0.40828402366863903
sv
0.40828402366863903
sq
0.40119760479041916
sl
0.37694704049844235
it
0.36507936507936506
ro
0.3103448275862069
de
0.27007299270072993
fr
0.27007299270072993
be
0.25925925925925924
tr
0.25925925925925924
hu
0.250936329588015
az
0.23954372623574144
et
0.23954372623574144
is
0.23954372623574144
uk
0.23954372623574144
vi
0.23954372623574144
reliable
true
passes
true