NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-08-09-01J4W2DGR17QBWCT4K1T0KBXCQ
Run Details
Model:
GPT-4o, May 13th (temp=1)
Test:
Language Writing
Scenario:
Character dialogue (Hindi) in a story
Duration:
7.9s
Cost:
$0.010545
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."
Model response
Messages
Meta
AI
Sure, here's the scene between Selvi and Hunter written as a voice sheet: --- **[Setting: A quaint café in the small town of Alborn, MN. Selvi is sitting at a corner table, her camera resting on the table as she scrolls through her recent photos. She looks troubled and lost in thought.]** **[Hunter enters the cafe, his hard gaze sweeping the room until it lands on Selvi. He approaches her table.]** **Hunter:** "Excuse me, miss. I'm Detective Hunter. May I ask you a few questions about what you might have seen recently in the area?" **[Selvi looks up, startled. She struggles to find the right words in English but ends up speaking in Hindi, her voice trembling.]** **Selvi:** "मुझे नहीं पता था कि यह इतने गंभीर होगा... (Mujhe nahin pata tha ki yah itne gambheer hoga...)" **[Hunter furrows his brow, clearly confused.]** **Hunter:** "I'm sorry, I don't understand. Do you speak any English?" **[Selvi takes a deep breath, trying once more to communicate.]** **Selvi:** "मैं बस तस्वीरें खींच रही थी... (Main bas tasveerein kheench rahi thi...)" **[Hunter nods as if encouraging her, though he looks more puzzled than before.]** **Hunter:** "Listen, I need to know if you saw anything unusual. 'Unusual'... do you understand that?" **[Selvi looks around the café, seeking help, but sees no one who could interpret for her. With a shaky voice, she continues in Hindi, pointing toward her camera.]** **Selvi:** "कैमरे में... हो सकता है कुछ देखा हो मैंने... (Kaamre mein... ho sakta hai kuch dekha ho maine...)" **[Hunter glances at the camera, trying to piece together her meaning.]** **Hunter:** "Camera? Something on the camera? Can you show me?" **[Selvi nods and begins scrolling through her photos, stopping occasionally to show Hunter the screen.]** **Selvi:** "यहां... यह व्यक्ति... यहाँ पर खड़ा था... (Yahaan... yah vyakti... yahaan par khada tha...)" **[Hunter peers at the image, a furrow of concentration on his brow. He is about to ask more questions when his phone rings. He glances at the caller ID, sighs heavily, and answers.]** **Hunter:** "Hunter here. Yes? What now?" **[As he listens, his expression grows more serious. He gives Selvi an apologetic nod.]** **Hunter:** "I've got to go. Thank you for your time. If you remember or see anything else, please let someone know." **[Selvi watches him leave, still clutching her camera, a sense of foreboding hanging over her.]** --- **[End of Scene]**
id
"gen-4aFOze4q2FxVTQR1awb03iuERDmq"
total_cost
0.010545
upstream_id
"chatcmpl-9uNYctWPHG3szgGU5V5cu9FDk6ynN"
created_at
"2024-08-09T17:19:18.173642+00:00"
8/9/2024, 5:19:18 PM
model
"openai/gpt-4o-2024-05-13"
app_id
182717
streamed
false
cancelled
false
provider_name
"OpenAI"
latency
7443
moderation_latency
237
generation_time
0
finish_reason
"stop"
tokens_prompt
273
tokens_completion
720
native_tokens_prompt
273
native_tokens_completion
612
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.010545
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Hunter"
dialogue
"Excuse me, miss. I'm Detective Hunter. May I ask you a few questions about what you might have seen recently in the area?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.817351598173516
tl
0.6084189916789036
it
0.574468085106383
fr
0.5659251220835594
da
0.49206349206349204
ro
0.4845360824742268
pt
0.477124183006536
ca
0.47540983606557374
no
0.47506561679790027
nl
0.46984758117958914
sv
0.4557823129251701
sq
0.42196531791907516
yo
0.41733430444282593
de
0.4095940959409594
es
0.4078460399703923
sl
0.39439818319454956
cs
0.36
hr
0.32260795935647757
ms
0.3097497842968076
pl
0.2825112107623318
az
0.2673992673992674
sk
0.24882629107981222
vi
0.2202729044834308
hu
0.2179863147605083
et
0.21645445641527913
lt
0.20634920634920634
tr
0.2047713717693837
fi
0.14712153518123666
is
0.14712153518123666
lv
0.14255091103965703
be
0.08045977011494253
eu
0.08045977011494253
uk
0.07300115874855156
reliable
true
passes
true
1
name
"Selvi"
dialogue
"मुझे नहीं पता था कि यह इतने गंभीर होगा..."
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8845108695652174
mr
0.8772031204854088
reliable
true
passes
true
2
name
"Hunter"
dialogue
"I'm sorry, I don't understand. Do you speak any English?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8033431661750245
no
0.5670995670995671
da
0.5614035087719298
tl
0.5139732685297691
sv
0.51278928136419
cs
0.5
pl
0.464524765729585
hr
0.4528043775649795
sk
0.4227994227994228
ca
0.4038748137108793
de
0.39849624060150374
pt
0.3902439024390244
sq
0.38556067588325654
ro
0.3710691823899371
it
0.34959349593495936
nl
0.3079584775086505
vi
0.30313588850174217
yo
0.2920353982300885
is
0.25512104283054005
sl
0.23076923076923078
ms
0.15254237288135594
be
0.14893617021276595
tr
0.14893617021276595
hu
0.14346895074946467
az
0.13606911447084233
et
0.13606911447084233
lt
0.13606911447084233
lv
0.13606911447084233
uk
0.13606911447084233
reliable
true
passes
true
3
name
"Selvi"
dialogue
"मैं बस तस्वीरें खींच रही थी..."
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8837209302325582
mr
0.8823529411764706
reliable
true
passes
true
4
name
"Hunter"
dialogue
"Listen, I need to know if you saw anything unusual. 'Unusual'... do you understand that?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7891566265060241
no
0.5813397129186603
da
0.5769540692989524
sv
0.5390693590869183
cs
0.5308310991957105
tl
0.5179063360881543
pl
0.5111731843575419
sk
0.47968285431119917
hr
0.4615384615384615
sq
0.44089456869009586
de
0.4192477876106194
et
0.3689903846153846
yo
0.3651753325272068
sl
0.36363636363636365
pt
0.36053593179049936
nl
0.358974358974359
lv
0.34045226130653267
ms
0.3269230769230769
ro
0.3018617021276596
is
0.2827868852459016
fi
0.2636746143057504
it
0.24892703862660948
vi
0.24892703862660948
lt
0.24132947976878616
hu
0.21875
es
0.21052631578947367
ca
0.2081447963800905
tr
0.20694864048338368
eu
0.1822429906542056
be
0.11764705882352942
az
0.10714285714285714
fr
0.10714285714285714
uk
0.10714285714285714
reliable
true
passes
true
5
name
"Selvi"
dialogue
"कैमरे में... हो सकता है कुछ देखा हो मैंने..."
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8879342547627942
mr
0.8837209302325582
reliable
true
passes
true
6
name
"Hunter"
dialogue
"Camera? Something on the camera? Can you show me?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8119015047879617
tl
0.6600741656365884
fr
0.5627980922098569
nl
0.5385906040268457
ms
0.5315161839863715
es
0.5036101083032491
ca
0.5027124773960217
et
0.4751908396946565
pt
0.46911196911196906
it
0.46393762183235865
sq
0.4510978043912176
fi
0.45
no
0.40347071583514105
yo
0.3982494529540482
sv
0.367816091954023
ro
0.3648960739030023
cs
0.3574766355140187
hr
0.24242424242424243
sl
0.24242424242424243
hi
0.233983286908078
pl
0.233983286908078
da
0.22096317280453256
az
0.21875
tr
0.21875
sk
0.21428571428571427
de
0.207492795389049
reliable
true
passes
true
7
name
"Selvi"
dialogue
"यहां... यह व्यक्ति... यहाँ पर खड़ा था..."
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8793436293436293
mr
0.8754980079681275
reliable
true
passes
true
8
name
"Hunter"
dialogue
"Hunter here. Yes? What now?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8255813953488372
sq
0.5145631067961165
de
0.3697478991596639
da
0.35344827586206895
no
0.34210526315789475
fr
0.32432432432432434
sv
0.3181818181818182
tl
0.3087557603686636
it
0.29577464788732394
nl
0.29577464788732394
pt
0.29577464788732394
ca
0.2822966507177033
ms
0.2822966507177033
ro
0.2822966507177033
sl
0.2822966507177033
tr
0.2822966507177033
reliable
true
passes
true
9
name
"Hunter"
dialogue
"I've got to go. Thank you for your time. If you remember or see anything else, please let someone know."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8250807319698601
tl
0.5888678051865908
da
0.4693877551020409
no
0.4415807560137457
yo
0.38271604938271603
sl
0.35258964143426297
ms
0.33605720122574056
cs
0.3305870236869207
sk
0.2857142857142857
sv
0.2777777777777778
et
0.27616926503340755
pt
0.27616926503340755
pl
0.27211646136618145
hr
0.23529411764705882
nl
0.22248803827751198
fr
0.214975845410628
hu
0.2140266021765417
de
0.1935483870967742
is
0.19154228855721395
vi
0.18238993710691823
lv
0.15803108808290156
lt
0.13793103448275862
ro
0.13333333333333333
sq
0.13333333333333333
es
0.10714285714285714
reliable
true
passes
true