NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE9YNC7XB8S2E9CHP8DSN86
Run Details
Model:
Claude Sonnet 5 (Reasoning)
Test:
Language Writing
Scenario:
Character dialogue (Hindi) in a story
Duration:
13.7s
Cost:
$0.011406
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."
Model response
Messages
Meta
AI
**INT. CAFÉ ON MAIN STREET — ALBORN, MN — AFTERNOON** *Selvi sits at a small table by the window, scrolling through photos on her camera's display. Her hands tremble slightly as she pauses on one image. The bell above the door chimes as Hunter walks in, scanning the room before making his way toward her.* Hunter: "Excuse me, miss. Detective Hunter, Alborn PD. Got a minute?" Selvi: *(startled, looking up)* "क्या? माफ़ कीजिए, मुझे समझ नहीं आया।" *(What? I'm sorry, I don't understand.)* Hunter: *(pausing, brow furrowed)* "Uh... you speak English?" Selvi: *(shaking her head, gesturing helplessly toward her camera)* "मैंने कुछ देखा... जंगल में... मुझे नहीं पता ये क्या था।" *(I saw something... in the woods... I don't know what it was.)* Hunter: "Okay, hold on—slow down. I don't know what you're saying." Selvi: *(voice shaking, pointing at the camera screen)* "ये तस्वीर... मुझे डर लग रहा है।" *(This photo... I'm scared.)* Hunter: *(leaning in, trying to see the screen)* "Is that... can I look at that?" *Selvi hesitates, then slowly turns the camera toward him. On the screen is a blurred photograph—shadows among the trees, something pale in the distance.* Hunter: *(quietly)* "Where was this taken?" Selvi: *(shaking her head again, frustrated tears welling up)* "मुझे अंग्रेज़ी नहीं आती... कोई मदद करेगा?" *(I don't know English... will someone help?)* Hunter: "I—I need someone who can translate. Stay right here, okay? Don't go anywhere." *He holds up a hand, gesturing for her to wait, when his phone buzzes in his pocket. He pulls it out, checking the screen, his jaw tightening.* Hunter: "Hunter." *(pause)* "When?" *(pause)* "I'm on my way." *He looks back at Selvi, torn, then reaches into his jacket for a card and sets it on the table.* Hunter: "I have to go. Find someone—anyone—who speaks your language. I'll be back." *Selvi watches him go, clutching her camera tightly to her chest, eyes still fixed on the disturbing image on the screen.*
created_at
"2026-07-01T07:40:32.019Z"
7/1/2026, 7:40:32 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4180
moderation_latency
(null)
generation_time
13650
tokens_prompt
315
tokens_completion
696
native_tokens_prompt
423
native_tokens_completion
1056
native_tokens_completion_images
(null)
native_tokens_reasoning
76
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011406
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01SWqxdf9idaJZrf9EUU31Kd"
is_byok
false
latency
1954
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891632-6YuGDt9RbhmuOpyIDUpp"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891632-qPRBol2dBq3yUyXSYk3E"
upstream_id
"msg_01SWqxdf9idaJZrf9EUU31Kd"
total_cost
0.011406
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Hunter"
dialogue
"Excuse me, miss. Detective Hunter, Alborn PD. Got a minute?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.726962457337884
ro
0.6393146979260595
it
0.6343692870201096
fr
0.5726495726495726
tl
0.5698924731182796
pt
0.5670995670995671
es
0.5449374288964732
sl
0.5294117647058824
ca
0.5277449822904369
sq
0.5209580838323353
no
0.4910941475826972
de
0.48717948717948717
nl
0.47506561679790027
hr
0.4444444444444444
da
0.4326241134751773
et
0.42775393419170243
cs
0.4108983799705449
sv
0.40298507462686567
ms
0.39849624060150374
fi
0.3442622950819672
sk
0.33884297520661155
pl
0.29577464788732394
hu
0.2805755395683453
lt
0.26605504587155965
tr
0.2537313432835821
yo
0.22928709055876687
is
0.15789473684210525
az
0.14893617021276595
eu
0.14893617021276595
lv
0.14893617021276595
vi
0.14893617021276595
reliable
true
passes
true
1
name
"Selvi"
dialogue
"क्या? माफ़ कीजिए, मुझे समझ नहीं आया।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8826291079812206
mr
0.8715596330275229
reliable
true
passes
true
2
name
"Hunter"
dialogue
"Uh... you speak English?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7905027932960894
tl
0.5426829268292682
no
0.35344827586206895
cs
0.34210526315789475
da
0.32432432432432434
de
0.32432432432432434
hr
0.32432432432432434
ms
0.32432432432432434
ro
0.32432432432432434
sv
0.32432432432432434
reliable
true
passes
true
3
name
"Selvi"
dialogue
"मैंने कुछ देखा... जंगल में... मुझे नहीं पता ये क्या था।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.885616242493566
mr
0.8825601879036994
reliable
true
passes
true
4
name
"Hunter"
dialogue
"Okay, hold on—slow down. I don't know what you're saying."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8043599776411403
tl
0.5098039215686274
da
0.4176372712146422
ca
0.40374787052810895
hr
0.3934142114384749
et
0.38271604938271603
pl
0.375
cs
0.33962264150943394
tr
0.3269230769230769
yo
0.31640625
fi
0.3
sk
0.29577464788732394
no
0.29292929292929293
sv
0.2827868852459016
sq
0.2768595041322314
it
0.24731182795698922
fr
0.2374727668845316
ro
0.20454545454545453
hi
0.1935483870967742
be
0.16666666666666666
hu
0.1606714628297362
az
0.15254237288135594
is
0.15254237288135594
lt
0.15254237288135594
lv
0.15254237288135594
uk
0.15254237288135594
vi
0.15254237288135594
reliable
true
passes
true
5
name
"Selvi"
dialogue
"ये तस्वीर... मुझे डर लग रहा है।"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8806682577565632
mr
0.8803418803418803
reliable
true
passes
true
6
name
"Hunter"
dialogue
"Is that... can I look at that?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8486377396569122
tl
0.696969696969697
da
0.6767241379310345
no
0.665924276169265
hu
0.5956873315363882
ca
0.589041095890411
yo
0.5798319327731093
nl
0.5145631067961165
is
0.5081967213114754
tr
0.494949494949495
az
0.4827586206896552
hr
0.4809688581314879
sv
0.4791666666666667
sq
0.471830985915493
pl
0.45652173913043476
it
0.43396226415094336
cs
0.4117647058823529
ro
0.375
sk
0.32432432432432434
be
0.3181818181818182
et
0.29577464788732394
lt
0.29577464788732394
lv
0.29577464788732394
uk
0.29577464788732394
vi
0.29577464788732394
reliable
true
passes
true
7
name
"Hunter"
dialogue
"Where was this taken?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.8412698412698413
nl
0.6314496314496314
de
0.5677233429394812
tl
0.39999999999999997
no
0.3951612903225807
sv
0.3670886075949367
yo
0.3670886075949367
sk
0.34210526315789475
sl
0.34210526315789475
sq
0.34210526315789475
reliable
true
passes
true
8
name
"Selvi"
dialogue
"मुझे अंग्रेज़ी नहीं आती... कोई मदद करेगा?"
detectedLang
"hi"
heavyLang
"hi"
scores
hi
0.8819444444444444
mr
0.875036753895913
reliable
true
passes
true
9
name
"Hunter"
dialogue
"I—I need someone who can translate. Stay right here, okay? Don't go anywhere."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8045347928068803
tl
0.5819397993311036
sq
0.43438914027149317
no
0.4165694282380397
pl
0.37810945273631835
ca
0.3670886075949367
hr
0.3564993564993565
da
0.3556701030927836
et
0.3429697766097241
ro
0.3394980184940555
it
0.3359893758300133
yo
0.32249322493224936
de
0.3045897079276773
tr
0.29577464788732394
az
0.284692417739628
sv
0.2826398852223816
cs
0.24698795180722888
pt
0.2401215805471125
es
0.2366412213740458
fr
0.23547400611620795
vi
0.23195084485407066
sk
0.2125984251968504
lt
0.20634920634920634
lv
0.20634920634920634
be
0.12280701754385964
eu
0.11816578483245149
hu
0.11816578483245149
is
0.11190053285968028
nl
0.11190053285968028
uk
0.11190053285968028
ms
0.10554561717352416
sl
0.10554561717352416
reliable
true
passes
true
10
name
"Hunter"
dialogue
"I'm on my way."
detectedLang
"en"
heavyLang
"en"
scores
en
0.830220713073005
pl
0.6376811594202898
cs
0.6168582375478927
sk
0.6168582375478927
tl
0.6153846153846154
et
0.6062992125984252
fi
0.6
fr
0.5215311004784688
ca
0.49748743718592964
hr
0.46808510638297873
hi
0.45652173913043476
yo
0.42857142857142855
reliable
true
passes
true
11
name
"Hunter"
dialogue
"I have to go. Find someone—anyone—who speaks your language. I'll be back."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8189786585365854
tl
0.5960884353741497
da
0.5484790874524715
yo
0.4994731296101159
pl
0.43720379146919436
no
0.42143727161997563
lt
0.4186046511627907
et
0.3708609271523179
ro
0.3683510638297872
hr
0.3493150684931507
cs
0.3319268635724332
sk
0.3004418262150221
lv
0.2803030303030303
hu
0.27921092564491656
ms
0.2578125
is
0.23633440514469453
ca
0.22512234910277326
sv
0.22512234910277326
sq
0.22003284072249588
sl
0.2030201342281879
it
0.19491525423728812
de
0.15480427046263345
nl
0.15480427046263345
es
0.1410488245931284
eu
0.1410488245931284
pt
0.1410488245931284
be
0.12844036697247707
fr
0.12844036697247707
tr
0.12844036697247707
az
0.1171003717472119
uk
0.1171003717472119
vi
0.1171003717472119
reliable
true
passes
true