NC Bench
Run: 2026-05-01-01KQHR219WJA1Q50ATT2VMXMJ9
Run Details
Model: Grok 4.3 (Reasoning)
Test: Language Writing
Scenario: Character dialogue (Hindi) in a story
Duration: 1m 49s
Cost: $0.019247
Provider: xAI
Evaluation summary
Score: 100.00%
Evaluator: Parse dialogue
Details Link: (details)
Model input
System
You are an expert novel author in the cozy mystery genre.

Take the following details about characters/locations/... into account:

<character name="Selvi">
Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera.
</character>

<character name="Hunter">
Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases.
</character>

<location name="Alborn, MN">
Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival.
</location>
User/Human
Write the following scene:

Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave.

Return the dialogue between Selvi and Hunter as a voice sheet:

Selvi: "..."
Hunter: "..."
Model response
Messages
AI
Selvi: "मैं बहुत परेशान हूँ! मैं अपनी कैमरे से भूदृश्य की तस्वीरें ले रही थी जब मैंने देखा कि एक व्यक्ति भागता हुआ गायब हो गया।"
Hunter: "Miss, I'm Detective Hunter. I'm investigating the disappearances here in Alborn. What do you know about the recent event?"
Selvi: "कृपया समझें कि यह बहुत महत्वपूर्ण है। मुझे डर लग रहा है कि ऐसा ही कुछ मेरे साथ भी हो सकता है।"
Hunter: "I don't speak any Hindi. You're going to have to find another way to communicate."
Hunter: "This is getting me nowhere. Hold on - my phone is ringing. Detective Hunter speaking. Another clue? I have to leave immediately."
Meta
created_at: "2026-05-01T12:26:16.78Z" (5/1/2026, 12:26:16 PM)
model: "x-ai/grok-4.3-20260430"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 109024
moderation_latency: (null)
generation_time: 109024
tokens_prompt: 315
tokens_completion: 1160
native_tokens_prompt: 389
native_tokens_completion: 7531
native_tokens_completion_images: (null)
native_tokens_reasoning: 7367
native_tokens_cached: 64
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.01924655
router: (null)
provider_responses:
  0:
    endpoint_id: "45623cb8-18e5-4b32-8fc4-85d05439982d"
    id: "17b2b54d-dfd3-a5cb-e3cb-1ce07b8c7882"
    is_byok: false
    latency: 56
    model_permaslug: "x-ai/grok-4.3-20260430"
    provider_name: "xAI"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777638376-WgtwtERS4u7JVgbklwz7"
session_id: (null)
api_type: "completions"
id: "gen-1777638376-bsRBQzrU7dRQgciJU6EU"
upstream_id: "17b2b54d-dfd3-a5cb-e3cb-1ce07b8c7882"
total_cost: 0.01924655
cache_discount: 0.0000672
upstream_inference_cost: 0
provider_name: "xAI"
response_cache_source_id: (null)
Evaluation details
Result: 100.00%
Evaluator: Parse dialogue
Details: n/a
Meta Data:
dialogue
0:
  name: "Selvi"
  dialogue: "मैं बहुत परेशान हूँ! मैं अपनी कैमरे से भूदृश्य की तस्वीरें ले रही थी जब मैंने देखा कि एक व्यक्ति भागता हुआ गायब हो गया।"
  detectedLang: "hi"
  heavyLang: "hi"
  scores:
    hi: 0.8836668217775709
    mr: 0.8785670916818458
  reliable: true
  passes: true
1:
  name: "Hunter"
  dialogue: "Miss, I'm Detective Hunter. I'm investigating the disappearances here in Alborn. What do you know about the recent event?"
  detectedLang: "en"
  heavyLang: "en"
  scores:
    en: 0.8012820512820513
    tl: 0.6272246272246272
    fr: 0.5938155136268344
    pt: 0.5715865118850193
    ro: 0.5604083947816222
    nl: 0.5571428571428572
    it: 0.519826517967782
    sv: 0.4954427083333333
    ca: 0.49280104712041883
    de: 0.43389335281227176
    sl: 0.4318181818181818
    sq: 0.4216417910447761
    cs: 0.4186046511627907
    es: 0.4177310293012772
    da: 0.404304381245196
    no: 0.40338722093918394
    ms: 0.37449556093623887
    hr: 0.36940602115541094
    sk: 0.35146443514644354
    pl: 0.27637721755368816
    yo: 0.2604961832061069
    az: 0.20675537359263052
    lt: 0.19605809128630705
    lv: 0.15669205658324267
    is: 0.15115005476451257
    et: 0.1426991150442478
    vi: 0.1252821670428894
    fi: 0.07074340527577938
    tr: 0.07074340527577938
  reliable: true
  passes: true
2:
  name: "Selvi"
  dialogue: "कृपया समझें कि यह बहुत महत्वपूर्ण है। मुझे डर लग रहा है कि ऐसा ही कुछ मेरे साथ भी हो सकता है।"
  detectedLang: "hi"
  heavyLang: "hi"
  scores:
    hi: 0.87758813654169
    mr: 0.8678247734138973
  reliable: true
  passes: true
3:
  name: "Hunter"
  dialogue: "I don't speak any Hindi. You're going to have to find another way to communicate."
  detectedLang: "en"
  heavyLang: "en"
  scores:
    en: 0.8162992651970608
    tl: 0.6442432082794307
    da: 0.5366470092670598
    it: 0.4821092278719397
    ca: 0.3834080717488789
    no: 0.37570942111237227
    fr: 0.36194895591647336
    ro: 0.34210526315789475
    sk: 0.33171324422843257
    pl: 0.32182490752157833
    hr: 0.3167701863354037
    nl: 0.301143583227446
    cs: 0.3002544529262086
    et: 0.2912371134020619
    yo: 0.289405684754522
    sq: 0.28664072632944226
    sv: 0.27344782034346105
    lv: 0.25170068027210885
    es: 0.23717059639389734
    lt: 0.2329149232914923
    is: 0.22644163150492264
    az: 0.1935483870967742
    sl: 0.18032786885245902
    hi: 0.12420382165605094
    de: 0.12280701754385964
    pt: 0.1143317230273752
    be: 0.11290322580645161
    tr: 0.11290322580645161
    hu: 0.1085899513776337
    uk: 0.10277324632952692
    vi: 0.10277324632952692
  reliable: true
  passes: true
4:
  name: "Hunter"
  dialogue: "This is getting me nowhere. Hold on - my phone is ringing. Detective Hunter speaking. Another clue? I have to leave immediately."
  detectedLang: "en"
  heavyLang: "en"
  scores:
    en: 0.8040119898547383
    tl: 0.6484698097601322
    da: 0.5156695156695157
    nl: 0.5023419203747073
    it: 0.495249406175772
    no: 0.495249406175772
    fr: 0.4498381877022654
    ca: 0.42450914014895064
    de: 0.4121715076071923
    ms: 0.40014114326040934
    hr: 0.3997175141242938
    cs: 0.3984430290162774
    ro: 0.39329050678087085
    sk: 0.37820043891733723
    sq: 0.37591776798825255
    sv: 0.375
    et: 0.36377245508982037
    es: 0.3541033434650456
    pt: 0.35064935064935066
    pl: 0.3395493395493396
    yo: 0.3301812450748621
    sl: 0.32486100079428115
    fi: 0.26279271465741544
    is: 0.26151172893136404
    lt: 0.23007246376811596
    hu: 0.20634920634920634
    vi: 0.18738049713193117
    lv: 0.17874396135265702
    tr: 0.1317671092951992
    hi: 0.08993576017130622
    be: 0.07608695652173912
    az: 0.06900328587075576
    uk: 0.06900328587075576
  reliable: true
  passes: true
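
The page does not show how the "Parse dialogue" evaluator is implemented. The following is only a minimal sketch of the general idea, under stated assumptions: the voice sheet is split into `Name: "..."` entries, and each entry is checked against the language expected for its speaker. The names `parse_voice_sheet`, `devanagari_share`, and `check_entries` are hypothetical, and a simple Unicode script check stands in for the scored language detector whose output appears above.

```python
import re

# Hypothetical pattern for one voice-sheet entry: Name: "spoken text"
ENTRY = re.compile(r'(\w+):\s*"([^"]*)"')


def parse_voice_sheet(text):
    """Return (speaker, line) tuples in order of appearance."""
    return [(m.group(1), m.group(2)) for m in ENTRY.finditer(text)]


def devanagari_share(line):
    """Fraction of alphabetic characters in the Devanagari block (U+0900-U+097F)."""
    letters = [ch for ch in line if ch.isalpha()]
    if not letters:
        return 0.0
    deva = sum(1 for ch in letters if "\u0900" <= ch <= "\u097F")
    return deva / len(letters)


def check_entries(entries, expected):
    """Label each entry "hi" or "en" by script and compare with the expected language.

    Assumption: a 0.5 threshold on the Devanagari share is a stand-in for a
    real language detector; it is not the benchmark's actual method.
    """
    results = []
    for speaker, line in entries:
        detected = "hi" if devanagari_share(line) >= 0.5 else "en"
        results.append({
            "name": speaker,
            "detectedLang": detected,
            "passes": detected == expected.get(speaker),
        })
    return results


if __name__ == "__main__":
    sample = 'Selvi: "मैं बहुत परेशान हूँ!" Hunter: "I don\'t speak any Hindi."'
    for row in check_entries(parse_voice_sheet(sample), {"Selvi": "hi", "Hunter": "en"}):
        print(row)
```

A script check like this cannot tell Hindi from Marathi, since both are written in Devanagari; the close `hi` and `mr` scores in the results above suggest that the separation is narrow even for a scored detector.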