NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR92ERH7TM80DPB0VNHB55M0
Run Details
Model:
Grok 4.3
Test:
Language Writing
Scenario:
Character dialogue (Hindi) in a story
Duration:
1.2s
Cost:
$0.000644
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Selvi"> Selvi is a 30-year old college student on a study abroad program. Born in India, she is fluent in Hindi and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Hindi, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Selvi is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Hindi, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Selvi and Hunter as a voice sheet: Selvi: "..." Hunter: "..."
Model response
Messages
Meta
AI
Selvi: "..." Hunter: "Hi, I’m Detective Hunter. Mind if I ask you about what happened last night near the festival grounds?" Hunter: "Sorry, I didn’t catch that. Could you say it again in English?" Hunter: "I’m just trying to figure out why two people have gone missing this week. Any detail helps." Hunter: "That call’s from the station. I’ll have to go. Take care."
created_at
"2026-05-10T13:50:31.471Z"
5/10/2026, 1:50:31 PM
model
"x-ai/grok-4.3-20260430"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
411
moderation_latency
(null)
generation_time
893
tokens_prompt
315
tokens_completion
99
native_tokens_prompt
387
native_tokens_completion
91
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
service_tier
"default"
usage
0.00064405
router
(null)
provider_responses
0
endpoint_id
"45623cb8-18e5-4b32-8fc4-85d05439982d"
id
"59bd202e-7f4c-9c30-9f66-890154c85f81"
is_byok
false
latency
117
model_permaslug
"x-ai/grok-4.3-20260430"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778421031-jLCeBsEWCZiVaipHz3nr"
session_id
(null)
api_type
"completions"
id
"gen-1778421031-QPqkx0qeME5DPyUgyZMj"
upstream_id
"59bd202e-7f4c-9c30-9f66-890154c85f81"
total_cost
0.00064405
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Selvi"
dialogue
"..."
detectedLang
""
heavyLang
""
scores
reliable
false
passes
true
1
name
"Hunter"
dialogue
"Hi, I’m Detective Hunter. Mind if I ask you about what happened last night near the festival grounds?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.812564632885212
tl
0.6875
ca
0.5078071961982349
fr
0.493006993006993
no
0.4761560693641619
sv
0.4723435225618632
da
0.4704163623082542
it
0.43092621664050235
ro
0.4172025723472669
sq
0.4052502050861362
nl
0.4037828947368421
cs
0.3787489288774636
pt
0.3744607420189819
hr
0.35954063604240283
es
0.3324125230202578
et
0.3255813953488372
de
0.30355427473583096
ms
0.30355427473583096
sl
0.3015414258188825
yo
0.26988922457200404
hu
0.23442449841605068
sk
0.20765027322404372
lt
0.20678336980306347
pl
0.20678336980306347
az
0.1997792494481236
tr
0.15105386416861827
lv
0.14806110458284372
be
0.0880503144654088
fi
0.0799492385786802
is
0.0799492385786802
uk
0.0799492385786802
vi
0.0799492385786802
reliable
true
passes
true
2
name
"Hunter"
dialogue
"Sorry, I didn’t catch that. Could you say it again in English?"
detectedLang
"en"
heavyLang
"en"
scores
en
0.828581739444244
tl
0.6409674981103552
it
0.4791666666666667
sv
0.46206115515288787
nl
0.42839951865222625
ro
0.3902439024390244
de
0.3823146944083225
yo
0.3457300275482094
no
0.3233618233618234
sq
0.31654676258992803
da
0.30758017492711365
hr
0.30758017492711365
pl
0.3065693430656934
fr
0.2994100294985251
cs
0.27811550151975684
tr
0.26356589147286824
sl
0.2448330683624801
lv
0.23878205128205127
az
0.23633440514469453
ms
0.23633440514469453
ca
0.22512234910277326
pt
0.1739130434782609
sk
0.1316270566727605
be
0.12844036697247707
hu
0.12361623616236161
et
0.1171003717472119
is
0.1171003717472119
lt
0.1171003717472119
uk
0.1171003717472119
vi
0.1171003717472119
reliable
true
passes
true
3
name
"Hunter"
dialogue
"I’m just trying to figure out why two people have gone missing this week. Any detail helps."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8265895953757225
tl
0.6312231100184389
da
0.46428571428571425
nl
0.4174757281553398
ca
0.4117647058823529
fr
0.39759036144578314
no
0.3492407809110629
sv
0.30151338766006985
ro
0.25742574257425743
de
0.2518703241895262
pt
0.24528301886792453
fi
0.2277992277992278
hr
0.2277992277992278
et
0.21773142112125163
lt
0.21568627450980393
is
0.20212765957446807
sq
0.18699186991869918
yo
0.18588873812754408
es
0.1847826086956522
pl
0.1836734693877551
it
0.18256130790190736
sk
0.18032786885245902
cs
0.1792065663474692
lv
0.16897506925207756
sl
0.1678224687933426
ms
0.15492957746478872
az
0.09774436090225565
tr
0.09228441754916791
reliable
true
passes
true
4
name
"Hunter"
dialogue
"That call’s from the station. I’ll have to go. Take care."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8346560846560848
tl
0.715261958997722
da
0.5786516853932584
yo
0.5598591549295775
no
0.49048913043478265
nl
0.46808510638297873
pl
0.44933920704845814
fr
0.4360902255639098
it
0.4204018547140649
de
0.41588785046728977
ro
0.4131455399061033
sl
0.4113029827315541
sv
0.4085173501577287
cs
0.3409490333919156
sk
0.33745583038869253
lv
0.33035714285714285
hr
0.3231046931407942
lt
0.3231046931407942
sq
0.27884615384615385
ms
0.267578125
es
0.20042643923240938
fi
0.17943107221006563
zh
0.17035398230088497
az
0.14383561643835618
et
0.1359447004608295
tr
0.1359447004608295
reliable
true
passes
true