NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8Y1VSFET23ZHSZZ3AVHMNZ
Run Details
Model:
Grok 4.3
Test:
Language Writing
Scenario:
Character dialogue (Italian) in a story
Duration:
1.2s
Cost:
$0.000590
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Parse dialogue
(details)
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Isabella"> Isabella is a 30-year old college student on a study abroad program. Born in Italy, she is fluent in Italian and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of Italian, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Isabella is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in Italian, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Isabella and Hunter as a voice sheet: Isabella: "..." Hunter: "..."
Model response
Messages
Meta
AI
Isabella: "Mi dispiace, ma non parlo bene l'inglese... vuoi dire i due che sono spariti? Questo posto è... non lo so, sconvolgente. Ho paura, sai?" Hunter: "I'm sorry, I don't understand. Miss, I'm Detective Hunter with the local PD. We're looking into the recent disappearances—"
created_at
"2026-05-10T12:33:34.519Z"
5/10/2026, 12:33:34 PM
model
"x-ai/grok-4.3-20260430"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
421
moderation_latency
(null)
generation_time
1152
tokens_prompt
321
tokens_completion
73
native_tokens_prompt
386
native_tokens_completion
70
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
service_tier
"default"
usage
0.0005903
router
(null)
provider_responses
0
endpoint_id
"45623cb8-18e5-4b32-8fc4-85d05439982d"
id
"2ad56c67-90ef-97c4-94ba-e914533931ea"
is_byok
false
latency
150
model_permaslug
"x-ai/grok-4.3-20260430"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778416414-uXZeLuA7GfAlJhyehK12"
session_id
(null)
api_type
"completions"
id
"gen-1778416414-ZDmsPmYRiR7hguSjsllQ"
upstream_id
"2ad56c67-90ef-97c4-94ba-e914533931ea"
total_cost
0.0005903
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Parse dialogue
n/a
dialogue
0
name
"Isabella"
dialogue
"Mi dispiace, ma non parlo bene l'inglese... vuoi dire i due che sono spariti? Questo posto è... non lo so, sconvolgente. Ho paura, sai?"
detectedLang
"it"
heavyLang
"it"
scores
it
0.797124971017853
sk
0.5187018701870187
fr
0.51171875
es
0.5011402508551881
ca
0.4862008220786847
en
0.4785458879618594
yo
0.4693753790175864
cs
0.4544887780548629
pt
0.44864524259609323
no
0.4465528146742568
hr
0.4358478401031592
sl
0.4329228775113415
ro
0.4281045751633987
tl
0.39320388349514557
sq
0.39109255393180237
et
0.3718592964824121
da
0.3613138686131387
lt
0.359912216532553
de
0.34505988023952094
pl
0.34210526315789475
vi
0.3238021638330757
tr
0.28101889893179954
sv
0.2762613730355666
ms
0.26284751474304974
hu
0.25595238095238093
nl
0.19651056014692378
az
0.17994376757263356
eu
0.15865384615384615
fi
0.15295256534365925
is
0.13877952755905512
lv
0.12236710130391174
be
0.07407407407407407
uk
0.06716417910447761
reliable
true
passes
true
1
name
"Hunter"
dialogue
"I'm sorry, I don't understand. Miss, I'm Detective Hunter with the local PD. We're looking into the recent disappearances—"
detectedLang
"en"
heavyLang
"en"
scores
en
0.7955532839253769
tl
0.6061053668143771
fr
0.550561797752809
ro
0.5329830706363106
pt
0.5189416716776909
nl
0.5154451847365233
ca
0.5145631067961165
it
0.5034140285536933
sv
0.5009357454772302
no
0.4939911448450348
da
0.4907702100572883
es
0.38837920489296635
de
0.357429718875502
cs
0.33665008291873966
sq
0.3127147766323024
ms
0.2812219227313567
sk
0.27992799279927993
pl
0.25442684063373716
hr
0.2516370439663237
is
0.20634920634920634
yo
0.19517102615694165
sl
0.1911021233569262
lv
0.14621131270010673
tr
0.13885898815931108
lt
0.13606911447084233
be
0.08045977011494253
hu
0.07727797001153403
az
0.07300115874855156
et
0.07300115874855156
uk
0.07300115874855156
vi
0.07300115874855156
reliable
true
passes
true