NC Bench
Run: 2026-05-10-01KR8W1PN35WJTJADT47HAD97R
Run Details
Model:
Grok 4.3
Test:
Language Writing
Scenario:
Character dialogue (French) in a story
Duration:
1.1s
Cost:
$0.000505
Provider:
xAI
Evaluation summary
Score:
100.00%
Evaluator:
Parse dialogue
Model input
System
You are an expert novel author in the cozy mystery genre. Take the following details about characters/locations/... into account: <character name="Marie"> Marie is a 30-year old college student on a study abroad program. Born in France, she is fluent in French and does not yet know any English. She enjoys landscape photography and is always seen with her camera. </character> <character name="Hunter"> Hunter is a 35-year old detective who is investigating a series of mysterious disappearances in the small town of Alborn, MN. He is a former marine and has a no-nonsense attitude. Hunter has no knowledge of French, not even basic phrases. </character> <location name="Alborn, MN"> Alborne is a small town in the Midwest with a population of 3,000. It is known for its annual apple festival. </location>
User/Human
Write the following scene: Marie is sitting at a cafe, looking at her camera. Hunter approaches her and asks her about a recent event in the town. She responds troubled and shaken in French, and Hunter looks confused. They talk for a while, until Hunter gets a call and has to leave. Return the dialogue between Marie and Hunter as a voice sheet: Marie: "..." Hunter: "..."
Model response
Messages
Meta
AI
Marie: "..." Hunter: "Excuse me... you were at the park yesterday, weren’t you? I’m Detective Hunter. I need to ask you a few questions about what happened."
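The voice-sheet format requested in the prompt (`Name: "..."` turns) can be split into speaker/line pairs with a small regular expression. The following is a minimal sketch, not the benchmark's actual evaluator: the function name `parse_voice_sheet` and the pattern are assumptions made for illustration, and it assumes the spoken lines themselves contain no double quotes.

```python
import re

# Hypothetical pattern (an assumption, not NC Bench's code): a speaker name,
# a colon, then text wrapped in straight or curly double quotes.
TURN = re.compile(r'(\w+):\s*["\u201c](.*?)["\u201d]', re.DOTALL)


def parse_voice_sheet(text: str) -> list[dict[str, str]]:
    """Split a voice sheet into ordered {"name", "dialogue"} entries."""
    return [
        {"name": m.group(1), "dialogue": m.group(2)}
        for m in TURN.finditer(text)
    ]


# Shortened, illustrative input in the same shape as the response above.
sheet = 'Marie: "..." Hunter: "Excuse me... I\u2019m Detective Hunter."'
turns = parse_voice_sheet(sheet)
```

Each entry keeps the order of appearance, which matches the indexed `dialogue` list shown under Evaluation details.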
created_at
"2026-05-10T11:58:32.107Z"
model
"x-ai/grok-4.3-20260430"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
468
moderation_latency
(null)
generation_time
948
tokens_prompt
316
tokens_completion
42
native_tokens_prompt
382
native_tokens_completion
38
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
service_tier
"default"
usage
0.0005053
router
(null)
provider_responses
0
endpoint_id
"45623cb8-18e5-4b32-8fc4-85d05439982d"
id
"f473d5b6-bcb2-4bb6-56cb-7fe5fda0dfdc"
is_byok
false
latency
138
model_permaslug
"x-ai/grok-4.3-20260430"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778414312-siZV5DlXYtTxAEBJcIez"
session_id
(null)
api_type
"completions"
id
"gen-1778414312-SnD4cepNIUqG44SIEIn4"
upstream_id
"f473d5b6-bcb2-4bb6-56cb-7fe5fda0dfdc"
total_cost
0.0005053
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result:
100.00%
Evaluator:
Parse dialogue
Details:
n/a
Meta Data:
dialogue
0
name
"Marie"
dialogue
"..."
detectedLang
""
heavyLang
""
scores
reliable
false
passes
true
1
name
"Hunter"
dialogue
"Excuse me... you were at the park yesterday, weren’t you? I’m Detective Hunter. I need to ask you a few questions about what happened."
detectedLang
"en"
heavyLang
"en"
scores
en
0.8121477770820289
tl
0.6422893481717011
fr
0.49972206781545303
nl
0.48394495412844035
no
0.47735191637630664
it
0.46587537091988135
da
0.4492044063647491
ca
0.42122186495176844
pt
0.41368078175895767
sq
0.3935309973045822
ro
0.375
cs
0.3653032440056418
es
0.3515850144092219
yo
0.3333333333333333
hr
0.3253373313343328
sl
0.32075471698113206
sv
0.3161094224924012
pl
0.2890995260663507
et
0.2857142857142857
sk
0.2706645056726094
ms
0.2398648648648649
lv
0.22077922077922074
lt
0.21671018276762402
de
0.20983318700614575
hu
0.1985752448797863
tr
0.18625678119349007
fi
0.13294797687861273
is
0.13294797687861273
az
0.1287512100677638
vi
0.1287512100677638
be
0.07216494845360825
eu
0.07216494845360825
uk
0.06542056074766354
reliable
true
passes
true
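In the metadata above, `detectedLang` for each entry coincides with the highest-scoring key in `scores`, and it is an empty string when no scores are present. A minimal sketch of that selection follows. Whether the evaluator derives `detectedLang` this way is an assumption, `top_language` is a hypothetical helper, and the sketch does not attempt to reproduce `heavyLang`, `reliable`, or `passes`.

```python
def top_language(scores: dict[str, float]) -> str:
    """Return the highest-scoring language code, or "" when there are no scores."""
    if not scores:
        return ""
    return max(scores, key=scores.get)


# First three scores reported for Hunter's line; the rest are omitted for brevity.
hunter_scores = {
    "en": 0.8121477770820289,
    "tl": 0.6422893481717011,
    "fr": 0.49972206781545303,
}
hunter_lang = top_language(hunter_scores)

# Marie's entry has no scores in the metadata, so the helper yields "".
marie_lang = top_language({})
```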