NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8ZTY0CT4T08S0XHEWG3D7Q
Run Details
Model:
Grok 4.3
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
3.4s
Cost:
$0.000538
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"I'm telling you this is the mother lode," Molly whispered from behind the bushes. "Twenty screaming kids, rich parents that will forget half the gifts in the chaos." Bill adjusted his ill-fitting elf costume. "I still think the elf outfit idea was terrible. These pointy shoes are killing me. How are we supposed to blend in with actual children?" "Blend in? We've got to steal, not socialize, Bill. Twenty kids under the tree, a pile of colorful boxes begging to be grabbed while everyone's distracted by the clown." "The clown?" Bill's eyebrows shot up. "You didn't mention any professional entertainment." "The parents hired some guy in makeup. He'll keep the brats busy with his mediocre magic tricks while we stuff bags full of whatever's under that tree. Easy money." "Until some four-year-old recognizes our faces from last year's Christmas market job," Bill grumbled.
created_at
"2026-05-10T13:04:44.584Z"
5/10/2026, 1:04:44 PM
model
"x-ai/grok-4.3-20260430"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
392
moderation_latency
(null)
generation_time
3345
tokens_prompt
77
tokens_completion
228
native_tokens_prompt
182
native_tokens_completion
178
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
128
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
service_tier
"default"
usage
0.0005381
router
(null)
provider_responses
0
endpoint_id
"45623cb8-18e5-4b32-8fc4-85d05439982d"
id
"e089886e-12a1-9455-8573-356475224946"
is_byok
false
latency
107
model_permaslug
"x-ai/grok-4.3-20260430"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778418284-lHTB6QNLHq5ac0BOSopG"
session_id
(null)
api_type
"completions"
id
"gen-1778418284-w9hhSlMeRF9vIok0WiRN"
upstream_id
"e089886e-12a1-9455-8573-356475224946"
total_cost
0.0005381
cache_discount
0.0001344
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
143
0.00%
Dialogue to Total Word Ratio
Ratio: 86.99%, Deviation: 36.99%
neededClean
false
wordsTotal
146
wordsDialogue
127
0.0000%