NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-10-01KR8Z1Y4ANXZJH0E4Y9QNV8VR
Run Details
Model:
Grok 4.3
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
7.8s
Cost:
$0.001445
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly handed Bill a crumpled party invitation under the dim glow of the streetlamp. “You sure this is smart, Molly? 40 kids hopped up on sugar and birthday cake? That’s a disaster waiting to happen.” “It’s not the kids we’re after, it’s the loot,” she whispered, leaning closer. “Five gift tables full of brand-new gadgets, toys, and who knows what else—easy pickings if we time it right.” Bill scratched his chin. “And how exactly do we waltz past a bunch of screaming five-year-olds in glitter crowns?” Molly grinned and pulled a child-sized wizard robe from her backpack. “We dress up as entertainers. You wear this, pretend you’re the balloon guy, and I’ll be the magician with a big box we wheel in. Once inside we swap it for an empty box while everyone’s distracted.” “That’s your plan?” Bill looked skeptical. “A balloon animal isn’t exactly stealth mode.” “Relax,” she said, adjusting the robe over his shoulders. “The parents leave in the corner while the kids play games. You make a few squeaky giraffes, I pull a ‘rabbit’ from the hat that’s actually our duffel. Then we quietly wheel the box out the back door.” Bill sighed. “And if someone notices the presents are disappearing?” Molly tapped her temple. “Then we blame it on the mysterious birthday gremlin. Kids love stories—by the time the parents check, we’ll be three blocks away counting our haul.” “Okay, but what about security cams?” “Two old ladies running the party rented a bounce house and hired us on Craigslist. You really think they’re live-streaming the goings-on? We’re golden.” Bill tugged the robe tighter. “And after we grab everything—what do we do with fifty pounds of wrapping paper and plastic dinosaurs?” “Dump the junk in the park dumpster, keep anything worth a fence’s time, and split the money fifty-fifty,” Molly replied, stuffing extra rubber bands in her pocket. “Piece of cake, Bill.” He laughed quietly. “Famous last words before cake goes everywhere.” Molly checked her watch. “Party starts in twenty. Grab your balloon pump—we’re the entertainment, remember?” Bill straightened his ridiculous hat. “Right. Just don’t forget your ‘magic wand’ can’t actually vaporize any parents who catch us.” “Only if they try to stop me,” she said, smiling.
created_at
"2026-05-10T12:51:05.49Z"
5/10/2026, 12:51:05 PM
model
"x-ai/grok-4.3-20260430"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
410
moderation_latency
(null)
generation_time
7716
tokens_prompt
77
tokens_completion
629
native_tokens_prompt
182
native_tokens_completion
514
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
service_tier
"default"
usage
0.0014453
router
(null)
provider_responses
0
endpoint_id
"45623cb8-18e5-4b32-8fc4-85d05439982d"
id
"ec9907fc-dbe5-981a-9043-5c3b8536e459"
is_byok
false
latency
126
model_permaslug
"x-ai/grok-4.3-20260430"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1778417465-gzT91WQAWUT9PsG0LOkK"
session_id
(null)
api_type
"completions"
id
"gen-1778417465-JJwaXGOTTX8zI0Cmiwcj"
upstream_id
"ec9907fc-dbe5-981a-9043-5c3b8536e459"
total_cost
0.0014453
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
375
0.00%
Dialogue to Total Word Ratio
Ratio: 79.06%, Deviation: 29.06%
neededClean
false
wordsTotal
382
wordsDialogue
302
0.0000%