NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYAVDJWHGYBXCDQKQ6YKTTD
Run Details
Model:
Grok 4
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
24.4s
Cost:
$0.010664
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her fake mustache in the dimly lit van parked across from the bustling suburban house, where balloons bobbed like drunken sailors and laughter echoed from the backyard. Bill, her partner in crime, fiddled with a clown costume that was two sizes too small, his gut straining against the polka-dotted fabric. "This is foolproof," Molly whispered, peering through binoculars at the pile of wrapped gifts under a pinata. "Kids' parties are chaos central. We'll blend in as entertainers, grab the loot during the cake smash, and vanish like ghosts." Bill snorted, adjusting his oversized shoes. "Entertainers? I look like a rejected circus reject. What if they ask for balloon animals?" Molly grinned. "Improvise. Twist a giraffe or something. Remember, the big score's in those presents—video games, toys, maybe even cash envelopes." A scream pierced the air as a kid face-planted into the bouncy castle. Bill chuckled. "Perfect distraction. Let's do this before I split these pants." They slipped out, hearts pounding with illicit thrill. Molly led, whispering, "Stick to the plan, no detours for punch." Bill nodded. "Got it. Heist of the century, here we come." As they approached, a toddler waddled over, eyeing Bill's wig. "You funny clown?" Bill froze. "Uh, yeah, kid. Abra-cadabra!" He mimed pulling a quarter from his ear, but dropped it in the grass. Molly stifled a laugh, dragging him toward the gift table. The party swirled around them—screaming children, harried parents, and a piñata bursting with candy. They stuffed presents into oversized sacks disguised as magic bags, dodging sticky hands and flying streamers. One nosy mom approached. "Are you the hired clowns?" Molly beamed. "Absolutely! Watch this!" She juggled three wrapped boxes, nearly dropping one. Bill added a honk from a fake nose. "Ta-da!" They escaped just as the birthday song started, vanishing into the night with their haul, giggling like the fools they were.
created_at
"2026-02-20T20:11:25.161Z"
2/20/2026, 8:11:25 PM
model
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
6472
moderation_latency
(null)
generation_time
24381
tokens_prompt
77
tokens_completion
515
native_tokens_prompt
744
native_tokens_completion
667
native_tokens_completion_images
0
native_tokens_reasoning
234
native_tokens_cached
699
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.01066425
router
(null)
provider_responses
0
endpoint_id
"d40bef9d-6936-4c57-9749-bf7399de4a77"
id
"87422f52-7f20-0c67-8e06-813e2b63b54d"
is_byok
false
latency
79
model_permaslug
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771618285-DCYodElopR9hoFDfwj4D"
upstream_id
"87422f52-7f20-0c67-8e06-813e2b63b54d"
total_cost
0.01066425
cache_discount
0.00157275
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
318
0.00%
Dialogue to Total Word Ratio
Ratio: 32.70%, Deviation: 22.70%
neededClean
false
wordsTotal
318
wordsDialogue
104
0.0000%