NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY7NXB453064YWG5HVQDPMT
Run Details
Model:
Grok 4 Fast
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
11.0s
Cost:
$0.000503
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
97.63%
Matches word count
(details)
96.78%
Dialogue to Total Word Ratio
(details)
97.2053%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly hunched over the rickety kitchen table in their cramped apartment, a crumpled map of the suburban neighborhood spread out like a battlefield. Bill paced behind her, fidgeting with a half-eaten donut, crumbs dusting his rumpled shirt. The target: a lavish birthday bash for some rich kid turning six, loaded with presents from oblivious guests. "Okay, Bill, listen up," Molly said, tapping the map with a greasy finger. "The party's at the McMansion on Elm Street, starts at noon. We'll hit during the cake-cutting chaos. Kids screaming, parents distracted—perfect cover." Bill stopped pacing, his eyes lighting up like a kid spotting candy. "Cake-cutting? Genius! So we dress as, what, piñata repairmen? I could swing a stick around, knock out a few squealers if they get too close." Molly rolled her eyes. "No swinging sticks, you idiot. We're going in as party clowns. I found these outfits at the thrift store—red noses, floppy shoes, the works. You'll be 'Captain Chuckles,' I'll be 'Jolly Judy.' We mingle, distract with balloon animals, then stuff the gifts into oversized prop bags." "Balloon animals?" Bill chuckled, inflating his cheeks mockingly. "I can do a mean dachshund. But what if the birthday brat wants one? Or worse, cries when we don't give him the real pony he probably asked for?" "Improvise," Molly snapped, sketching a quick layout. "The present table's in the backyard tent. We 'entertain' for ten minutes, then boom—sweep the haul. Toys, games, that shiny bike—fence 'em for quick cash. No violence, just sleight of hand." Bill leaned in, squinting at the map. "Sleight of hand? With floppy shoes? I'll trip over my own feet! Remember that jewelry gig? I knocked over the whole display case yelling 'Abracadabra!'" "Exactly why you're on decoy duty," Molly said, smirking. "Waddle around making dumb jokes. 'Why did the balloon go to school? To get inflated!' Keep the rugrats laughing while I load up. Parents won't suspect clowns of grand theft preschool." He grinned, rubbing his chin. "Alright, but escape plan? Backyard gate leads to the alley—we rabbit with the swag in a fake ice cream cart. I saw one online, cheap. Honk the horn, yell 'Free cones!' and vanish." Molly nodded, folding the map. "Smart for once. Timer on your watch—five minutes in the tent, then signal me with a whoopee cushion fart. If cops show, we split: you take the kids' faces with pie, I grab the goods." "Pie in the face? Now that's comedy gold!" Bill laughed, slapping the table. "This'll be our best score yet. Stealing from snot-nosed brats? Poetic justice for all the times we got nothing but coal." "Focus, Bill," Molly warned, standing up. "Rehearse the routine tonight. One slip, and we're the punchline in orange jumpsuits." Bill saluted sloppily. "Aye aye, Captain! To the heist of the century—may the presents be merry and bright!" As they clinked coffee mugs like champagne flutes, Molly couldn't help but smile. In their line of work, laughter was the best disguise.
created_at
"2026-02-20T19:15:58.962Z"
2/20/2026, 7:15:58 PM
model
"x-ai/grok-4-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2939
moderation_latency
(null)
generation_time
10735
tokens_prompt
77
tokens_completion
777
native_tokens_prompt
216
native_tokens_completion
984
native_tokens_completion_images
0
native_tokens_reasoning
274
native_tokens_cached
215
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00050295
router
(null)
provider_responses
0
endpoint_id
"c9586d9d-97ff-40ca-ae94-22504d6f9b7e"
id
"1b99060b-1ab9-0aaa-26a8-b783cb4612c6"
is_byok
false
latency
133
model_permaslug
"x-ai/grok-4-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771614958-adG8EQeFYe1CtjDJ7JYw"
upstream_id
"1b99060b-1ab9-0aaa-26a8-b783cb4612c6"
total_cost
0.00050295
cache_discount
0.00003225
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
97.63%
Matches word count
n/a
neededClean
false
words
507
96.78%
Dialogue to Total Word Ratio
Ratio: 70.81%, Deviation: 0.81%
neededClean
false
wordsTotal
507
wordsDialogue
359
97.2053%