NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYAW2WGHMYWYA41GQ3XZTF1
Run Details
Model:
Grok 4 Fast
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
6.0s
Cost:
$0.000464
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
20.19%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
10.0948%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly hunched over the rickety kitchen table in their cramped apartment, a crumpled map of the upscale neighborhood spread out like a defeated picnic blanket. Bill paced behind her, his oversized trench coat swishing comically with each step, as if he were auditioning for a spy parody. The air smelled of stale coffee and desperation, but their eyes gleamed with the kind of mischief that only comes from plotting against the innocent. "Look, Bill, it's perfect," Molly said, jabbing a finger at a circle she'd drawn around Elmwood Manor. "Rich parents throwing a birthday bash for little Timmy or whatever. Balloons, cake, and a mountain of presents that'll make our eyes water. We swoop in, grab the loot, and vanish before the piñata even bursts." Bill stopped pacing and leaned in, his bushy eyebrows knitting together like wooly caterpillars in a brawl. "Swoop in? Molly, it's a kids' party. There'll be clowns, face paint, and a horde of sugar-rushed munchkins. One wrong move, and we're dodging sticky fingers and water balloons. What's the entry plan? Parachute from the roof?" Molly snorted, sketching a crude diagram of the backyard. "Nah, too flashy. We'll pose as entertainers. You know, the balloon animal guy. I saw this tutorial online—twist a few dachshunds, keep 'em busy. Meanwhile, I'll be the 'gift coordinator,' shuffling boxes to the van under the guise of organizing the pile." Bill chuckled, but it came out more like a wheeze. "Balloon animals? Me? Last time I tried that, it looked like a deflated giraffe with a grudge. And what if some pint-sized detective asks for a unicorn? I'll end up popping the whole operation." "Improvise!" Molly shot back, her voice rising with excitement. She mimed twisting an invisible balloon, nearly knocking over her coffee mug. "Distract with dad jokes. 'Why did the present go to school? To get wrapped up in learning!' Boom, they're laughing, not looking." Bill rubbed his chin, eyeing the map suspiciously. "And the getaway? The driveway's probably crawling with minivans and soccer moms with yoga pants tighter than our alibis. We need a diversion. Maybe release a flock of pigeons? Or better—fake a unicorn sighting. Kids go nuts for that mythical crap." Molly's eyes lit up. "Pigeons? From where, your pocket? No, stick to the plan: You herd the kiddos for games, I load the van. Timer on your watch—ten minutes tops. If things go south, signal with the chicken dance. No one questions a fool in a clown suit." "Chicken dance?" Bill groaned, but a grin cracked his face. "Fine, but if I end up with cake in my hair and a balloon sword duel scar, you're buying the dry cleaning. This heist better net us enough for a real vacation, not another motel crawl." As the sun dipped low outside their grimy window, Molly folded the map with a flourish, her mind already racing through the chaos. Bill cracked his knuckles, ready to embrace the absurdity. In the world of petty crime, nothing beat turning a toddler's joy into jackpot gold—provided they didn't trip over a tricycle first.
created_at
"2026-02-20T20:11:46.969Z"
2/20/2026, 8:11:46 PM
model
"x-ai/grok-4-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1495
moderation_latency
(null)
generation_time
5565
tokens_prompt
77
tokens_completion
782
native_tokens_prompt
216
native_tokens_completion
887
native_tokens_completion_images
0
native_tokens_reasoning
201
native_tokens_cached
154
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0004636
router
(null)
provider_responses
0
endpoint_id
"c9586d9d-97ff-40ca-ae94-22504d6f9b7e"
id
"31e3e3a1-25b5-4a0c-786e-8af656184ec6"
is_byok
false
latency
538
model_permaslug
"x-ai/grok-4-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771618306-Scoj439jsr9iWVyaA9KJ"
upstream_id
"31e3e3a1-25b5-4a0c-786e-8af656184ec6"
total_cost
0.0004636
cache_discount
0.0000231
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
20.19%
Matches word count
n/a
neededClean
false
words
520
0.00%
Dialogue to Total Word Ratio
Ratio: 58.46%, Deviation: 28.46%
neededClean
false
wordsTotal
520
wordsDialogue
304
10.0948%