NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY7N86KBDRY9TEF8YX2J8WF
Run Details
Model:
Grok 4
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
33.3s
Cost:
$0.015983
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
42.94%
Dialogue to Total Word Ratio
(details)
21.4716%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly paced the cramped motel room, her black turtleneck clashing with the neon Hawaiian shirt she'd thrown on as a disguise. Bill lounged on the sagging bed, munching on stale chips, his eyes glued to a crumpled blueprint of the suburban McMansion where the kids' party was set for tomorrow. "Alright, Bill, listen up," Molly said, jabbing a finger at the blueprint. "The party's at noon. Clowns, balloons, cake—the works. We slip in during the chaos, grab the present pile, and vanish like ghosts. Easy peasy." Bill crunched a chip loudly. "Easy? Molly, it's a kids' party. We're talking sugar-high ankle-biters everywhere. One wrong move and we're buried under a pile of screaming six-year-olds. You really think we can pull this off without ending up on some parent's viral video?" Molly smirked, leaning against the wall. "That's the beauty of it, genius. We go in as entertainers. I've got the clown suits ready—big red noses, floppy shoes, the whole nine yards. You distract 'em with balloon animals, I raid the gift table. Bam! We're out with a haul worth thousands. iPads, bikes, those fancy dolls that cost more than our rent." Bill sat up, wiping crumbs from his shirt. "Clown suits? Molly, I look ridiculous in polka dots. Last time I dressed up, that bank job in Reno, I tripped over my own feet and set off the alarm. Remember? We barely escaped with a piggy bank." She rolled her eyes. "That was because you insisted on the giant chicken outfit. This is different. Kids love clowns! We'll blend right in. Picture it: Little Timmy's birthday bash. Parents too busy snapping photos to notice us swiping the goodies. We stash 'em in the van parked out back, painted like an ice cream truck. Genius, right?" Bill chuckled, shaking his head. "Ice cream truck? Molly, you're a madwoman. What if some kid wants a cone? Do we have sprinkles? And what about the dog? The blueprint shows they've got a yappy terrier named Sparky. One bark and we're toast." Molly waved him off. "Sparky? Please. I've got tranq darts disguised as party poppers. One pop, and Fido's napping like a baby. Focus, Bill. The presents are the jackpot—gift cards, toys, maybe even cash envelopes from rich uncles. We fence 'em downtown, split the profits fifty-fifty. No more ramen noodles for us." He leaned forward, eyes narrowing. "Fifty-fifty? Last heist you took sixty 'cause you said I ate all the getaway snacks. And what if the parents catch on? These suburban types are vicious. I've seen PTA meetings—it's like the mafia with bake sales." Molly laughed, slapping his shoulder. "Quit whining. We'll practice our routine. You say, 'Hey kids, who wants a magic trick?' While I yell, 'Abracadabra, presents disappear!' It's foolproof. Come on, Bill, we're pros. Remember the jewelry store caper? We walked out with diamonds while the guard was busy with our fake mustache prank." Bill grinned reluctantly. "Alright, alright. But if we get chased by a mob of toddlers, you're carrying me out. And no more chicken outfits—ever." "Deal," Molly said, high-fiving him. "Tomorrow, we turn that party into our payday. Clowns stealing presents—who'd believe it?" As they huddled over the blueprint, plotting escape routes and phony accents, the room filled with their scheming chatter, the absurdity of it all making the heist feel like the punchline to a bad joke. But for Molly and Bill, it was just another day in the life of comedic crooks.
created_at
"2026-02-20T19:15:37.31Z"
2/20/2026, 7:15:37 PM
model
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
6611
moderation_latency
(null)
generation_time
33264
tokens_prompt
77
tokens_completion
881
native_tokens_prompt
744
native_tokens_completion
1019
native_tokens_completion_images
0
native_tokens_reasoning
229
native_tokens_cached
682
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0159825
router
(null)
provider_responses
0
endpoint_id
"d40bef9d-6936-4c57-9749-bf7399de4a77"
id
"5ed3c213-8e63-24c3-5991-c5a029dbd059"
is_byok
false
latency
79
model_permaslug
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771614937-jJnPXa3svA2JgADc2TES"
upstream_id
"5ed3c213-8e63-24c3-5991-c5a029dbd059"
total_cost
0.0159825
cache_discount
0.0015345
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
591
42.94%
Dialogue to Total Word Ratio
Ratio: 74.11%, Deviation: 4.11%
neededClean
false
wordsTotal
591
wordsDialogue
438
21.4716%