NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY86PJXK5HM82V1X5H5C80Q
Run Details
Model:
Grok 4
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
32.9s
Cost:
$0.016085
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
75.16%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
37.5778%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
In the flickering glow of a single bulb dangling from the ceiling of their cramped apartment, Molly paced back and forth, her wild red curls bouncing like overexcited springs. Bill slouched on a threadbare couch, nursing a lukewarm beer and scribbling notes on a napkin that looked suspiciously like it came from a fast-food joint. The air smelled of stale pizza and desperation—the perfect ambiance for plotting a heist that would make even the most bumbling cartoon villains proud. "Alright, Bill, listen up," Molly said, slamming her fist into her palm with mock seriousness. "This kids' party is our golden ticket. Little Timmy's turning eight, and that means a mountain of presents—video games, bikes, maybe even cash in those envelopes from cheapskate uncles." Bill snorted, peering over his glasses. "Molly, you want us to crash a kiddie bash? What, dress up as the Easter Bunny? Those rugrats'll spot us faster than a dropped ice cream cone." She grinned, her eyes sparkling with mischief. "Exactly! We'll go in as clowns. Big red noses, floppy shoes—the works. Kids love clowns. We'll blend right in, juggling while we case the joint." "Clowns? Last time I wore makeup, I looked like a deranged mime. And what about the parents? They're not idiots. One wrong move, and we're dodging soccer moms with pepper spray." Molly waved him off, plopping down beside him. "Pfft, parents at these things are too busy guzzling punch and gossiping to notice. We'll slip in during the cake-cutting chaos. You distract with balloon animals—make a giraffe or something ridiculous—and I'll swipe the gift table. Easy peasy." Bill rubbed his chin, skepticism etched on his face. "Balloon animals? I can barely tie my shoes without tripping. And what if there's a piñata? Those things explode like confetti bombs. We'll be buried in candy wrappers before we grab a single toy." "Details, details," Molly shot back, leaning in closer. "Think big, Bill! We're not just thieves; we're artistes of acquisition. Imagine the haul: Legos, action figures, that new drone everyone's raving about. We'll fence 'em online as 'gently loved treasures.'" He chuckled despite himself. "Gently loved? More like freshly pilfered. Alright, say we pull this off—exit strategy? Can't just waddle out in clown gear with sacks of swag." "Simple: We park the van as the party bus. Load up under the guise of 'surprise entertainment delivery.' Honk the horn like it's part of the fun, and vamoose before anyone realizes the presents vanished with the clowns." Bill shook his head, but a sly smile crept across his lips. "You're insane, Molly. But damn if it doesn't sound hilarious. Fine, I'm in. Just promise no face paint that makes me itch." "Deal," she said, high-fiving him. "Tomorrow, we become the Merry Marauders of Mayhem. This party's about to get a lot less festive—for everyone but us." As they toasted with their beers, the napkin plan fluttered to the floor, a blueprint of absurdity destined for either glory or a slapstick getaway chase involving tricycles and water balloons.
created_at
"2026-02-20T19:25:09.098Z"
2/20/2026, 7:25:09 PM
model
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
9214
moderation_latency
(null)
generation_time
32793
tokens_prompt
77
tokens_completion
792
native_tokens_prompt
744
native_tokens_completion
1035
native_tokens_completion_images
0
native_tokens_reasoning
357
native_tokens_cached
743
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.01608525
router
(null)
provider_responses
0
endpoint_id
"d40bef9d-6936-4c57-9749-bf7399de4a77"
id
"fe6a0ebf-ce81-f6b0-8790-c017649d29d6"
is_byok
false
latency
101
model_permaslug
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771615509-8YAZoPuTYowt4WHkGDEW"
upstream_id
"fe6a0ebf-ce81-f6b0-8790-c017649d29d6"
total_cost
0.01608525
cache_discount
0.00167175
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
75.16%
Matches word count
n/a
neededClean
false
words
513
0.00%
Dialogue to Total Word Ratio
Ratio: 64.91%, Deviation: 34.91%
neededClean
false
wordsTotal
513
wordsDialogue
333
37.5778%