NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYA3S0BJP0F22E3RB87VFF5
Run Details
Model:
Grok 4.1 Fast
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
15.9s
Cost:
$0.000538
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
99.74%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.8722%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly hunched over the cluttered kitchen table in their cramped apartment, surrounded by crumpled party invitations, a half-eaten pizza, and a suspiciously large duffel bag. Bill paced back and forth, his eyes wild with the gleam of a man who'd just cracked the code to easy money. Pinatas dangled from the ceiling fan like macabre holiday ornaments, courtesy of Bill's "research trip" to the dollar store. "Listen, Moll," Bill said, slapping a glossy invite down. "Rich kids' birthday bash at the McMansion on Elm Street. Parents are loaded—tech bros and yoga influencers. Presents piled high: iPads, drones, those creepy robot dogs. We swipe 'em all, fence 'em by Monday. Boom, retirement fund!" Molly snorted, picking at a pepperoni. "Retirement? Last heist you called 'foolproof' left us dodging mall cops in elf costumes. Kids' party? Screaming brats, helicopter moms, and balloon animals that could double as witnesses." Bill grinned, unfazed, pulling out two garish clown wigs. "That's the genius! We go in as entertainers. I found this flyer: 'Clown duo wanted for gig.' We RSVP as 'Giggles and Grumbles.' Juggle, twist balloons into getaway cars, then—poof!—distract with the piñata smash. While candy flies, we stuff the sack." "Thirty screaming rugrats hyped on sugar?" Molly arched an eyebrow, but her lips twitched. "And the parents? One wrong balloon animal, and we're viral on TikTok: 'Clown Heist Fail.'" Bill waved it off, miming a dramatic flourish. "Timing's key. Cake-cutting at 3 PM—everyone's eyes on frosting. We rig the piñata with a quick-release bottom. Presents under the gift table? We 'accidentally' knock it over during the magic trick. Abracadabra, they're ours! Exit through the bounce house—zip line to the van disguised as the catering truck." Molly leaned back, crossing her arms. "Bounce house zip line? You couldn't zip your fly last week. What about security? Rich folks got cameras, dogs, maybe a nanny cam in the unicorn float." "Dogs love clowns!" Bill barked a laugh. "Peanut butter on the pies distracts Fido. Cameras? We wear these mirrored shades—reflect city. Nanny cam? Blame the ghost of parties past." He struck a pose, wig flopping sideways. She couldn't help chuckling. "Fine, genius. But if a kid pukes on my red nose, you're cleaning it. And no ad-libbing jokes about 'stealing the show'—too on the nose." Bill pumped his fist. "Deal! Rehearsal tonight: you juggle the pizza slices, I'll practice the sack toss." He scooped up the duffel, testing its heft. "This haul's gonna be legendary. Kids cry, we cash in—poetic justice!" Molly shook her head, grabbing a wig. "Poetic larceny, more like. Just don't trip over a toddler on the way out." As Bill launched into a wobbly juggling act with empty beer cans, the fan whirred overhead, piñatas swaying like conspirators. Outside, suburbia slumbered, oblivious to the clown apocalypse brewing. Tomorrow, Elm Street would never be the same—presents vanishing in a whirlwind of honks and helium. Heist o'clock.
created_at
"2026-02-20T19:58:30.422Z"
2/20/2026, 7:58:30 PM
model
"x-ai/grok-4.1-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
404
moderation_latency
(null)
generation_time
15760
tokens_prompt
77
tokens_completion
838
native_tokens_prompt
216
native_tokens_completion
1034
native_tokens_completion_images
0
native_tokens_reasoning
314
native_tokens_cached
151
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00053755
router
(null)
provider_responses
0
endpoint_id
"9821aa0e-dce3-4c27-9716-161359123d0b"
id
"4cf143d7-1e8c-d3bc-3f64-5c78ae5abb5b"
is_byok
false
latency
100
model_permaslug
"x-ai/grok-4.1-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771617510-bVfH7wd6sOs0AP4kq8Uu"
upstream_id
"4cf143d7-1e8c-d3bc-3f64-5c78ae5abb5b"
total_cost
0.00053755
cache_discount
0.00002265
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
99.74%
Matches word count
n/a
neededClean
false
words
496
0.00%
Dialogue to Total Word Ratio
Ratio: 61.49%, Deviation: 31.49%
neededClean
false
wordsTotal
496
wordsDialogue
305
49.8722%