NC Bench
Run: 2026-02-20-01KHYAS5ZDVYD2B16XZKYST22C
Run Details
Model: Grok 4 Fast
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 9.4s
Cost: $0.000501
Provider: xAI
Evaluation summary
Score      Evaluator
95.99%     Matches word count
0.00%      Dialogue to Total Word Ratio
47.9934%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly hunched over the rickety kitchen table in their cramped apartment, a half-eaten pizza slice dangling from her fingers like a limp victory flag. Blueprints of the community center sprawled across the surface, marked up with red Sharpie doodles of cartoonish thieves in balaclavas. Bill paced behind her, his oversized trench coat swishing like a deflating balloon, sweat beading on his forehead despite the chilly autumn draft seeping through the window. "This is genius, Bill," Molly said, tapping a circled spot on the map. "The Little Timmy's Birthday Bash—full of spoiled brats and their guilt-ridden parents' guiltier gifts. Piñatas stuffed with cash equivalents, bikes wrapped in shiny paper. We swoop in, grab the haul, and vanish before the cake's even cut." Bill stopped pacing, rubbing his chin dramatically. "Swoop in how? We're not birds, Mol. And kids? They're like tiny watchdogs with sticky fingers. One scream, and it's chaos. Remember the pet store fiasco? We tripped over those hamsters and ended up owing the owner for therapy." Molly waved him off, her eyes gleaming with manic glee. "Details, schmetails. Plan's simple: You dress as the clown—rented that squirting flower getup online. I'll be the face painter, blending in with the artsy moms. We set up near the present pile, which, per my recon from the PTA Facebook group, is in the gym corner. At 2 PM sharp, when the games start, I cause a distraction—maybe 'accidentally' spill punch on the magician." Bill groaned, slumping into a chair that creaked in protest. "Punch? Last time you 'distracted' with food, we had to flee from angry vegans. And what if the clown suit rips? I look like a walking sausage in that thing." "Perfect camouflage!" Molly shot back, sketching a quick getaway route. "Parents love clowns. Honk your horn, juggle some balls—real ones, not the metaphorical kind—and while they're laughing, I'll stuff the swag into these duffel bags disguised as party favors. 
We exfiltrate through the back door by the bounce house. Kids'll be too busy bouncing to notice." He leaned in, peering at the map. "Bounce house? That's our cover? What if it deflates? Or worse, what if Timmy's got a guard dog disguised as a pony?" Molly snorted, flicking a pepperoni crumb at him. "No ponies, Bill—it's a unicorn theme. But fine, backup: You fake a pratfall into the cake. Total pandemonium. We grab and go. Think of the fence value—Xbox, dolls, that drone everyone's posting about. We're talking five figures easy." Bill paused, a sly grin cracking his worry lines. "Alright, but if I end up pied in the face for real, you're explaining it to the wife. Er, I mean, our 'handler.'" "Deal," she replied, rolling up the blueprints. "Tomorrow, we rehearse. No more cold feet—this heist's our big top break." As Bill nodded reluctantly, Molly's phone buzzed with a party invite confirmation. The clock ticked toward madness, and in the dim light, their shadows danced like awkward conspirators plotting the silliest score of the season.
created_at: "2026-02-20T20:10:11.831Z"
model: "x-ai/grok-4-fast"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2708
moderation_latency: (null)
generation_time: 9314
tokens_prompt: 77
tokens_completion: 779
native_tokens_prompt: 216
native_tokens_completion: 961
native_tokens_completion_images: 0
native_tokens_reasoning: 268
native_tokens_cached: 153
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.00050075
router: (null)
provider_responses:
  0:
    endpoint_id: "c9586d9d-97ff-40ca-ae94-22504d6f9b7e"
    id: "ef6bdf3b-6566-aefc-32ab-3e3a96e7ac7e"
    is_byok: false
    latency: 71
    model_permaslug: "x-ai/grok-4-fast"
    provider_name: "xAI"
    status: 200
api_type: "completions"
id: "gen-1771618211-uOmE8uaZMxQ0pyrFBMcf"
upstream_id: "ef6bdf3b-6566-aefc-32ab-3e3a96e7ac7e"
total_cost: 0.00050075
cache_discount: 0.00002295
upstream_inference_cost: 0
provider_name: "xAI"
Evaluation details
Result     Evaluator                      Details                            Meta Data
95.99%     Matches word count             n/a                                neededClean: false, words: 508
0.00%      Dialogue to Total Word Ratio   Ratio: 64.37%, Deviation: 34.37%   neededClean: false, wordsTotal: 508, wordsDialogue: 327
47.9934%   Overall
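
The ratio and deviation reported above can be reproduced from the word counts in the Meta Data column. The sketch below is illustrative only. The function names are hypothetical, the quote-matching rule (words inside straight double quotes) is an assumption about how dialogue might be counted, and treating the overall score as a plain mean of the two evaluator scores is likewise an assumption rather than documented NC Bench behavior. The scoring curves that turn a deviation into a percentage score are not shown on this page, so they are not modeled here.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens."""
    return len(text.split())


def dialogue_word_count(text: str) -> int:
    """Count words inside straight double quotes (assumed counting rule)."""
    spans = re.findall(r'"([^"]*)"', text)
    return sum(count_words(span) for span in spans)


def ratio_and_deviation(words_dialogue: int, words_total: int, target: float):
    """Return (dialogue ratio, absolute deviation from the target ratio)."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts taken from the evaluation details; the 30% target is from the prompt.
ratio, deviation = ratio_and_deviation(327, 508, 0.30)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Assumption: overall = mean of the two evaluator scores. Under that reading,
# the unrounded word-count score implied by the reported overall is:
implied_word_score = 2 * 47.9934 - 0.00
print(f"Implied word-count score: {implied_word_score:.2f}%")
```

Under the mean assumption, the implied word-count score rounds to the 95.99% shown in the table, which is consistent with the displayed scores being rounded versions of the values that feed the overall figure.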