NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY9H69XKSDV53EG93GX99S4
Run Details
Model:
Grok 4 Fast
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
8.9s
Cost:
$0.000483
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
98.71%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.3562%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly hunched over the greasy booth table in the back of the all-night diner, her eyes darting suspiciously at the flickering neon sign outside. Bill slouched across from her, munching on a half-eaten burger, crumbs dusting his ill-fitting trench coat. The plan was absurd, even for them—crashing a kiddie birthday bash to swipe every shiny-wrapped present in sight. But with the holidays around the corner and their rent overdue, desperate times called for pint-sized larceny. "Listen, Bill," Molly whispered, sliding a crumpled map of the neighborhood across the table. "The party's at the Johnsons' place on Elm Street. Big backyard blowout for little Timmy turning six. Balloons, cake, and a mountain of gifts from doting aunts and uncles. We hit it at peak chaos—right after the piñata bursts." Bill wiped ketchup from his chin, grinning like a kid himself. "Piñata? Genius! Kids'll be swarming like piranhas. We dress as clowns, yeah? I got this red nose from that Halloween gig gone wrong." Molly snorted, jabbing the map with her finger. "Clowns? Are you high? Last time you 'acted' in costume, you tripped over your own shoelaces and knocked out the security guard. No, we're going as entertainers. Magicians. I'll be the glamorous assistant, you'll be the bumbling wizard pulling rabbits out of hats—or in our case, presents out of the gift table." Bill leaned in, eyes widening. "Ooh, rabbits! I could really pull off a rabbit. But what if the kids spot us stuffing sacks? They're tiny detectives, Mol. One screams 'thief!' and we're toast." She rolled her eyes, sketching a quick diagram on a napkin. "Distraction's key. You do the magic show—wave your wand, spout nonsense rhymes. While the rugrats are mesmerized, I slip behind the bounce house and load up. We've got ten minutes tops before the parents sober up from the punch. Exit through the side gate, van parked two blocks away." "Sounds solid," Bill said, nodding vigorously. "But the presents—worth it? Could be socks or educational toys. I want gold, Mol. Or at least gift cards." "Idiot," Molly hissed, flicking a fry at him. "It's not about the haul's value; it's volume. Fence 'em online as 'gently used' from a yard sale. We'll clear enough for a month. Just don't go rogue and 'magic' a pony or something." Bill chuckled, flexing imaginary muscles. "Me? Rogue? Nah, I'll stick to the script. Abracadabra, presents gone—poof!" As the waitress refilled their coffees, Molly folded the map, her mind racing through contingencies. Bill's enthusiasm was infectious, but she knew his track record: the botched jewelry store caper where he'd mistaken a mannequin for a guard. Still, in the dim diner glow, the heist felt almost plausible—like a punchline waiting to land. "Tomorrow at noon," she said finally. "Rehearse your patter. And Bill? No real rabbits." He saluted with his burger. "Aye, captain. To the great present pandemonium!" Outside, the city hummed with oblivious night life, while inside, two thieves plotted their silliest score yet.
created_at
"2026-02-20T19:48:21.457Z"
2/20/2026, 7:48:21 PM
model
"x-ai/grok-4-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1772
moderation_latency
(null)
generation_time
8298
tokens_prompt
77
tokens_completion
783
native_tokens_prompt
216
native_tokens_completion
926
native_tokens_completion_images
0
native_tokens_reasoning
228
native_tokens_cached
153
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00048325
router
(null)
provider_responses
0
endpoint_id
"c9586d9d-97ff-40ca-ae94-22504d6f9b7e"
id
"c4a25376-1cec-9d88-8e12-855225df066e"
is_byok
false
latency
106
model_permaslug
"x-ai/grok-4-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771616901-R5J3SwI4gu8VUScTirLt"
upstream_id
"c4a25376-1cec-9d88-8e12-855225df066e"
total_cost
0.00048325
cache_discount
0.00002295
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
98.71%
Matches word count
n/a
neededClean
false
words
506
0.00%
Dialogue to Total Word Ratio
Ratio: 55.93%, Deviation: 25.93%
neededClean
false
wordsTotal
506
wordsDialogue
283
49.3562%