NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY90DCQ09997PZ1CYH9VT3D
Run Details
Model:
Grok 4.1 Fast
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
13.3s
Cost:
$0.000503
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
81.57%
Dialogue to Total Word Ratio
(details)
40.7841%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly and Bill huddled in their cramped van outside the community center, blueprints of the party venue spread across the dashboard like a kid's finger-painting disaster. Molly, wiry and wild-eyed, chewed on a licorice whip while Bill, a burly guy with a perpetual five-o'clock shadow, nursed a lukewarm coffee. "Listen up, Bill," Molly hissed, jabbing the map. "Timmy Turner's bash. Spoiled brat's got presents stacked to the ceiling—Xboxen, bikes, drones. We're swiping the lot during the piñata smackdown." Bill snorted, spilling coffee. "Piñata? Kids whackin' candy with sticks? Sounds like our kinda chaos. But how we blend? Show up in ski masks screamin' 'Gimme the loot!'?" Molly grinned, pulling out two clown costumes from a duffel. "Nah, genius. We go in as Bozo and Betty the Birthday Bandits—er, entertainers. You juggle the presents out the back while I distract with balloon animals shaped like getaway cars." Bill tugged on a floppy red nose, which honked. "Juggle? Last time I juggled, I dropped your grandma's urn. And what about the rugrats? Little terrors'll swarm us like piranhas on piñata guts." "Exactly!" Molly cackled. "We'll rig the piñata with knockout gas—kiddie-safe, mostly. Parents pass out from the cake fumes anyway. You grab the gift table, I hot-wire the bounce house for cover. Load it into the van disguised as a prop." Bill scratched his head, wiggling into oversized shoes that squeaked. "Bounce house heist? Brilliant, Mol. But the cake? That five-tier monster's guarded by Aunt Edna types. One wrong move, we're eatin' fistfuls of frosting." Molly waved a fake wand. "Abracadabra, poof! I saw it in half with this. Presents vanish into my magic sack. You play ringmaster: 'Step right up, folks, watch the birthday boy pick his nose—er, pony!' We time it for the limbo—limbo low, visibility zero." Bill practiced a bow, toppling a stack of fake gold coins. "Limbo? I limbo like a hippo on stilts. What if Timmy cries? Kids' waterworks summon cops faster than free ice cream." "Then we cry too!" Molly demoed crocodile tears. "'Wah, we're sad clowns, officer! The presents ran away!' Bail's chump change. Escape's the cakewalk—reverse the party bus outta there, presents piled like a piñata prize." Bill nodded, honking his nose. "Risky, but hilarious. We're not thieves, Mol—we're party poopers extraordinaire. Tonight, Timmy gets socks, we get the socks-off haul!" Molly high-fived him, the van rocking. "To the heist of the century! Candy coating optional." As they squeezed into costumes, laughter echoed, blueprints fluttering like confetti in the wind.
created_at
"2026-02-20T19:39:11.651Z"
2/20/2026, 7:39:11 PM
model
"x-ai/grok-4.1-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1531
moderation_latency
(null)
generation_time
12644
tokens_prompt
77
tokens_completion
748
native_tokens_prompt
216
native_tokens_completion
985
native_tokens_completion_images
0
native_tokens_reasoning
350
native_tokens_cached
215
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00050345
router
(null)
provider_responses
0
endpoint_id
"9821aa0e-dce3-4c27-9716-161359123d0b"
id
"de8e0b31-aa4d-b6b3-9cb7-e53e4c0b5355"
is_byok
false
latency
1336
model_permaslug
"x-ai/grok-4.1-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771616351-9nIEJ7U2v6TwZ9l2wYSJ"
upstream_id
"de8e0b31-aa4d-b6b3-9cb7-e53e4c0b5355"
total_cost
0.00050345
cache_discount
0.00003225
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
431
81.57%
Dialogue to Total Word Ratio
Ratio: 67.98%, Deviation: 2.02%
neededClean
false
wordsTotal
431
wordsDialogue
293
40.7841%