NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY83MQFE4C4RJE5BK65A1C8
Run Details
Model:
Grok 4 Fast
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
10.1s
Cost:
$0.000460
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
99.12%
Dialogue to Total Word Ratio
(details)
49.5621%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly hunched over the rickety kitchen table, her eyes gleaming under the harsh fluorescent light. Bill paced back and forth, twirling a lockpick like a baton. The blueprint of the suburban mansion was spread out before them, dotted with crayon scribbles from Bill's earlier "artistic interpretation." "Alright, Bill, spill it," Molly said, jabbing a finger at the diagram. "Why a kids' birthday bash? We're thieves, not clowns." Bill stopped pacing and leaned in, his mustache twitching. "Think about it, Mol. Every snot-nosed tyke in a ten-mile radius is showing up with gifts. Toys, gadgets, cash-stuffed envelopes—it's a goldmine! No security, just balloons and bad cake." Molly snorted, sipping her lukewarm coffee. "And parents? You know, the ones who chase you with minivans?" "Disguises, baby! We'll go as party entertainers. I'm the Amazing Billini, magician extraordinaire. You’re... uh, his sultry assistant, Magina." "Magina? That's the best you got? Sounds like a rejected X-Men villain." She smirked, circling a spot on the map. "Fine. We slip in through the backyard at 2 PM sharp, when the piñata's swinging. Kids distracted, parents tipsy on punch." Bill nodded vigorously. "Exactly! I dazzle 'em with card tricks—poof, wallet's gone. You handle the gift table. Swoop in like a hawk, stuff the swag into those oversized clown bags." "Clown bags? Bill, we're not robbing a circus. What if a kid sees us? 'Hey mister, why's your magic wand a crowbar?'" He waved her off. "Improv! Tell 'em it's part of the show. 'Abracadabra, presents disappear!' Then we bolt to the ice cream truck we rented—" "You rented an ice cream truck? With what, Monopoly money?" "Stolen plates, Mol. It's genius. We park it out front, blaring tunes. Escape route: down Elm Street, ditch the truck in the woods, hike to the van." Molly rubbed her temples. "Hike? In heels? And the loot—teddy bears and video games? We gonna fence Elmo dolls on the black market?" Bill grinned, unfazed. "Prime resale! Kids' stuff flies off shelves. Plus, think of the irony: we steal joy from joy-bringers." She leaned back, crossing her arms. "Okay, contingencies. If the birthday boy cries, we abort. No toddler tantrums on my rap sheet." "Deal. And if cops show? 'Officer, we were just here for the cake!' Butter 'em up with a joke." Molly chuckled despite herself. "You're insane, Bill. But yeah, this could work. 50-50 split, and no eating the candy mid-heist." He saluted. "Scout's honor. To the heist of the century—pinata pandemonium!" As Bill launched into an impromptu magic trick with a spoon, Molly shook her head, already plotting how to tweak the plan. Thieves or not, this duo was a disaster waiting to sparkle.
created_at
"2026-02-20T19:23:28.888Z"
2/20/2026, 7:23:28 PM
model
"x-ai/grok-4-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2008
moderation_latency
(null)
generation_time
9631
tokens_prompt
77
tokens_completion
710
native_tokens_prompt
216
native_tokens_completion
879
native_tokens_completion_images
0
native_tokens_reasoning
214
native_tokens_cached
154
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0004596
router
(null)
provider_responses
0
endpoint_id
"c9586d9d-97ff-40ca-ae94-22504d6f9b7e"
id
"f4752264-48ff-ccfd-4ccd-64429c68c945"
is_byok
false
latency
166
model_permaslug
"x-ai/grok-4-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771615408-JAtBMzdXSjgab4uheYWx"
upstream_id
"f4752264-48ff-ccfd-4ccd-64429c68c945"
total_cost
0.0004596
cache_discount
0.0000231
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
453
99.12%
Dialogue to Total Word Ratio
Ratio: 70.42%, Deviation: 0.42%
neededClean
false
wordsTotal
453
wordsDialogue
319
49.5621%