NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY9XNS225ASE9JHRHMGZZ8J
Run Details
Model:
Grok 4 Fast
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
4.9s
Cost:
$0.000483
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.08%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0424%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly hunched over the rickety kitchen table in their cramped apartment, surrounded by crumpled maps and half-eaten pizza slices. The glow from a single lamp cast long shadows, making her look like a pint-sized mastermind plotting world domination. Bill, her lanky partner in crime, paced back and forth, his oversized trench coat swishing like a deflated parachute. "Alright, Bill," Molly said, jabbing a finger at a doodled floor plan of the McMansion. "The party's at 2 PM sharp. Rich kid's birthday—think piñatas stuffed with cash equivalents. Presents piled high in the living room. We swoop in, grab the loot, and vanish before the cake's even cut." Bill stopped pacing and scratched his stubbled chin. "Swoop in how? It's a kids' party, Mol. Balloons, clowns, screaming rugrats everywhere. We're not exactly blending with the sippy-cup crowd." She grinned, pulling out two floppy clown costumes from a duffel bag. "Disguises! I'll be Wacky Wanda, you'll be Silly Sammy. Horns, big shoes, the works. No one suspects the entertainment." Bill snorted, holding up the red nose. "Me? In this? I look like a deranged tomato. And what about the parents? They're not idiots—probably hired real clowns already." "We crash the act," Molly shot back, her eyes sparkling with mischief. "Show up unannounced, say the agency's double-booked or whatever. Juggle a bit, do some pratfalls. While they're laughing, I distract with balloon animals—" "Balloon animals? You can't even blow up a whoopee cushion without passing out," Bill interrupted, flopping into a chair. "And the presents? How do we haul 'em out? Giant sack over the shoulder like Santa on steroids?" "Exactly!" Molly nodded enthusiastically. "You make the diversion—fake a pie fight or something. I'll snag the gifts through the back door. Load 'em into the van parked in the alley. Easy peasy, lemon squeezy." Bill leaned forward, whispering dramatically. "What if the kids spot us? One of those little monsters tattles, and we're toast. 'Mommy, the clown stole my pony!'" She waved him off. "Kids love clowns. We'll give 'em extra candy to shut 'em up. And if things go south, Plan B: Blame the real clown. Say he framed us in a fit of greasepaint jealousy." Bill chuckled despite himself. "You're insane, you know that? Stealing from a kiddie bash. What's next, robbing the Easter Bunny?" Molly high-fived him across the table. "Hey, if the furries have loot, we're in. Now, practice your honk. We leave at noon." As Bill squeezed the clown horn, letting out a pathetic toot, Molly stifled a laugh. Their heist was ridiculous, but in the world of petty crime, it was the kind of harebrained scheme that just might work—or end in a slapstick chase through a bounce house. Either way, the presents would be theirs, wrapped in comedy gold.
created_at
"2026-02-20T19:55:10.51Z"
2/20/2026, 7:55:10 PM
model
"x-ai/grok-4-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1308
moderation_latency
(null)
generation_time
4578
tokens_prompt
77
tokens_completion
725
native_tokens_prompt
216
native_tokens_completion
945
native_tokens_completion_images
0
native_tokens_reasoning
288
native_tokens_cached
215
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00048345
router
(null)
provider_responses
0
endpoint_id
"c9586d9d-97ff-40ca-ae94-22504d6f9b7e"
id
"781ac350-3621-c9da-5174-3953048cb54e"
is_byok
false
latency
74
model_permaslug
"x-ai/grok-4-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771617310-iREA5UX3GoLjyjTT6qT5"
upstream_id
"781ac350-3621-c9da-5174-3953048cb54e"
total_cost
0.00048345
cache_discount
0.00003225
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.08%
Matches word count
n/a
neededClean
false
words
471
0.00%
Dialogue to Total Word Ratio
Ratio: 58.81%, Deviation: 28.81%
neededClean
false
wordsTotal
471
wordsDialogue
277
0.0424%