NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY8X6ZG4P7GNQYD69E95B7N
Run Details
Model:
Grok 4.1 Fast
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
15.7s
Cost:
$0.000458
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
30.47%
Dialogue to Total Word Ratio
(details)
15.2326%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly slammed a crumpled birthday invitation onto the scarred diner table, her eyes gleaming under the flickering neon sign. Bill, nursing a coffee that smelled like regret, leaned in, his mustache twitching like a nervous ferret. "Listen up, Bill," Molly hissed. "Rich kid's turning six tomorrow. Mansion on Elm Street. Piñata, pony rides, and a mountain of presents worth a cool fifty grand resale. We're crashing the party." Bill's eyes widened. "A kids' party? Molly, those rugrats are feral! Last heist with munchkins, one bit my ankle. And the parents? Helicopters with wine boxes." "Exactly why it's genius," Molly shot back, jabbing the map. "Chaos cover. We go in as entertainers. You—clown. Me—balloon animal queen. Snag the gift table during cake time." Bill snorted coffee. "Clown? I got a phobia of red noses. They'll smell fake faster than week-old fish sticks. What if the pony kicks my ass?" "Pony's your lookout," Molly grinned. "Tie it up near the exit. Kids chase it, we dash with swag. Presents: Lego Death Stars, iPads, gold-plated bikes. Flip 'em on the dark web by Tuesday." Bill scratched his chin. "Dark web? That's hacker stuff. I can barely work the microwave. And the clown car? Ours is a rusty Pinto. Fits one midget and a horn." "Improvise!" Molly snapped. "Borrow from that circus freak down the block. Park a block away, tunnel in via the hedge maze playground. Timing: Games at noon, cake at one—boom, grab and ghost." "Ghost how?" Bill whined. "Balloon animals as diversion? Twist a giraffe that explodes confetti bombs?" "Better," Molly leaned closer. "Inflate one giant snake, rig it to chase the birthday boy. Total pandemonium. We load sacks into the pony trailer—voilà, thieves like ninjas." Bill guffawed. "Ninjas at a tea party? The clown'll cramp my style. Makeup itches. What if a kid pukes on the wig?" "Suck it up, Bozo. Practice your laugh: Ha-ha-heist! We'll split sixty-forty, my way. You in or chicken?" Bill paused, then smirked. "Fine. But if that pony brays, I'm blaming you. And get me a bigger shoe—size twelve for getaway stomps." Molly high-fived him. "Deal. Tomorrow, we party-crash and present-plunder. World's dumbest heist, biggest score." As the diner clock ticked toward midnight, the two schemers plotted deeper, laughter echoing like bad omens. Outside, rain pattered, washing the streets clean for their absurd caper.
created_at
"2026-02-20T19:37:26.779Z"
2/20/2026, 7:37:26 PM
model
"x-ai/grok-4.1-fast"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
683
moderation_latency
(null)
generation_time
15560
tokens_prompt
77
tokens_completion
699
native_tokens_prompt
216
native_tokens_completion
894
native_tokens_completion_images
0
native_tokens_reasoning
308
native_tokens_cached
215
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00045795
router
(null)
provider_responses
0
endpoint_id
"9821aa0e-dce3-4c27-9716-161359123d0b"
id
"0e34473e-0e91-ce00-7f8a-371fe46bf176"
is_byok
false
latency
471
model_permaslug
"x-ai/grok-4.1-fast"
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771616246-9mY1uTmPI9gY5Iaf5R24"
upstream_id
"0e34473e-0e91-ce00-7f8a-371fe46bf176"
total_cost
0.00045795
cache_discount
0.00003225
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
402
30.47%
Dialogue to Total Word Ratio
Ratio: 74.88%, Deviation: 4.88%
neededClean
false
wordsTotal
402
wordsDialogue
301
15.2326%