NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94ZBC79Z4T7BQVZQ315XRN
Run Details
Model:
Gemini 2.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
1m 19s
Cost:
$0.032943
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
2.32%
Dialogue to Total Word Ratio
(details)
1.1601%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped a manicured finger against the shimmering, glitter-encrusted invitation. "Pony Princess Palace," she announced, her tone drier than a week-old birthday cake. "Tiffany's fourth. Pink explosion." Bill, her perpetually apprehensive accomplice, hunched over the sticky tabletop of their cramped apartment. He squinted at a blurry screenshot of the target venue on a budget tablet, a discarded party hat nearby hinting at previous, less successful, reconnaissance. "Pink explosion is mild, Molly. Look at that bouncy castle," Bill grumbled, tapping the screen with a trembling digit. "It's a reinforced bouncy fortress. We’ll be going in there blind, bouncing aimlessly." "It's for four-year-olds, Bill. Sugar is the threat, not security." "Precisely! High-stress, unpredictable! One wrong bounce, we're in unicorn cupcakes and sticky hands." He shuddered theatrically, adjusting his greasy spectacles. The thought alone made him break a sweat. Molly rolled her eyes. "Our target isn't the cupcakes, it's the presents. Operation 'Birthday Bandit' needs precision." She pushed a half-eaten bag of cheese puffs away. "Precision? With Mr. Giggles the Clown lurking? He juggles chainsaws, Molly. Actual chainsaws!" Bill’s voice rose dramatically, terror on his face. "Magic show, you nincompoop. They're plastic. Probably inflatable. Nobody brings real chainsaws to a four-year-old's party." "He has that look. That one-bad-balloon-animal-away-from- snapping look. What if he 'saws the uninvited guest in half'?" "Then we adapt. You distract him. Badger-shaped balloon. I'll sweep gifts." Molly sounded bored with the clown contingency. "A badger? Why?" Bill's voice was a blend of confusion and mild offense. "Obscure. Difficult. Buys time. Imagine his clown face: 'A badger? Sir, poodles and swords only!'" Molly stifled a giggle. "Sedative cookies? My old Aunt Mildred made a killer sedative shortbread back in the day." "No. Ghost-like. Ninja-like. Slip in, grab the boxes, slip out. Before Timmy's monster truck is noticed." "Truck might track us." Bill envisioned a tiny, noisy monster truck convoy breaching their perimeter. "Replace it with a sticky teddy. Parents look cheap." Molly confirmed, without remorse. Bill shuddered. "Monstrous. But effective. What about the parents? The Prosecco-fueled 'Party Patrol'?" "Unicorn Pinata Diversion. A small, accidental ignition. Mass hysteria. Filming flaming unicorn, we vanish." "Flaming unicorn? Genius! Or pyromaniac." Bill stared at her, genuinely impressed and slightly terrified. "Both, usually. Now, for your costume: giant chicken mascot. Nobody questions a giant chicken." "A chicken? Molly, I'm claustrophobic! A pigeon once thought I was turf encroaching!" Bill’s metaphorical feathers ruffled dramatically. "Details, Bill. Details." Molly waved a dismissive hand, already plotting the optimal entry for a large, feathery fowl. The heist of the year was set.
created_at
"2026-02-25T01:00:21.262Z"
2/25/2026, 1:00:21 AM
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1309
moderation_latency
(null)
generation_time
78711
tokens_prompt
77
tokens_completion
17140
native_tokens_prompt
67
native_tokens_completion
13169
native_tokens_completion_images
0
native_tokens_reasoning
12505
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0329426
router
(null)
provider_responses
0
endpoint_id
"16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok
false
latency
1309
model_permaslug
"google/gemini-2.5-flash"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981221-Iiw1LEHU4LZveyyV5Bcm"
upstream_id
(null)
total_cost
0.0329426
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
423
2.32%
Dialogue to Total Word Ratio
Ratio: 58.68%, Deviation: 8.68%
neededClean
false
wordsTotal
438
wordsDialogue
257
1.1601%