NC Bench
Run: 2026-02-25-01KJ9587WSPQYSPQJ1EHYR3MW0
Run Details
Model: Gemini 2.5 Flash (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 26.8s
Cost: $0.014198
Provider: Google AI Studio
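The reported cost can be cross-checked against the native token counts recorded in this run's response metadata (native_tokens_prompt 67, native_tokens_completion 5671, native_tokens_reasoning 5030). The following is a minimal sketch, not the benchmark's own accounting. The per-million-token rates are assumed values that do not appear anywhere in this run record, the function and constant names are hypothetical, and the breakdown assumes that reasoning tokens are counted inside native_tokens_completion.

```python
# Assumed prices in USD per million tokens; NOT taken from this run record.
ASSUMED_PROMPT_RATE = 0.30
ASSUMED_COMPLETION_RATE = 2.50


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    return (prompt_tokens * ASSUMED_PROMPT_RATE
            + completion_tokens * ASSUMED_COMPLETION_RATE) / 1_000_000


def visible_tokens(completion_tokens: int, reasoning_tokens: int) -> int:
    """Completion tokens left over once reasoning tokens are subtracted."""
    return completion_tokens - reasoning_tokens


# Values copied from the run metadata.
native_prompt = 67
native_completion = 5671
native_reasoning = 5030

cost = estimate_cost(native_prompt, native_completion)
visible = visible_tokens(native_completion, native_reasoning)
reasoning_share = native_reasoning / native_completion

print(f"estimated cost: ${cost:.7f}")
print(f"visible completion tokens: {visible}")
print(f"reasoning share of completion: {reasoning_share:.1%}")
```

Under these assumed rates the estimate agrees with the reported total_cost of 0.0141976, and the breakdown suggests that most of the completion tokens went to reasoning rather than to the visible scene.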
Evaluation summary
Score | Evaluator
27.17% | Matches word count
29.24% | Dialogue to Total Word Ratio
28.2011% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly tapped a manicured nail against the laminated blueprints, a map distinctly unsettling in its innocence: a suburban backyard, dominated by an enormous bouncy castle. From the cramped confines of their usual ops room, a dusty attic space above a forgotten bakery, the plan felt particularly surreal. Bill, meanwhile, practically vibrated with excitement, gesturing wildly with a half-eaten jelly doughnut towards various hand-drawn X-marks on the glossy paper. He treated this target with the same gravity usually reserved for state treasuries, a fact Molly found increasingly alarming. "Are you serious, Bill? A bouncy castle party?" Molly deadpanned, raising a perfectly sculpted eyebrow. Bill beamed, crumbs clinging to his chin like tiny, sugary barnacles. "High-value target, Molly! Think of the haul: consoles, pricey LEGO. Barnaby Hastings: the Bezos of pre-school. His parents are loaded. Intelligence suggests a new-gen console already wrapped and stashed near the cake stand, Prime real estate." He pointed with his doughnut, nearly smearing jam across the 'snack station' on the map. Molly sighed, adjusting the elaborate floral scarf masking the intricate array of lock-picking tools strapped to her wrist. Her partner was a master of his craft, but his enthusiasm often outstripped his common sense. "So your genius plan has us infiltrating a toddler dance-off, dressed as a clown and a unicorn, to steal plastic fire trucks?" "Clown and unicorn for infiltration, yes. Total disarming presence. Nobody suspects a unicorn. It's practically invisible! And the 'fire trucks' are merely cover. We're after the console, the big score, the legendary 'PlayStation 6'!" Bill insisted, pulling a shimmering, iridescent unicorn headpiece from a duffel bag and placing it with reverence on the worn wooden table, next to their usual arsenal of night vision goggles and thermal scanners. She stared at the horn, a gleaming plastic menace, then back at the crude drawings. 
"Nobody suspects a grown woman sweating profusely in a polyester horn helmet trying to pick a lock on a toy chest, either? What about the actual kids? The screaming? The snot?" Bill waved a dismissive hand, nearly knocking over a carefully constructed miniature model of the bouncy castle made from sugar cubes and pipe cleaners. "Minor tactical hazards. We deploy the 'Sleeping Cloud' via ventilation – perfectly safe, non-toxic, strawberry-scented. They’ll just think it’s a lovely afternoon nap. Swift and silent extraction, like a well-oiled machine." Molly pinched the bridge of her nose, fighting the urge to tell him the machine was likely coated in glitter and jam. Of course it was strawberry-scented. This was going to be an operation for the ages, if only for the sheer, unadulterated absurdity of it all. Still, a job was a job, and a new-gen console for their fence certainly paid better than, say, a year's supply of Duplo bricks. She glanced at the garish unicorn headpiece. This would require more gin than usual, and perhaps a highly advanced decontamination shower afterwards.
Meta
created_at: "2026-02-25T01:05:12.606Z" (2/25/2026, 1:05:12 AM)
model: "google/gemini-2.5-flash"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1422
moderation_latency: (null)
generation_time: 26727
tokens_prompt: 77
tokens_completion: 5409
native_tokens_prompt: 67
native_tokens_completion: 5671
native_tokens_completion_images: 0
native_tokens_reasoning: 5030
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0141976
router: (null)
provider_responses:
  0:
    endpoint_id: "16c01e26-9b91-4d22-b487-ed35634b017d"
    is_byok: false
    latency: 1421
    model_permaslug: "google/gemini-2.5-flash"
    provider_name: "Google AI Studio"
    status: 200
api_type: "completions"
id: "gen-1771981512-KJQJBz6akV4ydl3qfFpR"
upstream_id: (null)
total_cost: 0.0141976
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
Evaluation details
Result | Evaluator | Details | Meta Data
27.17% | Matches word count | n/a | neededClean: false; words: 481
29.24% | Dialogue to Total Word Ratio | Ratio: 34.96%, Deviation: 4.96% | neededClean: false; wordsTotal: 492; wordsDialogue: 172
28.2011% | Overall | |
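The ratio and deviation reported for the dialogue evaluator follow directly from its word counts. The sketch below recomputes them from wordsDialogue and wordsTotal against the 30% target named in the scenario. The function names are hypothetical, and how the evaluator turns the deviation into its 29.24% score is not stated in this report, so that step is not reproduced. Treating the overall score as the plain mean of the two evaluator scores is likewise an assumption; it lands within rounding distance of the reported 28.2011%.

```python
def dialogue_ratio(words_dialogue: int, words_total: int,
                   target: float = 0.30) -> tuple[float, float]:
    """Return (dialogue share, absolute deviation from the target share)."""
    ratio = words_dialogue / words_total
    deviation = abs(ratio - target)
    return ratio, deviation


def mean_score(scores: list[float]) -> float:
    """Plain average of evaluator scores (assumed aggregation, not confirmed)."""
    return sum(scores) / len(scores)


# Counts copied from the evaluator meta data.
ratio, deviation = dialogue_ratio(words_dialogue=172, words_total=492)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 34.96%, Deviation: 4.96%"

# Scores as displayed (already rounded), so the mean is only approximate.
overall = mean_score([27.17, 29.24])
print(f"Approximate overall: {overall:.2f}%")
```

Note that the two evaluators report different totals for the same response (481 words versus 492), so the ratio above uses the dialogue evaluator's own total.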