NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94JNP7F3Q711MMC5RBHS2D
Run Details
Model:
Gemini 2.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
26.7s
Cost:
$0.016323
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
68.10%
Matches word count
(details)
99.95%
Dialogue to Total Word Ratio
(details)
84.0264%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly gestured wildly at the crumpled drawing spread across the greasy formica of the kitchen table. It was a child's crayon rendition of a bouncy castle with joyful, if anatomically challenged, stick figures. A hastily scribbled ‘X’ marked the location of the presents. "Bill, walk me through this: Target, St. Augustine's Community Hall. Saturday. Six-year-old's birthday bash." She tapped the drawing with a suspiciously sticky finger. Bill, perched precariously on a stool that groaned under his weight, barely contained his glee. "Operation Sugar Rush, Mol. Parents distracted by sugar-fueled toddlers. Bouncy castle, perfect noise dampener. Genius." He beamed, adjusting a tie-dyed bandanna around his head. Molly arched an eyebrow, a stray crumb clinging to it. "Genius? Or disturbing? Who's Barnaby? Why his presents?" "Barnaby's got rich friends. Trust me. Aunt Carol always goes overboard. Last year? A pony. Priceless." Bill’s eyes, usually a dull grey, sparkled with avarice at the memory of potential equestrian heists. "We are not dognapping," Molly declared, flattening the drawing. "We're relieving Barnaby of his LEGOs." "And the cash in the cards! Twenties in glitter-bombed envelopes. That's the real prize, Molly. Liquid assets." He clapped his hands together, bits of dry skin flaking off. Molly sighed, scrubbing at a mysterious stain on the tablecloth. "Security: Parents, 'Sprinkle-Tits' the clown – your words – and pre-schoolers. How do we get in and out, unseen, with Paw Patrol merchandise?" Bill jumped off the stool, nearly knocking over a stack of empty pizza boxes. "I infiltrate as 'Professor Wiggle-Bottom', renowned balloon animal entertainer." He struck a dramatic pose, stretching out his arms. "Your clown college dog looked like a sausage escaping a balloon." Molly deadpanned, recalling the traumatic experience. "My cat-in-a-hat balloon is legendary!" Bill insisted, pulling a deflated rubber tube from his pocket. "Our Trojan horse. Gifts by the cake. I distract. You… do your thing." He started comically twisting the tube, a grotesque, bulbous shape forming. "My 'thing' is stuffing a garbage bag while Barnaby screams about missing his monster truck?" She envisioned the headlines, followed by awkward parole meetings. "Quick! 'Misdirection of delight'. Focus on spectacle." Bill's hands flew, creating an alarming squeak with the balloon. "They won't notice. 'Whoopsie-daisies' from a balloon dog." "And when Barnaby's mum, Brenda, notices her son's happiness has vanished?" Molly pressed, folding her arms. "Diversion! Cake explosion." Bill puffed up the balloon into a chaotic knot. "Fire alarm. Release all balloons! Children into frenzy! Chaos, Molly. Beautiful, sticky, sugary chaos." He finished his creation: a lopsided, barely recognizable duck. Molly stared at the deformed avian, then at Bill's manic grin. "You enjoy planning these ludicrous schemes more than doing them." "The art, Mol. The dance of deceit. Outsmarting six-year-olds and their overprotective parents." He presented the balloon duck with a flourish. "Alright, Professor Wiggle-Bottom. Just protect your 'cat-in-a-hat' identity." She took the duck, poked it, and it promptly deflated with a sad, tiny fart. Bill looked genuinely heartbroken.
created_at
"2026-02-25T00:53:25.838Z"
2/25/2026, 12:53:25 AM
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1280
moderation_latency
(null)
generation_time
26625
tokens_prompt
77
tokens_completion
3494
native_tokens_prompt
67
native_tokens_completion
6521
native_tokens_completion_images
0
native_tokens_reasoning
5765
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0163226
router
(null)
provider_responses
0
endpoint_id
"16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok
false
latency
1280
model_permaslug
"google/gemini-2.5-flash"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771980805-avNSOxCOaMhkWSMUDxyu"
upstream_id
(null)
total_cost
0.0163226
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
68.10%
Matches word count
n/a
neededClean
false
words
486
99.95%
Dialogue to Total Word Ratio
Ratio: 49.90%, Deviation: 0.10%
neededClean
false
wordsTotal
503
wordsDialogue
251
84.0264%