NC Bench
Run: 2026-02-25-01KJ95BQ7H7WC7YGPKW5R450AF
Run Details
Model: Gemini 2.5 Flash (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 16.5s
Cost: $0.008830
Provider: Google AI Studio
Evaluation summary
Score     Evaluator
0.00%     Matches word count
4.52%     Dialogue to Total Word Ratio
2.2616%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill unfolded the crinkled blueprint across the coffee table, poking at a crudely drawn circle. "So, let me get this straight, Molly. The target isn't a bank vault. It's not a museum. It's…"
"The legendary 'Happy Hippo Fun Zone'!" Molly chirped, tapping an excited finger on the same spot. "And inside, Bill, a treasure trove beyond imagination! The ultimate score!"
"A treasure trove of plastic ponies and slightly damp teddy bears, Molly. We’re talking about a six-year-old’s birthday party. Who even tips us off to this?" Bill grumbled, pinching the bridge of his nose. "Was it a disgruntled clown?"
"Details, details! What matters is the intelligence: Penelope 'Penny' Buttercup, daughter of the Buttercup Toys magnate. Her sixth birthday. The guests? All the wealthy parents trying to outdo each other, naturally. Imagine the haul, Bill! Top-tier, untouched merchandise!" Molly practically vibrated with anticipation.
"And how, exactly, do we penetrate this fortress of glitter and juice boxes?" Bill gestured vaguely at the blueprint. "They’ve got a bouncy castle. A ball pit. For crying out loud, the main 'vault' is probably just a table in the corner draped with a tablecloth."
"Precisely! That's where our unique talents come into play," Molly grinned, pulling out a small, highly detailed sketch. "Phase one: Infiltration via proxy. You, my dear Bill, will be disguised as Barnaby Bumbles, the world-weary children's entertainer."
Bill stared at the sketch. It depicted a man in an ill-fitting bee costume. "You want me to… put on the bee outfit? The one with the slightly deflated stinger?"
"It’s for camouflage! You’ll blend right in. While you’re distracting the pint-sized security detail with your 'unbearable' puns and balloon animals, I’ll be establishing our extraction route. I’ll be disguised as a 'parent volunteer' – a keen eye for organization, an even keener eye for gift bags."
"My puns are not unbearable, Molly. They're profoundly groan-worthy, which is an art form," Bill corrected, though the thought of it made him wince. "But fine. I'm Barnaby. What's the contingency if one of these tiny terrors clocks me? Or, worse, demands a specific animal made from a balloon and I only know 'sausage dog'?"
"Diversionary tactics," Molly said with a theatrical flourish, pulling out a small, high-powered confetti canon. "This little beauty is loaded with hypoallergenic, biodegradable, extra-shiny confetti. Good for thirty seconds of pure, unadulterated chaotic distraction. Enough time for you to… improvise."
"Improvise. Right. And the presents? The actual objective?" Bill pointed a finger at the present table on the map.
"The ultimate prize. They'll be moved to the Buttercups' car, parked just outside the back exit, for transport home. Our window," Molly explained, tapping the small car icon. "A quick swap in the confusion, a decoy bag filled with actual rocks and empty cereal boxes, and we're gone before anyone notices their designer superhero playset has been replaced with a slightly dented tin of shortbread."
"Shortbread? Molly, these are kids. They'll notice a swap like that instantly. They have laser vision for new toys," Bill retorted, shaking his head. "And what about surveillance? Overzealous parents with camera phones? Grandma Mildred with her eagle eyes?"
"My dear Bill, every professional knows how to handle a challenge. Besides," Molly winked, "who expects a daring high-stakes heist at a party themed 'Sparkling Unicorn Wonderland'?"
Bill sighed, tossing the bee costume sketch back onto the table. "You know, for all the planning, I just have a feeling this is going to end with us covered in cake frosting and glitter."
"Only if we're doing it right, darling," Molly grinned, already packing the confetti canon. "Only if we're doing it right."
created_at: "2026-02-25T01:07:06.615Z" (2/25/2026, 1:07:06 AM)
model: "google/gemini-2.5-flash"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1317
moderation_latency: (null)
generation_time: 16435
tokens_prompt: 77
tokens_completion: 2472
native_tokens_prompt: 67
native_tokens_completion: 3524
native_tokens_completion_images: 0
native_tokens_reasoning: 2643
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0088301
router: (null)
provider_responses:
  0:
    endpoint_id: "16c01e26-9b91-4d22-b487-ed35634b017d"
    is_byok: false
    latency: 1317
    model_permaslug: "google/gemini-2.5-flash"
    provider_name: "Google AI Studio"
    status: 200
api_type: "completions"
id: "gen-1771981626-o8E43SPhtDzFFxPSeOtw"
upstream_id: (null)
total_cost: 0.0088301
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
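The token and cost fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the per-token rates are an assumption (they are not stated on this page) chosen because they reproduce the reported total_cost, and it also assumes that native_tokens_reasoning is included in native_tokens_completion and that generation_time is in milliseconds. The helper name estimate_cost is hypothetical.

```python
# Assumed USD rates per token; NOT stated on this page.
PROMPT_RATE = 0.30 / 1_000_000
COMPLETION_RATE = 2.50 / 1_000_000

# Values copied from the Meta fields above.
native_tokens_prompt = 67
native_tokens_completion = 3524
native_tokens_reasoning = 2643
generation_time_ms = 16435  # assumed to be milliseconds


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the run cost in USD under the assumed rates."""
    return prompt_tokens * PROMPT_RATE + completion_tokens * COMPLETION_RATE


cost = estimate_cost(native_tokens_prompt, native_tokens_completion)

# Tokens left for the visible answer if reasoning tokens are part of the
# completion count (an assumption).
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Rough throughput over the whole generation, reasoning included.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"estimated cost: ${cost:.7f}")
print(f"visible completion tokens: {visible_tokens}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
```

Under these assumptions the estimate agrees with the reported total_cost of 0.0088301, and most of the completion budget is reasoning rather than visible text.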
Evaluation details
Result    Evaluator                      Details                           Meta Data
0.00%     Matches word count             n/a                               neededClean: false, words: 601
4.52%     Dialogue to Total Word Ratio   Ratio: 77.87%, Deviation: 7.87%   neededClean: false, wordsTotal: 610, wordsDialogue: 475
2.2616%   Overall
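The reported ratio and deviation follow directly from the word counts in the Meta Data column. The sketch below reproduces that arithmetic and adds a naive quote-based counter for illustration. The function dialogue_ratio is hypothetical and is not the evaluator's actual method, which this page does not describe. Treating the overall score as the mean of the two evaluator scores is likewise an assumption that merely fits the reported figures, and the mapping from deviation to the 4.52% score is not shown here, so it is not modelled.

```python
import re


def dialogue_ratio(text: str) -> float:
    """Share of whitespace-separated words that sit inside straight double quotes.

    A naive illustration only; curly quotes and nested quotes are not handled.
    """
    total = len(text.split())
    spoken = sum(len(m.split()) for m in re.findall(r'"([^"]*)"', text))
    return spoken / total if total else 0.0


# Values copied from the evaluation details above.
words_total = 610
words_dialogue = 475
target_ratio = 0.70  # from the scenario: 70% dialogue

ratio = words_dialogue / words_total
deviation = abs(ratio - target_ratio)

# Assumption: the overall score is the plain mean of the evaluator scores.
overall = (0.00 + 4.52) / 2

print(f"ratio: {ratio:.2%}, deviation: {deviation:.2%}")
print(f"overall (from rounded scores): {overall:.2f}%")

sample = 'Bill sighed. "Only if we\'re doing it right," Molly grinned.'
print(f"sample ratio: {dialogue_ratio(sample):.2f}")
```

The mean of the rounded scores gives 2.26%, which fits the reported 2.2616% if the dialogue score is slightly above 4.52% before rounding.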