NC Bench
Run: 2026-02-25-01KJ95RT8P7QQ40Y6PSV9ZFFT3
Run Details
Model: Gemini 2.5 Flash (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 36.8s
Cost: $0.021883
Provider: Google AI Studio
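The duration and cost of this run can be related to the token counts recorded in the run's generation metadata (`native_tokens_completion`, `native_tokens_reasoning`, `generation_time`, `total_cost`). The sketch below is illustrative only: the helper name `summarize_generation` is hypothetical, and it assumes that `native_tokens_completion` includes the reasoning tokens, which the page does not state explicitly.

```python
def summarize_generation(native_completion, native_reasoning, generation_ms, cost_usd):
    """Derive rough per-run statistics from the reported metadata.

    Assumption (not confirmed by the page): native_completion already
    includes native_reasoning, so the visible answer is the difference.
    """
    visible = native_completion - native_reasoning
    return {
        "visible_tokens": visible,
        "reasoning_share": native_reasoning / native_completion,
        "tokens_per_second": native_completion / (generation_ms / 1000),
        "usd_per_visible_token": cost_usd / visible,
    }


# Values copied from this run's metadata.
stats = summarize_generation(
    native_completion=8745,
    native_reasoning=8007,
    generation_ms=36737,
    cost_usd=0.0218826,
)

print(stats["visible_tokens"])              # 738
print(f"{stats['reasoning_share']:.1%}")    # share of output spent on reasoning
print(f"{stats['tokens_per_second']:.0f}")  # generated tokens per second
```

Under that assumption, most of the completion tokens billed for this run went to reasoning rather than to the visible scene.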
Evaluation summary
Score | Evaluator
43.38% | Matches word count
0.01% | Dialogue to Total Word Ratio
21.6922% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly jabbed a decisive finger at the heavily annotated floor plan. The map, scribbled with crayon-colored arrows and cartoon clowns, was spread across their rickety card table, biscuit crumbs clinging to the edges. “Right, the target: the Montgomery estate tonight. Little Persephone’s ‘Enchanted Forest’ themed party.” “Persephone?” Bill choked on his lukewarm instant coffee, a sputtering sound of pure dismay. He wiped a hand across his mouth, leaving a faint streak of industrial-grade glitter. “Good Lord, Molly, what kind of child even answers to ‘Persephone’?” “The kind whose parents have entirely too much money. Which, for us, means *presents*.” Molly tapped a vibrant purple circle marked ‘Gift Grotto’, a section of the sprawling tent where a mountain of children's treasures was anticipated. “Estimated haul: fifty high-end toys and enough remote control cars for a small, chaotic Grand Prix.” Bill shivered, hugging his mug like a shield against the imagined horrors of the party. “But… a children's party, Molly? It’s a minefield of sugar-fueled toddlers. Stealing a Barbie Dreamhouse from a child who still believes in the Tooth Fairy?” Molly rolled her eyes, a gesture honed by years of Bill’s particular ethical quandaries. “Don’t be sentimental, Bill. Replacements will be had. It's a high-stakes challenge: navigating bouncy castles and potentially agitated petting zoo goats. The presents are just the appetizer. The real prize is the legendary 'Golden Unicorn' piñata." Bill blanched, his face losing its already meager color. “The piñata? Molly, that’s just cheap sweets and probably angry wasps.” “It’s not *just* sweets. It's the sheer audacity! Intel says it’s got a solid gold horn – a family heirloom. Our biggest threat? Bartholomew, the party clown.” Molly traced a crooked line, labeled ‘Sneaky Shortcut,’ from the back gate, past suspiciously manicured kale patches, towards the enormous striped marquee. Bill shuddered, a tremor evident in his entire frame. 
“Bartholomew? The one with the surprisingly accurate throwing arm?” “The very same. He’s near the present pile. Our plan requires precision, subtlety, and a robust disguise.” Molly pulled out a crude, deeply unsettling drawing of Bartholomew, all exaggerated shoes and an unsettling grin. “A robust disguise that won’t involve me in a frilly tutu, I trust?” Bill asked, eyeing the clown with a distinct unease, clearly reconsidering his career choices. Molly let out a short, mirthless chuckle, already adjusting invisible spectacles. “Better. You’re going as ‘Professor Poofle’, the esteemed bubbleologist. I’ll be your assistant, ‘Sparkle the Fairy’. Total immersion, Bill. Under a particularly robust bubble cannon demonstration, we acquire the goods. And the Golden Unicorn's horn.” Bill groaned, rubbing his temples with both hands, his professional dignity clearly crumbling. “Professor Poofle and Sparkle the Fairy. I knew I should have picked up advanced lock-picking instead of that intensive balloon folding seminar. Molly, are you absolutely sure this isn’t truly undignified?” “Undignified? Bill, we once stole a particularly rare vintage teapot from a cat show, involving a carefully orchestrated feline riot. Nothing is undignified when there’s a solid gold unicorn horn involved.” Molly stood up, already embodying her role, her voice a theatrical whisper. “Now, where did I put my industrial-strength glitter cannon?”
created_at: "2026-02-25T01:14:15.707Z" (2/25/2026, 1:14:15 AM)
model: "google/gemini-2.5-flash"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1410
moderation_latency: (null)
generation_time: 36737
tokens_prompt: 77
tokens_completion: 4425
native_tokens_prompt: 67
native_tokens_completion: 8745
native_tokens_completion_images: 0
native_tokens_reasoning: 8007
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0218826
router: (null)
provider_responses:
  0:
    endpoint_id: "16c01e26-9b91-4d22-b487-ed35634b017d"
    is_byok: false
    latency: 1410
    model_permaslug: "google/gemini-2.5-flash"
    provider_name: "Google AI Studio"
    status: 200
api_type: "completions"
id: "gen-1771982055-m8yZmAWKUrso0n0FaWmd"
upstream_id: (null)
total_cost: 0.0218826
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
Evaluation details
Result | Evaluator | Details | Meta Data
43.38% | Matches word count | n/a | neededClean: false, words: 517
0.01% | Dialogue to Total Word Ratio | Ratio: 63.93%, Deviation: 13.93% | neededClean: false, wordsTotal: 524, wordsDialogue: 335
21.6922% | Overall
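The ratio and deviation reported for the dialogue evaluator follow directly from `wordsDialogue` and `wordsTotal` and the scenario's 50% target. The sketch below reproduces that arithmetic. The function name `dialogue_ratio` is hypothetical, and the page does not document how the deviation is converted into the 0.01% score, so that step is deliberately left out. The final lines check only that the overall figure looks consistent with a simple mean of the two evaluator scores, which is an assumption rather than something the page confirms.

```python
def dialogue_ratio(words_dialogue, words_total, target=0.5):
    """Return (ratio, absolute deviation from the target ratio)."""
    ratio = words_dialogue / words_total
    deviation = abs(ratio - target)
    return ratio, deviation


# Values copied from the evaluator's meta data for this run.
ratio, deviation = dialogue_ratio(words_dialogue=335, words_total=524)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# Ratio: 63.93%, Deviation: 13.93%

# Assumption: the overall score is the mean of the evaluator scores.
# The displayed scores are rounded, so only approximate agreement with
# the reported 21.6922% is expected.
mean_of_displayed = (43.38 + 0.01) / 2
print(f"{mean_of_displayed:.3f}")
```

The response overshot the requested dialogue share by roughly 14 percentage points, which matches the near-zero score on that evaluator.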