NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94T4BFYP7WZHK3V3GQRS3E
Run Details
Model:
Gemini 2.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
34.7s
Cost:
$0.020108
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
26.83%
Dialogue to Total Word Ratio
(details)
13.4171%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly’s polished finger tapped a crayon-scribbled diagram, a faint sheen of grease from a forgotten doughnut still clinging to her nail. The blueprints for "Sir Reginald's Magical Medieval Extravaganza" were spread across the dented metal table. Bill, slouched opposite her, chewed on a suspiciously clean pen. "Are you absolutely serious about this, Bill?" Molly asked, her voice a low growl of incredulity. "A bouncy castle perimeter? For a nine-year-old's birthday?" "It's a tactical nightmare, Molly, I know. Think of the shrieking children. But the intel is clear: 'Princess Sparklehoof and her Enchanted Carriage' is the prime present location." Molly rubbed her temples, a familiar headache brewing. This wasn't exactly their usual high-stakes art gallery job. "We're stealing from a child with a pony, Bill. A pony named Princess Sparklehoof, no less." "The pony isn’t the target," Bill insisted, "the presents are. High-value plastic, collectible action figures, the works. And the challenge! That bouncy castle will be legendary." A cold gust of wind rattled their grimy window, making the absurd blueprint flutter. The diagram showed cartoon castles and a "lava pit" near the "loot cavern." "And the parents? Our primary security is a swarm of highly protective, potentially wine-spiked adults. Plus Sir Reginald’s 'Dragon Flyer' drone has thermal imaging. We need a diversion that grounds that thing." "A smoke bomb," Bill suggested, his eyes brightening. "Disguised as a cake explosion. Blinds the drone, confuses the parents." "No glitter bombs, Bill. We’re thieves, not party poopers. What about the 'Dragon' mascot? The teenager in the costume?" "Gary? He's in. Fifty quid, and he won't sing 'Happy Birthday' again. Says it crushes his soul." Molly sighed, marking the 'Dragon' with a check. It was always the simple-minded muscle that kept things moving. She moved her finger to a crudely drawn figure labeled 'Aunt Hilda'. "Aunt Hilda, present-guarding duty?" Molly inquired. "The fuchsia-hatted knitting enthusiast?" "The very same. Obsessed with competitive crochet. We send a rare yarn swatch of 'The Weaver's Anguish'. Timed delivery. That’ll tie her up. Literally." "Excellent. Access point? Kitchen window, still unlocked for 'wizard duels'?" "Confirmed. They take their pretend very seriously. And egress? Not the treehouse zip line, please." "No drama. Quiet out the kitchen window, with sacks of high-value plastic. And Bill, packaging for the unboxing videos. These kids are merciless critics." Molly leaned back, a predatory grin growing. For a children’s party, this was proving to be a surprisingly intricate, and deeply silly, operation. "Details, Molly. Details make the difference." "Right. So, summary: smoke bomb to distract, bribe Gary, yarn-bomb Aunt Hilda, secure presents, egress quietly." "And avoid the glitter, Molly. Always, always avoid the glitter." Bill nodded, a wide, enthusiastic grin on his face. Molly just shook her head.
created_at
"2026-02-25T00:57:30.229Z"
2/25/2026, 12:57:30 AM
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1435
moderation_latency
(null)
generation_time
34687
tokens_prompt
77
tokens_completion
3178
native_tokens_prompt
67
native_tokens_completion
8035
native_tokens_completion_images
0
native_tokens_reasoning
7347
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0201076
router
(null)
provider_responses
0
endpoint_id
"16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok
false
latency
1435
model_permaslug
"google/gemini-2.5-flash"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981050-hy1iUUYu2ioB55omaN1R"
upstream_id
(null)
total_cost
0.0201076
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
453
26.83%
Dialogue to Total Word Ratio
Ratio: 64.87%, Deviation: 5.13%
neededClean
false
wordsTotal
464
wordsDialogue
301
13.4171%