NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94NMSZFJMF3EYWSGA51RY3
Run Details
Model:
Gemini 2.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
32.6s
Cost:
$0.018828
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
75.16%
Matches word count
(details)
8.11%
Dialogue to Total Word Ratio
(details)
41.6328%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped a manicured but grimy finger against the laminated blueprint spread across the rickety card table. The surface wobbled precariously under the weight of a lukewarm coffee cup and what looked suspiciously like a half-eaten cat biscuit. Bill, predictably, was already deep into his pre-heist jitters, nervously adjusting the brim of his faded baseball cap. His eyes, usually a dull grey, were wide with a nervous intensity that Molly found both endearing and endlessly frustrating. He nursed a can of lukewarm fizzy orange drink, the condensation leaving faint, smudged rings on their carefully laid-out intel documents. The dingy apartment, crammed with various half-baked inventions and forgotten disguises, always smelled faintly of stale pizza and the unique brand of desperation that clung to small-time criminals. Dust motes danced in the single shaft of weak afternoon light filtering through the grimy windowpane, illuminating their audacious plans for a sunny suburban afternoon. "Details, Bill," Molly stated, her voice a low, gravelly command. Bill cleared his throat, nearly choking on a burp of fizzy orange. "Target acquired. 14 Maple Drive, Picket-Fence Estates. Occupant is one Penelope 'Pony Princess' Smith. Age five. Birthday 'gala' in progress." He gestured vaguely at the map with his can top. "Many assets are wrapped." Molly’s green eyes narrowed, a glint appearing in their depths. "Wrapped is good. Less forensic glitter. Perimeter complications?" "Complications? It's a warzone, Molly," Bill whined, slumping slightly in his chair. "Bouncy castle. Operational. Clown. Highly mobile threat." "A clown?" Molly arched an eyebrow, unimpressed. "Joking?" "No joke. Balloons. Unicycle. Menacing," he insisted, shivering dramatically as if the thought alone was giving him chills. Molly considered this, rubbing her chin thoughtfully. "Potential for distraction. Good. Present drop-off zone secure?" "Designated gift table in living room, near sliding glass doors. Low security, high visibility from kitchen-gaggle of moms and dads," Bill reported, his voice briefly regaining a sliver of his professional demeanor. "Exit strategy. No screaming kids, no angry parents, definitely no unnerved clowns on unicycles." Molly leaned forward, her voice dropping to a conspiratorial whisper. "My plan: I go in as a saccharine kid-entertainer. You, my friend, create a bouncy castle diversion." Bill’s eyes practically bulged out of his head, his can nearly slipping from his grasp. "Me? On the castle? My knees, Molly! My pride!" He clutched his chest dramatically. "No, you get *on* it," Molly clarified, a predatory smile playing on her lips. "Make a big, boisterous, embarrassing scene. Draw parents. While they’re all distracted, I grab goods, rendezvous at the van." "But what if I fall? The shame, Molly, the eternal shame!" "You won't fall. You’re a professional. Just don't land on any actual children. That complicates the exfil." Bill sighed heavily, a gust of resignation escaping him. "Alright. Bouncy castle diversion. Tactical flailing. But if I sprain something serious, you’re buying me industrial-strength muscle rub." "Done. Now, about the 'Magical Mandy' disguise," Molly mused, already sketching ideas on the back of a pizza box. "Glitter glue, a truly cheap wand, and a smile that absolutely screams, 'I'm judging all your life choices right now'."
created_at
"2026-02-25T00:55:03.237Z"
2/25/2026, 12:55:03 AM
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1542
moderation_latency
(null)
generation_time
32542
tokens_prompt
77
tokens_completion
3369
native_tokens_prompt
67
native_tokens_completion
7523
native_tokens_completion_images
0
native_tokens_reasoning
6771
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0188276
router
(null)
provider_responses
0
endpoint_id
"16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok
false
latency
1542
model_permaslug
"google/gemini-2.5-flash"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771980903-C2zcTrkbuPhAnFbVuwpA"
upstream_id
(null)
total_cost
0.0188276
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
75.16%
Matches word count
n/a
neededClean
false
words
513
8.11%
Dialogue to Total Word Ratio
Ratio: 42.91%, Deviation: 7.09%
neededClean
false
wordsTotal
522
wordsDialogue
224
41.6328%