NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ95VCF2Z4SY28C9XFV3S3C8
Run Details
Model:
Gemini 2.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 6s
Cost:
$0.031803
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
99.11%
Dialogue to Total Word Ratio
(details)
49.5527%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The air in their cramped workshop hung heavy with the scent of old coffee, machine oil, and Molly’s perpetually forgotten toast. Scattered across the scarred workbench were schematics for bank vaults they’d cracked, blueprints of high-security museums they’d considered, and now, strangely, a crayon-marked floor plan of what appeared to be a suburban house. Bill sat polishing a lockpick, his brow furrowed in concentration, the rhythmic *clink-scrape* the only sound in the room until Molly burst in. She was vibrant and disheveled, clutching a handful of glossy party invitations. Her wild red hair seemed to crackle with an unseen energy as she slapped the invitations down, scattering Bill’s painstakingly organized tools. "I've found our next big score!" Bill slowly raised his gaze, the lockpick glinting dangerously. "Does it involve a bounce house?" Molly swept her arm dramatically, revealing the crayon blueprint. "Better! The Montgomery estate. Barnaby III's sixth birthday. Goldmine! Unprotected! Untapped!" Bill let out a long, suffering sigh, pushing a stray blueprint away. "Molly, professional thieves steal from *adults*. Not Paw Patrol." Molly scoffed, snatching a miniature party hat and cramming it onto her head. She gestured wildly at the map. "Bounty: Consoles, scooters! All gathered in a conservatory next to a petting zoo. An actual petting zoo, I checked!" Bill stared at the peculiar combination, bewildered. "And the plan? Clown and goat?" Molly’s grin widened. Her plan, she explained, involved them crashing the party as hired entertainment. "You'll be 'The Great Balzac,' befuddled magician. I, 'Sparkles,' will be in the conservatory. Quick swap. Sleight of hand." Bill ran a hand over his face. He guessed the swap. "For socks?" "Precisely! Decoy presents. Socks, mostly. Documentaries on paint drying. They'll never know! Genius, Bill! Minimal risk! Maximum reward! Besides, we're teaching them about Delayed Gratification. Or the surprising utility of socks." She gestured grandly to a large duffel bag overflowing with suspiciously brick-shaped packages, underlining her point about the potential loot and their clever deception. Bill groaned, rubbing his temples. He expressed his opinion on the moral low-ground of such an operation, likening the swap of a top-tier drone for a DVD about lichen growth to an elaborate scheme to offload Molly's terrible B-sides from her documentary circuit. The thought of it made his head ache. Molly winked, adjusting her party hat. "A little of both, perhaps. Now, rabbit in a hat... practiced your misdirection?"
created_at
"2026-02-25T01:15:39.882Z"
2/25/2026, 1:15:39 AM
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1481
moderation_latency
(null)
generation_time
65510
tokens_prompt
77
tokens_completion
12679
native_tokens_prompt
67
native_tokens_completion
12713
native_tokens_completion_images
0
native_tokens_reasoning
12169
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0318026
router
(null)
provider_responses
0
endpoint_id
"16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok
false
latency
1481
model_permaslug
"google/gemini-2.5-flash"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771982139-b0RY9Lf6JnCH0EHBWWoR"
upstream_id
(null)
total_cost
0.0318026
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
396
99.11%
Dialogue to Total Word Ratio
Ratio: 30.42%, Deviation: 0.42%
neededClean
false
wordsTotal
401
wordsDialogue
122
49.5527%