NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ95RJQEABZ31N621P0N43S7
Run Details
Model:
Gemini 2.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
28.2s
Cost:
$0.016488
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
7.41%
Dialogue to Total Word Ratio
(details)
3.7027%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"You're telling me," Bill grumbled from the dirty van, binoculars pressed to his face, "that after actual gold bullion, our next score is... little Kyle's fourth birthday presents?" Molly, tapping impatiently on the dashboard, chirped, "Don't be a pessimist, Billy-boy! Look at the volume! Dozens of boxes, each a future choking hazard of untold value." "Untold value translates to a fiver at a car boot sale," Bill muttered, lowering his binoculars. "Molly, this feels wrong. Even for us. Stealing from toddlers?" "They'll never miss it," Molly declared. "Think of it as wealth redistribution. Plus, the *challenge*. High-security playpen, a gauntlet of sticky fingers and parents high on sugar." "High-security? It's a bouncy castle and a clown named Fizzy-Wizzy," Bill scoffed. "The clown looks like he's already had too many Fizzy-Wizzys." "Exactly! Chaotic element!" Molly’s eyes gleamed. "No predictable patterns. We need a distraction. Something big. Something... fluffy." She gestured towards a pink unicorn balloon bobbing inside the party tent. "No 'Giant Unicorn Diversion'," Bill said. "Last time, Mrs. Henderson's poodle got stuck. We didn't get the Fabergé egg." "Details! This time, it's bolder. My plan: you, disguised as an overlarge party guest, 'lost' and looking for the bathroom. While everyone's distracted, I'll sneak in through the kitchen. Known weak point." "Overlarge party guest?" Bill repeated, aghast. "Molly, I'm forty-seven. Blending in with kindergarteners?" "Your costume's adorable," Molly insisted, pulling out a brightly coloured party hat. "Primary colours, a cap, a lollipop. While you charm the parents, I'll be in the ‘kill zone’." "Kill zone?" Bill whimpered, surveying the innocent garden party. "This is a children’s party, Molly. Not a bank vault." "The principles are identical: locate valuables, neutralize opposition – prevent little ones from seeing us – extract goods. I’ve even got a special 'present sack'. Disguised as a giant teddy bear, naturally." "A giant teddy bear, in summer," Bill sighed. "What if little Kyle catches us?" "Simple," Molly grinned, adjusting her rainbow wig. "We're Santa's helpers, doing an early sweep. Quality control. Builds character, really." Bill just stared at her. "We've hit rock bottom, haven't we?" "Only if 'a lucrative haul of pre-loved plastic' counts as rock bottom," Molly chirped, humming. "Now, code word if trouble strikes: 'Fizzy-Wizzy Needs a Wee-Wee'." "I'm jumping out of this van," Bill mumbled, but he was already reaching for the party hat.
created_at
"2026-02-25T01:14:07.988Z"
2/25/2026, 1:14:07 AM
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1597
moderation_latency
(null)
generation_time
28155
tokens_prompt
77
tokens_completion
2909
native_tokens_prompt
67
native_tokens_completion
6587
native_tokens_completion_images
0
native_tokens_reasoning
5958
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0164876
router
(null)
provider_responses
0
endpoint_id
"16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok
false
latency
1589
model_permaslug
"google/gemini-2.5-flash"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771982047-aqWvbC0GsHPscbXUKIca"
upstream_id
(null)
total_cost
0.0164876
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
387
7.41%
Dialogue to Total Word Ratio
Ratio: 77.22%, Deviation: 7.22%
neededClean
false
wordsTotal
395
wordsDialogue
305
3.7027%