NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94HHNNCXEK367DP2R4ECVN
Run Details
Model:
Gemini 2.5 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
11.6s
Cost:
$0.000954
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
93.65%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
46.8256%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly ran a finger along a faded diagram of what appeared to be a bouncy castle. Her hideout smelled faintly of cheap gin and shattered dreams. Across the table, Bill gnawed thoughtfully on a pretzel stick. "So, we're really going through with this," Bill said, his voice muffled by pastry. "Stealing presents from a bunch of seven-year-olds?" "Not any seven-year-olds, Bill. Bartholomew Butterfield the Seventh. His parents are, shall we say, *financially robust*. This isn't about sentiment; it's about acquisition. And their security system is surprisingly lax after the fourth juice box is consumed." Molly tapped the diagram. "The primary cash crop, or rather, present pile, will be located in the conservatory, post-cake ceremony." "Right, the conservatory. So, we just waltz in amongst the sticky-handed cherubs and… what? Ask them nicely if we can borrow their new toys?" "Don't be ridiculous. We pose as professional gift-wrappers hired at the last minute. Turns out ‘Brenda from Fancy Wraps’ had a sudden, glitter-induced allergic reaction. We're their emergency replacement. People rarely question efficiency, especially when sugar-addled children are involved." Bill snorted his pretzel. "So, I'm Bartholomew's new favourite gift wrapper? What's my job? Make it look pretty while you pocket the PS5?" "Precisely. You'll be in charge of present distribution to the children immediately after the unwrapping frenzy. Keep them occupied, keep them happy, and most importantly, keep them *away* from the main haul as I extract it. I’ve got these custom, novelty goody bags. They look innocent, but they’re reinforced with Kevlar and have secret compartments." "Kevlar? For a bouncing castle party? Are we expecting miniature ninjas?" "Tactical advantage, Bill. And while you're distributing my Kevlar goody bags, you can initiate the diversion." "Diversion? What kind of diversion? A magic show where the trick is that all the presents disappear?" "Hardly. Too obvious. No, a sudden, spontaneous outbreak of… interpretive dance. You'll be the lead. Think Swan Lake meets a seizure. It’ll shock them into paralysis. Gives me a clean two minutes." Bill puffed out his chest. "Interpretive dance. I think I saw that on a documentary once. Do I get an outfit?" "The outfit is your standard blue overalls, Bill. We're going for low-key, 'working professional'. The dancing serves the purpose. Meanwhile, I'll be working the ventilation shaft. It's small, dusty, and crucially, leads directly to the conservatory's ceiling tiles. I’ll drop down, swap out the presents for these weighted, hollow replicas," Molly produced a stack of suspiciously light boxes, "and make my exit before anyone notices a slight shift in gravitational consistency." "Weighted? So they look like good presents until you try to open them?" "Exactly. Barty will be too busy watching your 'interpretive dance of doom' to notice." Molly raised an eyebrow. "Just… try not to spontaneously combust. And for heaven's sake, Bill, no actual ninjas or sea shanties. Stick to the plan." Bill nodded, a faraway look in his eyes. "Swan Lake. Got it. Will there be applause?"
created_at
"2026-02-25T00:52:48.956Z"
2/25/2026, 12:52:48 AM
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1194
moderation_latency
(null)
generation_time
11582
tokens_prompt
77
tokens_completion
1612
native_tokens_prompt
67
native_tokens_completion
2367
native_tokens_completion_images
0
native_tokens_reasoning
1638
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0009535
router
(null)
provider_responses
0
endpoint_id
"ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok
false
latency
1194
model_permaslug
"google/gemini-2.5-flash-lite"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771980768-AUGcUE7T6uetGiM3N2BB"
upstream_id
(null)
total_cost
0.0009535
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
93.65%
Matches word count
n/a
neededClean
false
words
491
0.00%
Dialogue to Total Word Ratio
Ratio: 84.83%, Deviation: 14.83%
neededClean
false
wordsTotal
501
wordsDialogue
425
46.8256%