NC Bench
Run: 2026-02-25-01KJ95RBHHZRRJRRZW978BQDAA
Run Details
Model:
Gemini 2.5 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
1m 10s
Cost:
$0.006182
Provider:
Google AI Studio
Evaluation summary
Score     Evaluator
0.00%     Matches word count
10.95%    Dialogue to Total Word Ratio
5.4771%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the battered binoculars, the cracked lenses offering a magnified, slightly distorted view of suburban bliss. The Henderson residence glittered under the midday sun, its garden a verdant expanse punctuated by a violently cheerful bouncy castle – a colossal, inflatable amoeba threatening to engulf the manicured lawn. Through her optics, Molly catalogued the key assets: a churning mass of minuscule, sugar-fortified humans, a trestle table groaning under the weight of brightly wrapped potential. The air vibrated with the deafening symphony of innocent glee. She lowered the binoculars with a decisive click. Bill, perpetually preoccupied with the existential dread of a loose thread on his jumper, offered a vague grunt. Pilfering presents from a birthday party was a new low, even for him. The sheer volume of unopened gifts – a glittering pyramid of consumerist hope, promising hours of juvenile ecstasy – held Molly's professional focus. This wasn't about the thrill of priceless art; it was about liberating bounty, easily fenced, minimal forensic trail. The strategy: await the cake, the subsequent off-key rendition of 'Happy Birthday' forming the sonic distraction for a smooth ingress and egress via the garden's service door. Primary, minor impediment: the aggressively cheerful gnome population stationed with unnerving precision along the path. Their painted smiles seemed to smuggle ceramic malice. Bill shivered, eyes flicking nervously towards the terracotta troops. "They’re watching us, Moll. Judging." Molly sighed, the sound a weary echo. "They’re glazed earthenware, Bill. Their allegiance is to mildew. My concern isn’t ceramic surveillance, but volatile children. What if impatience overrides politeness? We need a diversion." Bill’s eyes, usually dull, ignited with a peculiar spark. "Giant present costume! We burst in, yell ‘Surprise!’, then take them all!" Molly’s eye twitched. 
"Bill, your proposals possess a charming tendency towards the overtly theatrical. Remember the inflatable badger?" Bill winced. "It looked quite cuddly." Molly cut him off. "No giant novelty apparel, Bill. Nor 'Chainsaw Chuckles'. We employ... a more refined form of chaos." Bill leaned forward, intrigued. "Refined chaos?" Molly stated her revised plan with cool precision. "That vat of lukewarm, beige sludge. We wait for peak sugar frenzy. Then, amidst the maelstrom, a ‘tactical spillage’. A glorious, gooey cascade. They’ll be too busy navigating the terrain to notice the prize egressing.” Bill’s face illuminated with profound revelation. "Beige. It’s magnificent, Molly." Molly allowed a glimmer of satisfaction. "Artistry, Bill. Now, remember your role." Bill nodded. "No costumes. No clowns. No gnomes." Molly gave a curt nod. "Just the beige."
created_at
"2026-02-25T01:14:00.632Z"
2/25/2026, 1:14:00 AM
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1108
moderation_latency
(null)
generation_time
69602
tokens_prompt
77
tokens_completion
11814
native_tokens_prompt
67
native_tokens_completion
15437
native_tokens_completion_images
0
native_tokens_reasoning
14891
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0061815
router
(null)
provider_responses
0
endpoint_id
"ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok
false
latency
1108
model_permaslug
"google/gemini-2.5-flash-lite"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771982040-zW9SOH4sQ72gQsTyyXDx"
upstream_id
(null)
total_cost
0.0061815
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
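The cost and duration reported for this run can be cross-checked against the token counts and timings in the metadata. The sketch below is only a sanity check under stated assumptions: the per-million-token prices (0.10 USD input, 0.40 USD output) are not part of the run record, and treating the displayed duration as latency plus generation time, truncated to whole seconds, is a guess about how the page derives it. The function names are hypothetical.

```python
# Values copied from the run metadata.
native_tokens_prompt = 67
native_tokens_completion = 15437
latency_ms = 1108
generation_time_ms = 69602

# ASSUMPTION: prices in USD per million tokens; not stated in the run record.
PRICE_IN_PER_M = 0.10
PRICE_OUT_PER_M = 0.40


def estimate_cost(prompt_tokens, completion_tokens,
                  price_in=PRICE_IN_PER_M, price_out=PRICE_OUT_PER_M):
    """Estimate the cost in USD from token counts and per-million prices."""
    return (prompt_tokens * price_in + completion_tokens * price_out) / 1_000_000


def format_duration(ms):
    """Format milliseconds as 'Xm Ys', truncating to whole seconds (assumed)."""
    minutes, seconds = divmod(ms // 1000, 60)
    return f"{minutes}m {seconds}s"


cost = estimate_cost(native_tokens_prompt, native_tokens_completion)
duration = format_duration(latency_ms + generation_time_ms)
print(f"cost={cost:.7f} duration={duration}")
```

Under these assumptions the estimate agrees with the recorded total_cost of 0.0061815 and the displayed duration of 1m 10s. Nearly all of the cost comes from completion tokens, most of which are reasoning tokens (14891 of 15437).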
Evaluation details
Result    Evaluator                      Details                            Meta Data
0.00%     Matches word count             n/a                                neededClean: false; words: 410
10.95%    Dialogue to Total Word Ratio   Ratio: 36.65%, Deviation: 6.65%    neededClean: false; wordsTotal: 412; wordsDialogue: 151
5.4771%   Overall
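The ratio and deviation figures follow directly from the word counts in the meta data. The sketch below reproduces them and shows one plausible way to count dialogue words. It is not the evaluator's actual implementation: the quote-matching rule and whitespace tokenization in count_dialogue_words are assumptions, and the mapping from deviation to the 10.95% score is not given in the report, so it is not reproduced here. The overall figure is consistent with an unweighted mean of the two evaluator scores; the unrounded value 10.9542 is back-computed from 5.4771 × 2 and is also an assumption.

```python
import re

TARGET_RATIO = 0.30  # the scenario asks for 30% dialogue


def count_dialogue_words(text):
    """Count words inside double quotes (hypothetical rule, not the evaluator's)."""
    normalized = text.replace("\u201c", '"').replace("\u201d", '"')
    quoted = re.findall(r'"([^"]*)"', normalized)
    return sum(len(span.split()) for span in quoted)


def dialogue_ratio(words_dialogue, words_total):
    """Fraction of all words that are dialogue."""
    return words_dialogue / words_total


# Figures reported in the evaluation meta data.
ratio = dialogue_ratio(151, 412)
deviation = abs(ratio - TARGET_RATIO)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# ASSUMPTION: overall = unweighted mean of evaluator scores (in percent).
scores = [0.0, 10.9542]
overall = sum(scores) / len(scores)
print(f"Overall: {overall:.4f}%")

# Small usage example for the hypothetical counter.
sample = 'Bill nodded. "No costumes. No clowns." Molly gave a curt nod. "Just the beige."'
print(count_dialogue_words(sample), len(sample.split()))
```

The two evaluators report slightly different totals for the same response (410 and 412 words), which suggests they tokenize differently; a simple whitespace split like the one above will not necessarily match either.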