NC Bench
Run: 2026-02-25-01KJ95G8E9SP5NA0TSYQ0DH7FG
Run Details
Model: Gemini 2.5 Flash Lite (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 22.3s
Cost: $0.002536
Provider: Google AI Studio
Evaluation summary
Score      Evaluator                      Details Link
0.00%      Matches word count             (details)
88.15%     Dialogue to Total Word Ratio   (details)
44.0743%   Overall
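The overall figure appears consistent with an unweighted mean of the two evaluator scores. The sketch below only checks that arithmetic. The function name `overall_score` is hypothetical, and the averaging rule is an assumption inferred from the numbers shown here, not a documented NC Bench formula.

```python
def overall_score(scores):
    """Unweighted mean of per-evaluator scores, in percent (assumed rule)."""
    return sum(scores) / len(scores)


# Scores as displayed in the summary (already rounded to two decimals).
displayed_scores = [0.00, 88.15]

overall = overall_score(displayed_scores)
print(f"{overall:.3f}%")
```

The mean of the displayed scores differs slightly from the reported 44.0743%, which is what one would expect if the page averages the unrounded dialogue-ratio score (about 88.1486%) rather than the rounded 88.15%.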
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted the focus on her battered binoculars, her gaze sweeping over the manicured lawn. Balloons bobbed like cheerful, doomed spirits, and the faint scent of artisanal sausage rolls wafted on the breeze. Children, a shrieking, pastel-clad mob, chased each other around an inflatable dragon.
“Right then,” Molly whispered, her voice a low rasp. “Phase One complete. We’ve scouted the perimeter, identified the principal, and confirmed the target’s primary objective: maximum present accumulation.”
Bill shifted uncomfortably behind a rhododendron bush, his borrowed tweed jacket snagging on a branch. “Are we sure about this, Mol? It’s Little Archie’s third birthday. There’s a clown. A… a *real* clown.”
Molly sighed, lowering the binoculars. The distant sound of a kazoo pierced the air. “Bill, we’ve cased nuclear silos and art museums. A fête for a toddler hardly qualifies as high-stakes. The clown is a diversion. The parents are distracted by the sheer inevitability of projectile juice boxes. The presents, however, are a vulnerable, unattended treasure trove waiting to be liberated.”
“But… kids,” Bill whimpered, eyeing a particularly loud child attempting to eat a streamer. “They have that… purity. It’s wrong.”
“Purity?” Molly scoffed. “His parents probably spent a fortune on that plastic monstrosity he’s wearing. We’re just redistributing wealth, Bill. From the overindulgent to the… opportunistically inclined. Besides, think of the swag. Rumour has it little Archie’s got the latest ‘Glo-Worm 5000’ and a miniature hoverboard that actually hovers enough to escape a nursery.”
Bill paled. “A hoverboard? Oh, no. That’s for professionals, Mol. We’re more of a… ‘grab a few handbags and melt away’ kind of outfit.”
“Nonsense. You’ll provide the diversion. When the father – the one trying desperately to juggle a camera and a squeaky toy – takes his eyes off the pile for a nanosecond, you unleash Operation: Confetti Cannon.”
“Operation Confetti Cannon? What does that involve?” Bill asked, already looking terrified.
“You throw that bag of actual confetti we bought at the party supply store into the air. As soon as it starts falling, I move. They’ll blame the wind, the dragon, or that dreadful clown. This is our moment.” Molly grinned, a flash of gold tooth. “It’s elegant, it’s swift, and it targets the most ethically ambiguous loot imaginable.”
“And the exit? If the clown sees us?”
“Clowns are notoriously bad at distinguishing faces once they’re inside that makeup. We’ll be ghosts. And if not, well, you always have your convincing portrayal of a runaway bouncy castle to fall back on.” Molly nudged him. “Now, get ready. The kazoo solo is building. That’s our cue.”
Bill swallowed hard, glancing from the chortling clown to the shimmering tower of presents. "Right. Confetti. For the children."
created_at: "2026-02-25T01:09:35.311Z" (2/25/2026, 1:09:35 AM)
model: "google/gemini-2.5-flash-lite"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1113
moderation_latency: (null)
generation_time: 22091
tokens_prompt: 77
tokens_completion: 3054
native_tokens_prompt: 67
native_tokens_completion: 6323
native_tokens_completion_images: 0
native_tokens_reasoning: 5673
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0025359
router: (null)
provider_responses:
  0:
    endpoint_id: "ce839073-aa24-4f29-8358-15b319bd05ec"
    is_byok: false
    latency: 1113
    model_permaslug: "google/gemini-2.5-flash-lite"
    provider_name: "Google AI Studio"
    status: 200
api_type: "completions"
id: "gen-1771981775-aE4Hp1HAh8M16r796Wsn"
upstream_id: (null)
total_cost: 0.0025359
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
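The reported cost can be sanity-checked against the native token counts. The sketch below is illustrative only: the per-million-token prices are assumptions (they are not stated anywhere in this run report), as is the premise that reasoning tokens are included in `native_tokens_completion` and billed at the output rate. The helper name `estimate_cost` is hypothetical.

```python
# Assumed list prices in USD per one million tokens (not taken from this report).
ASSUMED_PRICE_IN_PER_M = 0.10
ASSUMED_PRICE_OUT_PER_M = 0.40


def estimate_cost(prompt_tokens, completion_tokens,
                  price_in=ASSUMED_PRICE_IN_PER_M,
                  price_out=ASSUMED_PRICE_OUT_PER_M):
    """Estimate request cost in USD from native token counts."""
    return (prompt_tokens * price_in + completion_tokens * price_out) / 1_000_000


# Values copied from the metadata above.
native_tokens_prompt = 67
native_tokens_completion = 6323
native_tokens_reasoning = 5673

estimated = estimate_cost(native_tokens_prompt, native_tokens_completion)
# Tokens left for the visible answer, assuming reasoning is part of the completion count.
visible_tokens = native_tokens_completion - native_tokens_reasoning

print(f"estimated cost: ${estimated:.7f}")
print(f"visible completion tokens: {visible_tokens}")
```

Under these assumed prices the estimate lines up with the reported `usage` and `total_cost` of 0.0025359, and it suggests that most of the completion budget went to reasoning rather than to the scene itself.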
Evaluation details
Result     Evaluator                      Details                           Meta Data
0.00%      Matches word count             n/a                               neededClean: false, words: 445
88.15%     Dialogue to Total Word Ratio   Ratio: 71.59%, Deviation: 1.59%   neededClean: false, wordsTotal: 447, wordsDialogue: 320
44.0743%   Overall
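The ratio and deviation reported above follow directly from the `wordsDialogue` and `wordsTotal` counts. The sketch below reproduces that arithmetic and shows one plausible way to count dialogue words. The names `dialogue_stats` and `ratio_and_deviation` are hypothetical, and the quote-matching rule is an assumption: the actual NC Bench evaluator is not shown in this report and may tokenize differently. The mapping from deviation to the 88.15% score is also not shown, so it is not reproduced here.

```python
import re

# Assumed rule: dialogue is any text between double quotes, curly or straight.
QUOTED = re.compile(r'[“"](.+?)[”"]', re.DOTALL)


def dialogue_stats(text):
    """Count whitespace-separated words overall and inside quoted spans."""
    words_total = len(text.split())
    words_dialogue = sum(len(span.split()) for span in QUOTED.findall(text))
    return {"wordsTotal": words_total, "wordsDialogue": words_dialogue}


def ratio_and_deviation(words_dialogue, words_total, target=0.70):
    """Return the dialogue ratio and its absolute distance from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts copied from the evaluation details above.
ratio, deviation = ratio_and_deviation(320, 447)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
```

For example, `dialogue_stats('Molly grinned. “Now, get ready.” Bill swallowed hard. "Right. Confetti."')` counts 10 words in total and 5 inside quotes. Note that the two evaluators report slightly different totals (445 and 447 words), so any reimplementation should expect small differences from its own word-splitting rule.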