NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ95RGVK4MY2JB1F203SS8V2
Run Details
Model:
Gemini 2.5 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
9.5s
Cost:
$0.001057
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
1.04%
Matches word count
(details)
27.59%
Dialogue to Total Word Ratio
(details)
14.3108%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her binoculars, the cheap plastic digging into her eye sockets. Below them, in the manicured backyard of a surprisingly large suburban house, a riot of primary colours was unfolding. A bouncy castle the size of a small hatchback listed precariously, spewing giggling children into the mid-afternoon sun. Streamers drooped from improbably high trees. "Alright, Bill," Molly whispered, her voice a low, gravelly rasp that she’d cultivated after watching too many old detective movies. "Phase one: Infiltration. The target is locked. A veritable cornucopia of… brightly wrapped fiscal obligations." Bill, who was attempting to balance a half-eaten hot dog on his nose, let it tumble into the grass. "Fiscal obligations? Are those the little cake-pops with the sugary hats? Because I've been thinking about those." Molly’s eye twitched. She lowered the binoculars with exaggerated care. "Focus, Bill! We're not here for confectionery. We're here for the motherlode. Boxes. Bags. The loot." She gestured vaguely towards the house, where a woman with terrifyingly bright pink hair was wrangling a small, screaming entity draped in a superhero cape. "Think of it as... acquiring underdeveloped assets." Bill nodded, finally catching on. "Right. Loot. So, how are we getting in? Through the doggy door? 'Cause I saw a little terrier, looked like he could be bribed with a bit of cheese. Very friendly, that one." He mimed offering a slice. "The doggy door is a tertiary option, reserved for emergencies or if the primary ingress fails," Molly recited, ignoring the cheese gesture. "We go through the back patio door. It's electronically locked, but I've got the bypass." She produced a small, blinking device that looked suspiciously like a souped-up TV remote. "Electronic lock? On a *kid's* party?" Bill’s eyes widened, looking genuinely concerned. "What if it’s like, voice-activated? 'Open sesame, happy birthday!' And then it just sings them a song instead of opening? We can't steal presents if the door starts a sing-along, Molly." Molly pinched the bridge of her nose. The sound of a kazoo orchestra briefly drowned out her response. "It's a standard deadbolt with a Bluetooth module, Bill. Not a sentient serenade machine. Once inside, find the main gift repository. Usually the living room or dining table. You handle crowd diversion." "Crowd diversion? You mean like a… surprise party? Jump out, yell 'Surprise!' and while they're screaming, I grab the presents?" Bill’s grin was wide and hopeful. "No!" Molly hissed, a frantic edge creeping into her tone. "You create a *distraction*. A tactical diversion. Maybe spill a drink. Loudly compliment the balloons. Anything to draw eyes away from the prize. Think 'controlled chaos', not 'unwanted guest'." Bill considered this, stroking his chin. A particularly loud shriek of joy emanated from the bouncy castle. "Oh. Right. So, if I… accidentally… knock over the giant inflatable unicorn? Is that too much diversion?" Molly sighed, the pneumatic release of a deflating balloon. "Just don't get caught, don't get glitter on my blueprints, and for the love of all that is holy, don't eat the evidence." She pulled her binoculars back up, her focus returning to the brightly coloured chaos. "Let's move when the ice cream truck arrives. Maximum distraction."
created_at
"2026-02-25T01:14:06.074Z"
2/25/2026, 1:14:06 AM
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1193
moderation_latency
(null)
generation_time
9507
tokens_prompt
77
tokens_completion
1892
native_tokens_prompt
67
native_tokens_completion
2625
native_tokens_completion_images
0
native_tokens_reasoning
1864
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0010567
router
(null)
provider_responses
0
endpoint_id
"ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok
false
latency
1193
model_permaslug
"google/gemini-2.5-flash-lite"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771982046-9GrhIeMNvZz7Iu0XrNQ8"
upstream_id
(null)
total_cost
0.0010567
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
1.04%
Matches word count
n/a
neededClean
false
words
526
27.59%
Dialogue to Total Word Ratio
Ratio: 55.08%, Deviation: 5.08%
neededClean
false
wordsTotal
532
wordsDialogue
293
14.3108%