NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94T4K3C34ZH6FBYV6NQZNZ
Run Details
Model:
Gemini 2.5 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
47.8s
Cost:
$0.002767
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
60.28%
Matches word count
(details)
77.09%
Dialogue to Total Word Ratio
(details)
68.6812%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The sticky vinyl booth of "Hank's Greasy Spoon" felt less like a planning room and more like a crime scene waiting to happen, and not in the good way. Molly, all sharp angles and weary cynicism, stared into her lukewarm coffee as if it held the secrets to the universe, or at least a less irritating accomplice. Bill, on the other hand, was practically vibrating with misplaced enthusiasm, his knees knocking a rhythm against the chipped Formica table. He wore a lurid Hawaiian shirt that suggested he’d mistaken their mission briefing for a luau. "Alright, Billy boy. Operation Easter Egg hunt is a go. You remember the objectives?" Molly’s voice dripped with an impatience honed by years of Bill’s spectacular misinterpretations. "Easter Egg hunt! Fantastic! Are we using those little baskets? And do I get to wear the bunny ears? I know a guy who makes them out of actual rabbit fur, it’s very authentic." Bill gestured wildly, nearly knocking over a sugar dispenser. Molly caught it with a flick of her wrist that was both practiced and dismissive. "It's not an Easter egg hunt, it's a children's birthday party. Saturday. The Periwinkle’s place. And no rabbit-fur ears. We're not *terrorizing* the local fauna, just liberating their presents. Specifically, the Mega-Blaster 3000. That’s your primary target." The Mega-Blaster 3000, a glorified toy gun that promised hours of whizz-bang fun, was apparently the must-have item of the season. Molly had seen the commercials. Bill, less so. "Mega-Blaster! Does it shoot lasers? Can I keep one? Just the one?" He wiggled his eyebrows hopefully. Molly resisted the urge to sketch a diagram of a prison cell on his forehead. "It shoots foam darts, Bill. And you can't keep any. Your job is to create a diversion. Make the kids forget all about the shiny new toy they don't have yet. Think mayhem, but family-friendly mayhem. Can you manage that without causing an international incident or setting off the fire alarm?" Bill puffed out his chest, the Hawaiian shirt stretching precariously. He was surprisingly good at 'family-friendly mayhem' when properly motivated. "Mayhem? Family-friendly? Oh, I'm your guy! I could… I could trip myself and fall into the bouncy castle! That always gets a laugh!" His eyes gleamed with a terrifyingly innocent conviction. "No. Absolutely not. The bouncy castle is prime getaway territory. What you *will* do, if all else fails with your 'creative' approach, is sing loudly and badly. Very loudly. About… uh… hot dogs. You know the hot dog song?" Molly’s face contorted as she recalled the earworm. It was the only thing she could think of that might genuinely distract an entire party full of sugar-hyped children. "The hot dog song! *I'm a little hot dog, short and stout!*' Yes! I’ve got this! Consider it… sung! So, after the Mega-Blaster is secured, what's the secondary objective?" He finished his lukewarm coffee in one gulp, a faint green stain blooming on his shirt near the collar. "Cash. Envelopes. And for pity's sake, try not to get a single present *onto* the lawn. We need a clean exit."
created_at
"2026-02-25T00:57:30.473Z"
2/25/2026, 12:57:30 AM
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1374
moderation_latency
(null)
generation_time
47774
tokens_prompt
77
tokens_completion
11987
native_tokens_prompt
67
native_tokens_completion
6902
native_tokens_completion_images
0
native_tokens_reasoning
6173
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0027675
router
(null)
provider_responses
0
endpoint_id
"ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok
false
latency
1374
model_permaslug
"google/gemini-2.5-flash-lite"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981050-zhlHkjDMWvF0jEddu6fV"
upstream_id
(null)
total_cost
0.0027675
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
60.28%
Matches word count
n/a
neededClean
false
words
515
77.09%
Dialogue to Total Word Ratio
Ratio: 52.28%, Deviation: 2.28%
neededClean
false
wordsTotal
526
wordsDialogue
275
68.6812%