NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ95A27WNRK55Q9VARG6EMBV
Run Details
Model:
Gemini 2.5 Flash Lite (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
12.1s
Cost:
$0.001265
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The 'borrowed' van, a chameleon of municipal grey, reeked of ambition gone slightly sour, overlaid with the faint, unsettling aroma of cheap disinfectant. Molly, perched on an upturned bucket, traced a route across a crumpled diagram that looked suspiciously like a child’s placemat. Bill, occupying the passenger seat, was meticulously buffing a surprisingly pristine, yet distinctly curved, length of metal with a tea towel embroidered with a cartoon duck. Outside, the saccharine strains of a pop song popular with the under-ten demographic drifted on the breeze, an audible taunt. "Alright, Bill," Molly began, her voice a low purr that didn't quite hide the exasperation. "Operation Sugar Plum Faerie is a go, and the objective is simple: maximum present procurement. We're talking about the holy grail of childhood avarice." Bill beamed, the metal glinting. "You mean the presents, Molly? The colourful rectangles of pure, unadulterated greed that the little darlings haven't even opened yet?" "Precisely. Think of them as treasure. Untouched. Blissfully ignorant of the harsh realities of eBay listings and secondary market value. The intel confirms the target is 'Little Timmy's 7th Birthday Bonanza' at number 17 Primrose Lane. Primary security measures include insufficient adult supervision, a potential choking hazard zone around the party favour bags, and the overwhelming power of the party host's mother." "So, we're just gonna, like, waltz in there like we own the place?" Bill asked, his brow furrowed in concentration. "Do we have a password?" "Bill, we are not storming Fort Knox. We're infiltrating a sugar-fueled riot. You, my dear Bill, will be the eccentric, rumour-has-it-won-the-lottery uncle offering questionable life advice and surprisingly skilled balloon animal impressions. I will be the quiet relative who ‘isn’t feeling well’ and is assigned the important task of ‘keeping an eye on the gifts downstairs’. My speciality is blending in by looking utterly miserable." Molly adjusted her sunglasses, which a sane person would consider overkill for a drizzly afternoon. "Your mission, should you choose to accept it, is reconnaissance. Pinpoint the primary gift cache – usually near the designated destruction zone, I mean, craft table. Then, a subtle signal. We use the provided gift bags as our discrete, ecologically friendly, and highly expendable transport system. Think swift, not strenuous." Bill nodded vigorously. "Got it. Erk-in-essence. And… do they like elephants?" Molly inhaled deeply, counting to ten mentally by the branding on the cheap biscuit tin beside her. "Bill, our objective is to *acquire* the gifts, not to provide existential entertainment. The key is the art of the distraction. While young Kevin is contemplating the structural integrity of his balloon dog, your eyes should be on a suspiciously large box wrapped in that atrocious dinosaur paper. While Mrs. Gable is recounting her award-winning rose pruning techniques, you’ll be subtly transferring the loot into our decoys. It’s about minimal contact, maximum acquisition. Think… kleptomaniac ballet." "So we gotta dance?" Bill whispered, eyes wide. "No, Bill, we do *not* have to dance. We need to be ghosts. Ghosts who happen to be collecting valuable merchandise. Remember, we’re not taking from children. We are, in essence, *rehoming* the bounty. Giving it a second, more appreciative life, far away from sticky fingers and parental guilt. A service, really." Bill’s face lit up with profound understanding. "We're like… delivery squirrels, but for expensive stuff?" Molly simply closed her eyes, a tiny, almost imperceptible smile playing on her lips. "Close enough, Bill. Close enough."
created_at
"2026-02-25T01:06:12.353Z"
2/25/2026, 1:06:12 AM
model
"google/gemini-2.5-flash-lite"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
955
moderation_latency
(null)
generation_time
12059
tokens_prompt
77
tokens_completion
2020
native_tokens_prompt
67
native_tokens_completion
3146
native_tokens_completion_images
0
native_tokens_reasoning
2340
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0012651
router
(null)
provider_responses
0
endpoint_id
"ce839073-aa24-4f29-8358-15b319bd05ec"
is_byok
false
latency
955
model_permaslug
"google/gemini-2.5-flash-lite"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981572-G0UMQfYYi0cn47WFkAtC"
upstream_id
(null)
total_cost
0.0012651
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
567
0.00%
Dialogue to Total Word Ratio
Ratio: 73.48%, Deviation: 23.48%
neededClean
false
wordsTotal
577
wordsDialogue
424
0.0000%