NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ94SAT1P643P82826T39K7Y
Run Details
Model:
Gemini 2.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
38.8s
Cost:
$0.017785
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
81.25%
Dialogue to Total Word Ratio
(details)
40.6244%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly squinted at the glitter-dusted invitation, a grimace battling the last stubborn sequin clinging to her cheek from a previous, regrettably, sparkly engagement. It served as a stark reminder of the unique challenges their current profession entailed. Across the cramped kitchen table, Bill meticulously unfolded a crude, hand-drawn map of 74 Cherry Blossom Lane. He smoothed the creases with the solemnity usually reserved for blueprints of international diamond syndicates, his spectacles perched precariously on his nose, magnifying his already wide, eager eyes. A half-eaten sticky bun lay beside a pair of dubious binoculars, suggesting recent, covert reconnaissance had involved significant carbohydrate intake. The very air around them hummed with the high stakes of their improbable profession. "Target: Barnaby Henderson's fifth birthday," Molly stated, tapping a perfectly manicured nail against a crayon drawing of a smiling unicorn. "Operation 'Princess Sparkle Rainbow Bash.' Presents." Bill nodded, circling a crudely drawn gazebo with a bright purple marker. "Entry: rear gate, adjacent to bouncy castle. Infiltration window: 2:15 to 2:30. During the puppet show." Molly sighed, pinching the bridge of her nose. "Principal obstacle?" She knew. "Merlin the Magnificent," Bill declared, his eyes shining with utterly misplaced concern. "The hired magician. Balloon animals. Misdirection." Molly rolled her eyes. "Bill, we're not stealing a magical artefact. Just plastic. Where’s the actual loot?" "Under the gazebo," Bill recited, pointing a stubby finger on his map. "Intel suggests drones, LEGO, 'Poppy the Talking Pony.' It speaks fluent toddler-ese." "Fascinating," Molly drawled, unconvinced. "Other 'security threats'?" "Parents," Bill sighed. "Mothers. Mimosas. Fathers. Flat-pack toys. And the clown. Potentially armed with water pistols." "The clown is yours," Molly decided. "Distract. I'll extract." Her gaze drifted to the invitation’s RSVP date, long past. Their client was notoriously impatient for shiny new things, regardless of previous ownership. "But... the cake?" Bill ventured, a wistful, almost childlike, look blooming on his face. "Mrs. Henderson makes a mean red velvet." Molly pinched the bridge of her nose. "No cake, Bill. Professional operation. In, out. No sprinkles. No tears. No empty plates. Understood?" Bill nodded, already picturing himself amidst the partygoers, perhaps with a strategically acquired cupcake hidden in his inside pocket. This was going to be their finest hour; even if it involved dodging rogue juice boxes and navigating a sea of tiny, sticky fingerprints, the haul would be glorious and the client thrilled. The sheer audacity of their plan, targeting the innocent joy of a child's birthday, was, in its own twisted way, a true masterpiece of comedic villainy.
created_at
"2026-02-25T00:57:04.072Z"
2/25/2026, 12:57:04 AM
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1303
moderation_latency
(null)
generation_time
38718
tokens_prompt
77
tokens_completion
8916
native_tokens_prompt
67
native_tokens_completion
7106
native_tokens_completion_images
0
native_tokens_reasoning
6507
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0177851
router
(null)
provider_responses
0
endpoint_id
"16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok
false
latency
1303
model_permaslug
"google/gemini-2.5-flash"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981024-TzcNNr9WNMmObAuhH09k"
upstream_id
(null)
total_cost
0.0177851
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
417
81.25%
Dialogue to Total Word Ratio
Ratio: 27.96%, Deviation: 2.04%
neededClean
false
wordsTotal
422
wordsDialogue
118
40.6244%