NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-25-01KJ95NF4Q8NXKQ5EMWNS32141
Run Details
Model:
Gemini 2.5 Flash (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
37.6s
Cost:
$0.024668
Provider:
Google AI Studio
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
99.49%
Dialogue to Total Word Ratio
(details)
49.7460%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The flicking neon sign of "Pete's Pizza Palace" cast a sickly green glow into the back alley where Molly and Bill huddled, their clandestine planning session illuminated by the dim streetlights and the occasional flash of a passing car. A discarded pizza box, surprisingly still warm, sat between them on an upturned bin. Bill chewed thoughtfully on a greasy crust, his eyes darting between Molly’s face and her meticulously drawn blueprint – which, upon closer inspection, was just a crayon rendering of a suburban backyard, complete with a joyful, almost menacing, sun. Molly, ever the professional, ignored the haphazard surroundings, her focus solely on the mission: Operation Piñata, a grand, ludicrous scheme. "Right," Molly began, tapping a finger on a splotch labelled 'Bouncy Castle'. "Target: Tiffany Perkins’ fifth birthday. Twenty-seven presents." Bill paused mid-chew, a shudder going through him. "Tiffany? The mustard terror?" "The very same." Molly’s eyes held a faint glint of retribution. "Security: Fifi, the poodle. Distraction needed." Bill perked up. "I can distract! Macarena?" He briefly demonstrated a hip wiggle, nearly toppling the bin. Molly rolled her eyes. "No. Steak for Fifi. We infiltrate." "Excellent!" Bill sketched a T-bone on his arm. "Children? Bouncing, screaming, sticky." He shuddered. "Hazard pay. You are 'Barnaby the Bear'." Bill nearly choked. "Barnaby? I’m Big Bad Wolf!" "Misdirection. Barnaby juggles. I, 'Penelope the Pixie', organize presents." "Penelope? Glitter?" Bill looked distressed. Molly sighed. "No. No trace. Presents in conservatory. Extraction via gnomes." Bill peered closer. "Actual gnomes? Fishing rods?" "Highly detailed. Do not scuff one. Reputation. Professional purloiners." "Right. Gnomes. Got it." Bill swallowed the last bite. Molly’s gaze sharpened. "Strictly presents. No cake, party bags. Focus." Bill rubbed his chin. "Presents only. Slide, for safety?" Molly closed her eyes briefly, then reopened them. "One step at a time, Barnaby. One step at a time." The greasy scent of stale pizza hung heavy in the air as they finalized their absurd, yet meticulously planned, operation. Their most audacious heist yet, all for cheap plastic toys and the sweet taste of revenge. Molly straightened her shoulders, already envisioning the pile of brightly wrapped gifts. Bill, however, was clearly envisioning the slide, perhaps even a quick go on the swings. The night was young, and kindergarten chaos awaited.
created_at
"2026-02-25T01:12:26.013Z"
2/25/2026, 1:12:26 AM
model
"google/gemini-2.5-flash"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1378
moderation_latency
(null)
generation_time
37534
tokens_prompt
77
tokens_completion
3755
native_tokens_prompt
67
native_tokens_completion
9859
native_tokens_completion_images
0
native_tokens_reasoning
9307
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0246676
router
(null)
provider_responses
0
endpoint_id
"16c01e26-9b91-4d22-b487-ed35634b017d"
is_byok
false
latency
1378
model_permaslug
"google/gemini-2.5-flash"
provider_name
"Google AI Studio"
status
200
api_type
"completions"
id
"gen-1771981946-SRKPt9j43jgqUrY62VW8"
upstream_id
(null)
total_cost
0.0246676
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
374
99.49%
Dialogue to Total Word Ratio
Ratio: 30.32%, Deviation: 0.32%
neededClean
false
wordsTotal
376
wordsDialogue
114
49.7460%