NC Bench
Overview
About
Tests
Models
Model
Gemini 2.5 Pro
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2025-11-18-01KACP8ZMXM16J26VJBDRJY5YR
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly tapped a greasy finger against the blueprint, which was, in fact, the back of a pizza box. The van smelled of old coffee and quiet desperation. “The Bouncy Castle of Doom is our primary obstacle,” she declared, her voice low and conspiratorial. Bill squinted through the smudged windshield at the target: 124 Primrose Lane. The inflatable dragon in the front yard looked less like a doom-bringer and more like it had just received some very bad news. “You mean the sad dragon?” he mumbled around a mouthful of corn chips. “One of its wings is drooping.” She ignored him completely, her entire being focused on the crayon-scrawled diagram before them. The seriousness she applied to the task would have better suited a plan to crack the Bank of England, not Landon’s Sixth Birthday Bash. “Ignore its structural integrity. At 14:30, the magician, ‘Magnifico Marvin,’ begins his set. That’s our window. The children will be mesmerized by his subpar illusions.” “Are you sure? Last week you said the face-painting station was the key distraction.” “Intel was bad,” Molly snapped. “Turns out the face-painter is an art student with a glitter pot. No hypnotic value. Marvin, however, has a rabbit. That’s premium diversion.” Bill licked orange dust from his thumb. He had to admit, the planning was always the worst part. All these code words and schematics for what would inevitably devolve into him tripping over a lawn gnome while she hissed at him from the getaway unicycle she’d insisted on using last time. “Right. Rabbit. So, I go in through the kitchen window you’ve marked ‘Easy Peasy’?” “Negative. We've upgraded. You’ll be disguised as a balloon animal delivery man. I’ve got the uniform in the back. It’s a slightly sticky polo shirt and a squeaky pump.” “A disguise? To steal a remote-controlled car and three Nerf guns? Seems like overkill.” “It’s about professionalism, Bill. We are not common thugs. We are artists. Now, what's the entry phrase?” He sighed, a sound that conveyed a deep, existential weariness. “The party toad has landed.” “No, that was for the retirement village job. This one is ‘The piñata is pregnant.’” “That doesn't even make sense.” “It’s not supposed to make sense! It’s supposed to be memorable. Now, once inside, you bypass the juice box station—do not get bogged down at the juice box station, I'm serious—and proceed to the ‘Pile of Plunder.’” Her pen circled a lopsided rectangle labelled with a skull and crossbones. It was clearly the living room coffee table, now shimmering under a mountain of brightly wrapped presents. “The plunder. Got it. Any chance the plunder includes those little pigs in a blanket?” “Focus, Bill! This is for the ‘Ultimate Lego Millennium Falcon.’ The biggest score of the season.” “You know they sell those at the mall, right?” Molly fixed him with a withering glare that could have soured a Capri Sun from fifty paces. “It's the principle of the thing.”
created_at
"2025-11-18T23:55:19.900872+00:00"
Tuesday, November 18th, 2025 at 11:55:19 PM GMT+00:00
model
"google/gemini-2.5-pro"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1984
moderation_latency
(null)
generation_time
27757
tokens_prompt
68
tokens_completion
1727
native_tokens_prompt
67
native_tokens_completion
3211
native_tokens_completion_images
0
native_tokens_reasoning
2516
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.03219375
api_type
"completions"
id
"gen-1763510091-vSU3Z95cQ0ufWL5Nvo6D"
upstream_id
(null)
total_cost
0.03219375
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
99.9840%
Matches word count
n/a
neededClean
false
words
498
34.4206%
Dialogue to Total Word Ratio
Ratio: 54.62%, Deviation: 4.62%
neededClean
false
wordsTotal
498
wordsDialogue
272
67.2023%