NC Bench
Overview
About
Tests
Models
Model
Gemini 2.5 Pro
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2025-11-18-01KACPCCY7W3Z3JE3H4XTRG83X
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van, a retired plumbing vehicle that smelled permanently of rust and shattered dreams, was parked across from a scene of absolute suburban carnage. Balloons, garish and swollen, bobbed against the windows of the split-level house. The air thrummed with the high-pitched shrieks of children and the relentless, thumping bassline of a song about a family of sharks. Inside the van, Molly adjusted the focus on her binoculars. “Target is saturated. Hostile forces are distracted by the sugar deployment and auditory assault.” Bill, wedged into the passenger seat, lowered a half-eaten tube of salt-and-vinegar crisps. “You mean the kids are hyper from cake and bad music?” “Precisely. The asset cache is located in the primary social chamber. That’s the living room, Bill. We’ve got a ten-minute window after the candle ceremony and before the professional entertainer arrives.” Molly’s finger traced a line on a crudely drawn map of the house, which was actually the back of a pizza menu. Bill peered at it, his brow furrowed in concentration. The relentless sensory assault from outside was making the van’s thin metal walls vibrate. A small boy, face smeared with blue icing, ran screaming across the lawn, pursued by a girl wielding a foam noodle. It was pure chaos. “I don’t like it, Mol,” Bill said, wiping grease from his chin with the back of his hand. “The intel was incomplete.” “The intel is a Facebook invitation from the birthday boy’s mother. It’s rock solid. What’s the problem?” “The problem,” Bill said, pointing a crisp-dusted finger through the windscreen, “is him.” Molly followed his gaze. A man was climbing out of a comically small car. He wore a rainbow-striped suit, a spinning bow tie, and a floppy hat with a daisy attached. He was pulling a unicycle from the backseat. “That’s the entertainer. Boffo the Clown. He’s a non-factor. A diversion.” “A non-factor? Look at the size of his shoes, Molly. That’s an unnatural size for a foot. He could conceal anything in there. Look at his eyes. He’s seen things.” Molly sighed, a long, weary exhalation that fogged up her side of the windscreen. “He’s a party clown, Bill. His job is to make balloon animals and fall over.” “That’s his cover. He’ll have situational awareness. Nobody who juggles for a living is ever truly relaxed. He’ll spot us for sure.” Bill crunched another crisp, the sound like a gunshot in the tense silence of the van. Outside, Boffo the Clown honked a horn, and the children descended upon him in a squealing, joyful mob. “Fine,” Molly snapped, lowering the binoculars. “We adjust. We use the clown.” “To do what? Create a diversion? He *is* a diversion!” “No,” Molly said, a slow, wicked grin spreading across her face. “We’re going to frame him.”
created_at
"2025-11-18T23:57:08.779643+00:00"
Tuesday, November 18th, 2025 at 11:57:08 PM GMT+00:00
model
"google/gemini-2.5-pro"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2141
moderation_latency
(null)
generation_time
25061
tokens_prompt
68
tokens_completion
1258
native_tokens_prompt
67
native_tokens_completion
2640
native_tokens_completion_images
0
native_tokens_reasoning
1969
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.02648375
api_type
"completions"
id
"gen-1763510203-znTV1q3KfXOmHk0bpAcO"
upstream_id
(null)
total_cost
0.02648375
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.2141%
Matches word count
n/a
neededClean
false
words
472
0.5208%
Dialogue to Total Word Ratio
Ratio: 40.25%, Deviation: 10.25%
neededClean
false
wordsTotal
472
wordsDialogue
190
0.3675%