NC Bench
Model: Gemini 2.5 Pro
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Run ID: 2025-11-18-01KACPCXQ06SH5GB95VE5KNT3W
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly lowered the binoculars, a grim set to her jaw. “There it is, Bill. Fortress Birthday.”
“Do you think they have those little sausages?”
“What?”
“You know, the ones in the blanket. Pigs in a blanket. Top-tier party food.”
Molly let the binoculars fall against her chest. “We are not here for canapés. We are here for the haul. Synchronize watches.”
Bill held up a wrist adorned with a bright blue digital watch. “It’s quarter to unicorn.”
“It’s two forty-five, Bill. The clown arrives at three. That’s our window. Now, go over the plan.”
“Right. The bouncy castle.”
“And?”
“We… bounce our way to the presents?”
“No,” Molly sighed, the sound long and painful. “The inflation tube. It runs from the generator right past the doggy door. It’s our cover. You slide in, I keep lookout.”
“What if the dog is still in there?”
“The dog’s name is Princess Fluffybutt. I think you can handle her.”
“That’s what they said about The Chihuahua of Grenoble.”
“Just focus. The present pile is by the French doors. We’re talking a mountain of potential. Tarquin is turning six. His parents are hedge fund managers. There will be no socks.”
“What if there are socks?” Bill asked, his voice a worried whisper. “Really nice ones? With, like, cashmere?”
“There will be no socks! Now, the distraction. What’s the distraction?”
“The clown,” Bill said confidently. “What kind of clown?”
“Does it matter?”
“Molly, it always matters. A sad clown? Unpredictable. Might go off-script. A juggler? Too much kinetic energy, things get knocked over. A mime?” He shuddered. “A mime is a silent alarm.”
“It’s a balloon animal artist named Mr. Puffles. He’s booked for thirty minutes. You go in the moment he makes the first squeaky dachshund. You grab the biggest boxes first. You’re the pack mule.”
“I’m the pack mule,” Bill repeated, nodding slowly. “Got it. You create the diversion.”
“No, Mr. Puffles is the diversion. I’m the eye in the sky.”
“From the van?”
“Yes, from the van.”
“So you’re more of an eye in the van.”
“Just get the presents, Bill.”
“Okay, okay. The signal?”
“The second verse of ‘Happy Birthday’.”
“There’s a second verse?”
Molly pinched the bridge of her nose. “For these people, there is. Now get ready. I see a unicycle.”
“Oh no,” Bill groaned. “That’s advanced clowning. This just got complicated.”
created_at: "2025-11-18T23:57:21.081573+00:00" (Tuesday, November 18th, 2025 at 11:57:21 PM GMT+00:00)
model: "google/gemini-2.5-pro"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 2214
moderation_latency: (null)
generation_time: 20282
tokens_prompt: 68
tokens_completion: 1307
native_tokens_prompt: 67
native_tokens_completion: 2061
native_tokens_completion_images: 0
native_tokens_reasoning: 1437
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.02069375
api_type: "completions"
id: "gen-1763510220-gpbK8yQNaOQDZM7VFmgx"
upstream_id: (null)
total_cost: 0.02069375
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
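The token and cost fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes `generation_time` is in milliseconds and that the provider charged 1.25 USD per million prompt tokens and 10.00 USD per million completion tokens (reasoning tokens included). Neither the unit nor the prices are stated in this record. Under those assumptions the recomputed cost matches the recorded `total_cost`.

```python
# Values copied from the run metadata above.
native_tokens_prompt = 67
native_tokens_completion = 2061
native_tokens_reasoning = 1437
generation_time_ms = 20282  # assumption: the field is reported in milliseconds

# Assumed prices in USD per million tokens; NOT stated anywhere in this record.
PRICE_PER_M_INPUT = 1.25
PRICE_PER_M_OUTPUT = 10.00

# Completion tokens left over once reasoning tokens are subtracted.
visible_tokens = native_tokens_completion - native_tokens_reasoning

# Cost estimate: prompt and completion tokens billed at their assumed rates.
estimated_cost = (
    native_tokens_prompt * PRICE_PER_M_INPUT
    + native_tokens_completion * PRICE_PER_M_OUTPUT
) / 1_000_000

# Rough throughput over the whole generation, reasoning tokens included.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(visible_tokens)   # 624
print(estimated_cost)   # 0.02069375
print(round(tokens_per_second, 1))
```

If the assumptions hold, roughly 70% of the native completion tokens (1437 of 2061) were spent on reasoning rather than on the visible scene.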
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false, words: 396
0.0034% | Dialogue to Total Word Ratio | Ratio: 84.34%, Deviation: 14.34% | neededClean: false, wordsTotal: 396, wordsDialogue: 334
0.0017%
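The reported ratio and deviation follow directly from the two word counts: 334 / 396 is about 84.34%, which is 14.34 points above the 70% target. The sketch below shows one way such a measurement could be made. It is not the benchmark's actual evaluator. The function names, the whitespace tokenization, and the rule that only text between curly double quotes counts as dialogue are all assumptions, so it may not reproduce the counts 396 and 334 exactly on the response above. How the Result percentages are derived from the deviation is not shown in the record and is not attempted here.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (assumed tokenization)."""
    return len(text.split())


def dialogue_stats(text: str, target: float = 0.70) -> dict:
    """Hypothetical scorer: share of words inside curly double quotes."""
    # Assumption: dialogue is whatever sits between “ and the next ”.
    spoken = re.findall(r"“([^”]*)”", text)
    words_dialogue = sum(count_words(s) for s in spoken)
    words_total = count_words(text)
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio_pct": round(ratio * 100, 2),
        "deviation_pct": round(abs(ratio - target) * 100, 2),
    }


# Small usage example on one paragraph of the response.
sample = (
    "Molly pinched the bridge of her nose. "
    "“For these people, there is. Now get ready.”"
)
stats = dialogue_stats(sample)
print(stats)

# Reproducing the reported figures from the recorded counts.
reported_ratio = round(334 / 396 * 100, 2)
reported_deviation = round(abs(334 / 396 - 0.70) * 100, 2)
print(reported_ratio, reported_deviation)  # 84.34 14.34
```

The word-count check can be read the same way: the response has 396 words against a requested 500, a shortfall of 104 words.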