NC Bench
Overview
About
Tests
Models
Model
Gemini 3 Pro (Preview)
Test
Dialogue tags
Scenario
Write 500 words with 50% dialogue
Run ID
2025-11-19-01KACQRH03RZ3G411NZRVYP0VQ
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly slammed the architectural schematics onto the sticky dashboard of the surveillance van. The layout was drawn in purple crayon on the back of a placemat from authorized personnel only, which in this case meant a diner called O’Malley’s. She tapped the red circle near the patio doors with a manicured nail. This is the extraction point, Molly said, her voice grave. The target is turning six. That means maximum chaos, sticky hands, and a sugar crash window of approximately twenty minutes. Bill squinted at the drawing, nervously adjusting his black turtleneck. He looked less like a cat burglar and more like a beat poet who had lost his bongos. Are we sure about the perimeter defense? Bill asked. I heard they hired Bubbles. Molly rubbed her temples, feeling the start of a tension headache. Bubbles is a clown, Bill. He makes balloon animals. He is not a perimeter defense. Unless you have a paralyzing fear of latex dachshunds, he is a non-factor. Bill shuddered and looked out the window at the suburban backyard adorned with streamers. You never know what they keep in those oversized pockets, he whispered. Focus. The grand prize is sitting on the gift table right next to the gluten-free cupcakes. It is the Mega-Mech Dino Fortress. Limited edition. It retails for three hundred bucks on the black market. Or eBay. I thought we were going for the goody bags? The goody bags are amateur hour, Bill. Plastic kazoos and temporary tattoos do not pay the rent. The Fortress is the payload. Now, look here. The bouncy castle is located at the north end. That is a tactical nightmare, Bill said, tapping the paper. The fluctuating air pressure. The screaming. If I lose my footing in there, I am done for. You aren’t going in the castle. You are the wheelman. You wait by the rhododendrons. I go in disguised as Aunt Sheila. Do we actually know an Aunt Sheila? No, but every family has one. I have a floral distinctive muumuu and a sun hat large enough to block a satellite signal. It is foolproof. I walk in, pinch the cheeks of the birthday boy, tell him he has grown so much, and swipe the Fortress while everyone is singing. That song is a great auditory diversion, Bill nodded appreciatively. Classic timing. But what about the piñata? What about it? It is a variable, Molly. If they crack that papier-mâché donkey open before you secure the package, we are talking about a candy riot. A stampede. You could be trampled by thirty kids high on red dye number forty. I have accounted for the piñata. I replaced the bat with a pool noodle. They will be swinging at that thing until sunset. You really are a genius, Molly. She leaned back, a smirk playing on her lips. The sheer brilliance of the heist was intoxicating, mixed with the smell of stale coffee in the van. Just remember the signal. If I make the sound of a dying crow, you start the engine. Wait, Bill said. A crow? In this neighborhood? That is suspicious. Why not a lawnmower sound? Because I cannot make a lawnmower sound with my mouth, Bill. Just start the van when you hear the bird. Got it. One last thing. What? If there is extra cake... specifically the corner piece with the roses... No cake, Bill. We take the toys, we leave the carbs. Now mask up. It is party time.
created_at
"2025-11-19T00:21:11.391187+00:00"
Wednesday, November 19th, 2025 at 12:21:11 AM GMT+00:00
model
"google/gemini-3-pro-preview-20251117"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2535
moderation_latency
(null)
generation_time
21501
tokens_prompt
68
tokens_completion
1338
native_tokens_prompt
67
native_tokens_completion
2250
native_tokens_completion_images
0
native_tokens_reasoning
1492
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.027134
api_type
"completions"
id
"gen-1763511649-Yy7j7cAGK5kTCk7p26hy"
upstream_id
(null)
total_cost
0.027134
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
583
0.0000%
Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededClean
false
wordsTotal
583
wordsDialogue
0
0.0000%