NC Bench
Overview
About
Tests
Models
Model
Gemini 3 Flash (Preview)
Test
Dialogue tags
Scenario
Write 500 words with 30% dialogue
Run ID
2026-02-08-01KGYWX2MF9MHJMHNM9GHR6BR0
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The map spread across the greasy diner table was not a blueprint of a bank vault, but a crudely drawn diagram of a suburban backyard featuring a bouncy castle and a sandbox. Bill adjusted his spectacles, which were currently held together by a piece of duct tape and sheer willpower. He poked a stubby finger at a cluster of crayon scribbles. The primary objective is located in the solarium adjacent to the clown station, Bill whispered, his voice trembling with a gravity usually reserved for international espionage. We’re looking at forty, maybe fifty high-value targets. Lego sets, remote-controlled drones, and if the intel is correct, a vintage-style rocking horse with genuine leather upholstery. Molly took a long, loud slurp of her lukewarm milkshake. Bill, it’s a six-year-old’s birthday party. We aren't exactly infiltrating the Louvre. My biggest concern isn't the security system; it’s the sticky fingers. Last time I worked a toddler gig, I came home with apple juice in my holster. You underestimate the perimeter defense, Molly, Bill countered, sliding a grainy photograph across the laminate. Look at him. That’s Barnaby the Balloon Artist. He’s got eyes like a hawk and a pump-action balloon inflator that could take an ear off at twenty paces. If he spots us without a lanyard, the whole operation goes south. Molly leaned in, squinting at the photo. He’s wearing a polka-dot waistcoat, Bill. I think I can take him. What’s the extraction plan? We can’t exactly walk out the front gate carrying a six-foot-tall stuffed giraffe without raising a few eyebrows. That’s where the tactical stroller comes in, Bill said, his eyes gleaming with pride. I’ve gutted a double-seater jogger. It’s been reinforced with a false bottom and a silent-glide suspension. We roll in as weary parents, we load the cargo into the undercarriage, and we roll out under the cover of the ‘Pin the Tail on the Donkey’ festivities. And the getaway vehicle? Molly asked. Please tell me it isn't the minivan with the sliding door that sticks. It’s the minivan, Bill admitted, deflating slightly. But I’ve changed the plates to a ‘Baby on Board’ sign. It’s the perfect cloaking device. No cop in the tri-state area stops a vehicle that smells that strongly of discarded goldfish crackers. Molly sighed and checked her watch. Fine. I’ll handle the distraction. When the cake comes out, I’ll trigger the glitter bomb inside the piñata. In the ensuing chaos of sugar-crazed children and weeping parents, you move the heavy lifting. But I’m warning you, if I get invited to play duck-duck-goose, you’re on your own. Bill nodded solemnly, folding the map into a tiny square. Professionalism is key, Molly. We hit them hard, we hit them fast, and we leave before the magician starts his second act. Nobody suspects the parents in the back with the sensible shoes. They stood up in unison, leaving a five-dollar bill on the table as if they were closing a deal on a diamond mountain. At the door, Molly paused. Wait. One more thing. Did you remember the tactical party hats? Bill patted his vest pocket. Elastic chin straps and everything. We’re ready.
created_at
"2026-02-08T15:11:20.767535+00:00"
Sunday, February 8th, 2026 at 3:11:20 PM GMT+00:00
model
"google/gemini-3-flash-preview-20251217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
683
moderation_latency
(null)
generation_time
6485
tokens_prompt
61
tokens_completion
711
native_tokens_prompt
67
native_tokens_completion
727
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"STOP"
usage
0.0022145
router
(null)
provider_responses
0
status
200
is_byok
false
latency
683
endpoint_id
"6e82f652-2693-4b60-bbe0-7362d6e345f0"
provider_name
"Google AI Studio"
model_permaslug
"google/gemini-3-flash-preview-20251217"
api_type
"completions"
id
"gen-1770563473-Rc3qOv29TOJ5lPgoKXio"
upstream_id
(null)
total_cost
0.0022145
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Google AI Studio"
Evaluation details
Result
Evaluator
Details
Meta Data
0.0000%
Matches word count
n/a
neededClean
false
words
539
0.0000%
Dialogue to Total Word Ratio
Ratio: 2.04%, Deviation: 27.96%
neededClean
false
wordsTotal
539
wordsDialogue
11
0.0000%