NC Bench
Model: Gemini 3 Flash (Preview)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Run ID: 2026-02-08-01KGYWX7J4DX0NP5T06VC0DEPT
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill adjusted his monocle, which was actually a piece of a broken magnifying glass taped to his cheek. It’s a bold target, he said, pointing at a crayon-drawn map of the suburban backyard. Is it high risk? Absolutely. Is it high reward? Bill, it’s a seven-year-old’s birthday, Molly interrupted, sliding a pair of tactical knitting needles into her belt. The reward is Lego sets and maybe a motorized Barbie jeep. We are stealing from a child. Not just a child, Bill corrected, pacing the length of their hideout, which was a damp crawlspace under a dry cleaner. We are liberating goods from a tyrant named Timmy. I heard he doesn’t even share his juice boxes. My heart bleeds for the proletariat, Molly said, checking the battery on her illicitly modified bubble blower. How do we bypass the perimeter? The grandmother is the primary obstacle, Bill noted, tapping a grainy photo of an elderly woman holding a plate of brownies. She’s vigilant. She has a sense of smell like a bloodhound and a lethal grip on a spatula. How do we take her out? We don’t take her out, we distract her with talk of rising knitting wool prices, Molly suggested. While she’s distracted, I’ll slip through the bouncy castle. The bouncy castle is a death trap, Bill whispered, his eyes wide. One wrong hop and you’re trapped in a vinyl abyss. I’ve trained for this on my neighbor’s trampoline, Molly replied. What’s the status of the getaway vehicle? I’ve secured the tricycle with the oversized basket, Bill said proudly. It’s parked behind the hydrangea bushes. Is it fast? It’s aerodynamic if the wind is at my back and I don’t hit any rogue pebbles. Molly sighed, checking her watch. The cake cutting is at two. That’s our window. When the sugar rush hits, the parents will be distracted by the screaming. That’s when we move for the gift pile. What if there’s a clown? Bill asked, his voice trembling slightly. We discussed this, Bill. If there’s a clown, you use the pepper spray. But it’s a professional clown! 
Then use the professional pepper spray. Molly grabbed her gear and headed for the exit. Are you ready for the heist of the century? Bill straightened his stained vest and grabbed a bag of marbles. For the Legos, he declared. For the Barbie jeep, she corrected. They crawled out into the sunlight, squinting at the bright balloons of the enemy territory. Do you think they’ll have those little cocktail sausages? Bill asked. Focus, Bill, Molly hissed. We’re professional thieves, not snack enthusiasts. I can be two things, Molly. I can be two things.
created_at: "2026-02-08T15:11:23.102754+00:00" (Sunday, February 8th, 2026 at 3:11:23 PM GMT+00:00)
model: "google/gemini-3-flash-preview-20251217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 724
moderation_latency: (null)
generation_time: 5350
tokens_prompt: 61
tokens_completion: 581
native_tokens_prompt: 67
native_tokens_completion: 584
native_tokens_completion_images: 0
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "STOP"
usage: 0.0017855
router: (null)
provider_responses: 0
status: 200
is_byok: false
latency: 724
endpoint_id: "6e82f652-2693-4b60-bbe0-7362d6e345f0"
provider_name: "Google AI Studio"
model_permaslug: "google/gemini-3-flash-preview-20251217"
api_type: "completions"
id: "gen-1770563477-z4VO3XcGw4WzyPA3wb62"
upstream_id: (null)
total_cost: 0.0017855
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Google AI Studio"
Evaluation details
Result | Evaluator | Details | Meta Data
0.0000% | Matches word count | n/a | neededClean: false, words: 444
0.0000% | Dialogue to Total Word Ratio | Ratio: 0.00%, Deviation: 50.00% | neededClean: false, wordsTotal: 444, wordsDialogue: 0
0.0000%
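The dialogue figures above (0 dialogue words out of 444, a 50.00% deviation from the 50% target) are consistent with an evaluator that counts only words enclosed in quotation marks. The response as recorded here contains no quotation marks. The Python below is a minimal sketch under that assumption, not NC Bench's actual evaluator: the function names `dialogue_ratio` and `deviation`, the quote-matching pattern, the whitespace-based word count, and the absolute-difference deviation formula are all hypothetical.

```python
import re

# Assumption: dialogue is any text between paired double quotes,
# either straight ("...") or curly (“...”). Single quotes are ignored
# so that apostrophes in words like "It’s" are not treated as dialogue.
QUOTED = re.compile(r'"([^"]*)"|“([^”]*)”')


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a prose passage."""
    matches = QUOTED.findall(text)
    # Each match is a (straight, curly) tuple; exactly one side is non-empty.
    dialogue_words = sum(len((a or b).split()) for a, b in matches)
    # Assumption: words are whitespace-separated tokens.
    total_words = len(text.split())
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


def deviation(ratio: float, target: float = 0.5) -> float:
    """Absolute distance between the measured ratio and the target ratio."""
    return abs(target - ratio)


if __name__ == "__main__":
    quoted = 'Bill sighed. "Focus, Bill," Molly hissed.'
    unquoted = "It’s a bold target, he said."
    for sample in (quoted, unquoted):
        words_dialogue, words_total, ratio = dialogue_ratio(sample)
        print(words_dialogue, words_total, f"{ratio:.2%}", f"{deviation(ratio):.2%}")
```

Under these assumptions, a passage whose speech is never wrapped in quotation marks yields a ratio of zero and a deviation equal to the full target, whatever share of its words is actually spoken.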