Model

Gemini 3 Flash (Preview)

Test

Dialogue tags

Scenario

Write 500 words with 50% dialogue

Run ID

2026-02-08-01KGYWX7J4DX0NP5T06VC0DEPT

Model input

User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Bill adjusted his monocle, which was actually a piece of a broken magnifying glass taped to his cheek. It’s a bold target, he said, pointing at a crayon-drawn map of the suburban backyard. Is it high risk? Absolutely. Is it high reward? Bill, it’s a seven-year-old’s birthday, Molly interrupted, sliding a pair of tactical knitting needles into her belt. The reward is Lego sets and maybe a motorized Barbie jeep. We are stealing from a child. Not just a child, Bill corrected, pacing the length of their hideout, which was a damp crawlspace under a dry cleaner. We are liberating goods from a tyrant named Timmy. I heard he doesn’t even share his juice boxes. My heart bleeds for the proletariat, Molly said, checking the battery on her illicitly modified bubble blower. How do we bypass the perimeter? The grandmother is the primary obstacle, Bill noted, tapping a grainy photo of an elderly woman holding a plate of brownies. She’s vigilant. She has a sense of smell like a bloodhound and a lethal grip on a spatula. How do we take her out? We don’t take her out, we distract her with talk of rising knitting wool prices, Molly suggested. While she’s distracted, I’ll slip through the bouncy castle. The bouncy castle is a death trap, Bill whispered, his eyes wide. One wrong hop and you’re trapped in a vinyl abyss. I’ve trained for this on my neighbor’s trampoline, Molly replied. What’s the status of the getaway vehicle? I’ve secured the tricycle with the oversized basket, Bill said proudly. It’s parked behind the hydrangea bushes. Is it fast? It’s aerodynamic if the wind is at my back and I don’t hit any rogue pebbles. Molly sighed, checking her watch. The cake cutting is at two. That’s our window. When the sugar rush hits, the parents will be distracted by the screaming. That’s when we move for the gift pile. What if there’s a clown? Bill asked, his voice trembling slightly. We discussed this, Bill. If there’s a clown, you use the pepper spray. But it’s a professional clown! Then use the professional pepper spray. Molly grabbed her gear and headed for the exit. Are you ready for the heist of the century? Bill straightened his stained vest and grabbed a bag of marbles. For the Legos, he declared. For the Barbie jeep, she corrected. They crawled out into the sunlight, squinting at the bright balloons of the enemy territory. Do you think they’ll have those little cocktail sausages? Bill asked. Focus, Bill, Molly hissed. We’re professional thieves, not snack enthusiasts. I can be two things, Molly. I can be two things.
created_at"2026-02-08T15:11:23.102754+00:00" Sunday, February 8th, 2026 at 3:11:23 PM GMT+00:00
model"google/gemini-3-flash-preview-20251217" 
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency724
moderation_latency(null)
generation_time5350
tokens_prompt61
tokens_completion581
native_tokens_prompt67
native_tokens_completion584
native_tokens_completion_images0
native_tokens_reasoning0
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/" 
is_byokfalse
finish_reason"stop" 
native_finish_reason"STOP" 
usage0.0017855
router(null)
provider_responses
0
status200
is_byokfalse
latency724
endpoint_id"6e82f652-2693-4b60-bbe0-7362d6e345f0" 
provider_name"Google AI Studio" 
model_permaslug"google/gemini-3-flash-preview-20251217" 
api_type"completions" 
id"gen-1770563477-z4VO3XcGw4WzyPA3wb62" 
upstream_id(null)
total_cost0.0017855
cache_discount(null)
upstream_inference_cost0
provider_name"Google AI Studio" 

Evaluation details

Result Evaluator Details Meta Data
0.0000% Matches word count n/a
neededCleanfalse
words444
0.0000% Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 50.00%
neededCleanfalse
wordsTotal444
wordsDialogue0
0.0000%