Model

Gemini 3 Flash (Preview)

Test

Dialogue tags

Scenario

Write 500 words with 70% dialogue

Run ID

2026-02-08-01KGYX8898FQEY819TVNFHZ1CP

Model input

User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Bill, if you suggest the Trojan Horse strategy one more time, I’m going to shove you into a giant piñata myself. But it’s classic, Molly. It’s elegant. It’s historical. I’m inside a six-foot papier-mâché donkey, they haul me into the living room, and then, boom, I emerge like a vengeful, candy-covered specter. You’re six-foot-two, Bill. You’d look less like a donkey and more like a llama with a spinal deformity. Plus, the guest list includes twenty caffeinated seven-year-olds with wooden sticks. You’d be beaten into a pulp before the cake is even cut. I have a high pain threshold for the right price. That Lego Millennium Falcon alone is worth ninety bucks on the secondary market. We are not getting assaulted by minors for ninety dollars. Look at the blueprints. This is a drawing of a bounce house on a napkin. It’s an aerial tactical map. The target is the gift table in the sunroom. Entry point is the dog flap in the kitchen. I told you, I don't fit through dog flaps since the incident in Brussels. My hips are too majestic. Then we go with the Catering Gambit. Put the mustache on. It’s itchy. Why do I have to be the waiter? I have a natural air of command. I should be the illusionist. I can do the thing where I pull a coin from behind an ear. You pulled a nickel from a security guard’s ear during the Louvre job and we almost spent ten years in a French prison. You are not a magician. You are a server of juice boxes. I hate juice boxes. They’re structurally unsound. One squeeze and you’re wearing fruit punch for the rest of the heist. Focus. While you’re distracting the mothers with a tray of organic kale puffs, I’ll be under the table. I’ve developed a specialized grappling hook made out of coat hangers and dental floss. To steal teddy bears? To steal the electronics, Bill! Little Timmy is turning eight. Eight-year-olds in this zip code get tablets. They get high-end drones. They get cold, hard plastic. What about the clown? What clown? The dossier says they hired Barnaby the Baffler. He’s a professional. He’ll spot a fake waiter in seconds. Clowns have a sixth sense for intruders. It’s the greasepaint. I’ll take out the clown. With what? A nap. I’ve laced a balloon animal kit with extra-strength sedatives. He goes for a poodle, he’s out for three hours. That’s cold, Molly. Even for us. This is the big leagues, Bill. There’s a bouncy castle, a chocolate fountain, and forty-five boxes of unopened loot. We go in at two, we’re out by cake. And if the birthday boy sees us? We tell him we’re the Federal Bureau of Birthday Inspections. Now, put on the apron. We have a heist to pull.
created_at"2026-02-08T15:17:26.218434+00:00" Sunday, February 8th, 2026 at 3:17:26 PM GMT+00:00
model"google/gemini-3-flash-preview-20251217" 
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency789
moderation_latency(null)
generation_time7029
tokens_prompt61
tokens_completion629
native_tokens_prompt67
native_tokens_completion674
native_tokens_completion_images0
native_tokens_reasoning0
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/" 
is_byokfalse
finish_reason"stop" 
native_finish_reason"STOP" 
usage0.0020555
router(null)
provider_responses
0
status200
is_byokfalse
latency789
endpoint_id"6e82f652-2693-4b60-bbe0-7362d6e345f0" 
provider_name"Google AI Studio" 
model_permaslug"google/gemini-3-flash-preview-20251217" 
api_type"completions" 
id"gen-1770563838-YkA2mdQxnYJAeGZj5NbJ" 
upstream_id(null)
total_cost0.0020555
cache_discount(null)
upstream_inference_cost0
provider_name"Google AI Studio" 

Evaluation details

Result Evaluator Details Meta Data
35.0022% Matches word count n/a
neededCleanfalse
words482
0.0000% Dialogue to Total Word Ratio
Ratio: 0.00%, Deviation: 70.00%
neededCleanfalse
wordsTotal482
wordsDialogue0
17.5011%