Run: 2026-05-20-01KS25VPCWH53SJ3F59PREK3TM

Run Details

Model:: Gemini 3.5 Flash (Reasoning, Minimal)
Test:: Dialogue tags
Scenario:: Write 200 words with 10% dialogue
Duration:: 2.4s
Cost:: $0.002341
Provider:: Google AI Studio

Evaluation summary

Score	Evaluator	Details Link
93.65%	Matches word count	(details)
36.79%	Dialogue to Total Word Ratio	(details)
65.2188%	Overall
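
The Overall figure looks consistent with an unweighted mean of the two evaluator scores. That is an inference from the numbers shown, not something the report states, and the helper name `mean_score` below is hypothetical. A minimal sketch of the check:

```python
def mean_score(scores):
    """Unweighted arithmetic mean of evaluator scores (assumed aggregation rule)."""
    return sum(scores) / len(scores)


# Scores as displayed in the summary table, already rounded to two decimals.
evaluator_scores = [93.65, 36.79]
reported_overall = 65.2188

overall = mean_score(evaluator_scores)

# The inputs are rounded, so compare with a small tolerance instead of exactly.
print(f"computed {overall:.2f}% vs reported {reported_overall}%")
print("consistent with a plain mean:", abs(overall - reported_overall) < 0.01)
```

The small gap between the computed and reported values is what would be expected if the report averages unrounded scores and rounds only for display.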

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

The afternoon sun beat down on the rusty getaway minivan, which smelled faintly of onion dip and desperation. Molly gestured aggressively at a meticulously drawn, crayon-colored map of the target backyard, her fingernails tapping against the cardboard. Bill sat in the driver’s seat, struggling to squeeze his enormous frame into a fluorescent yellow safety vest that clearly belonged to a much smaller volunteer crosswalk guard. He adjusted his papier-mache toddler mask, his eyes blinking nervously through the crudely cut eyeholes. The stakes were incredibly low, yet the tension in the vehicle was stifling. Underneath Molly’s seat lay eighteen decoy juice boxes and a stash of expensive noise-canceling headphones meant to withstand the shrieks of twenty caffeinated seven-year-olds. She checked her watch, her jaw clenched with professional intensity. Success depended entirely on their ability to blend in with the distracted parents during the chaotic climax of the pinata devastation.

We strike when they cut the cake, Molly whispered.

Bill adjusted his vest. Are we sure about the red tricycle?

It’s vintage, Bill, Molly snapped, tapping the map. Focus.

She slid the side door open, bracing herself for the impending wall of sound.

Evaluation details

Result	Evaluator	Details	Meta Data
93.65%	Matches word count	n/a	neededClean: false; words: 191
36.79%	Dialogue to Total Word Ratio	Ratio: 0.00%, Deviation: 10.00%	neededClean: false; wordsTotal: 196; wordsDialogue: 0
65.2188%	Overall
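
The spoken lines in the model response carry no quotation marks. That would explain `wordsDialogue 0` if the evaluator detects dialogue by quote delimiters. That detection rule is an assumption; the report does not describe how the evaluator works. The names `QUOTED` and `dialogue_ratio` below are hypothetical. A rough sketch of such a counter:

```python
import re

# Assumption: dialogue is whatever sits between straight or curly double quotes.
QUOTED = re.compile(r'["\u201c]([^"\u201d]*)["\u201d]')


def dialogue_ratio(text):
    """Return the share of whitespace-separated words that fall inside quotes."""
    total = len(text.split())
    if total == 0:
        return 0.0
    dialogue = sum(len(span.split()) for span in QUOTED.findall(text))
    return dialogue / total


# Unquoted speech, as in the model response: nothing is counted as dialogue.
print(dialogue_ratio("We strike when they cut the cake, Molly whispered."))

# The same kind of line with quotation marks is counted.
print(dialogue_ratio('"We strike," Molly whispered.'))
```

Under this assumed rule, a response that wrote its dialogue in quotation marks would register a nonzero ratio, so the low score here may reflect formatting rather than a true absence of dialogue.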