Run: 2026-05-10-01KR8W05XZ00AME8MPJXTFP92K

Run Details

Model:: Gemini 3.1 Flash Lite (Reasoning)
Test:: Dialogue tags
Scenario:: Write 200 words with 10% dialogue
Duration:: 2.3s
Cost:: $0.000390
Provider:: Google AI Studio

Evaluation summary

Score	Evaluator	Details Link
99.98%	Matches word count	(details)
36.79%	Dialogue to Total Word Ratio	(details)
68.3860%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

The suburban cul-de-sac was currently infested with pint-sized agents of chaos, their sugary shrieks echoing against the brick facade of number forty-two. Molly adjusted her tactical balaclava, which was inexplicably patterned with cartoon daisies, and flattened herself against a manicured hedge. Beside her, Bill looked ridiculous in a neon vest, clutching a blueprint drawn entirely in glitter glue. He had acquired this intelligence from a seven-year-old informant known only as Timmy the Terror. The objective was clear: secure the mountain of wrapped boxes currently guarded by a clown with a suspicious stutter. They had practiced their extraction route through the bouncy castle maze for weeks. Molly checked her watch, the ticking sound muffled by the rhythmic thumping of pop music filtering through the open patio doors. The target perimeter remained vulnerable, though a stray golden retriever was currently sniffing Bill’s boots with investigative intensity. They had one shot before the cake ceremony triggered a security lockdown of sugar-fueled toddlers. Bill nudged her shoulder, his eyes wide behind his plastic mask. Ready to liberate the haul? Molly nodded, fingers tightening around her miniature crowbar. Let us secure the loot. They lunged forward, blending into the frantic whirlwind of children.

Evaluation details

Result

Evaluator

Details

Meta Data

99.98%

Matches word count

n/a

neededClean	false
words	198

36.79%

Dialogue to Total Word Ratio

Ratio: 0.00%, Deviation: 10.00%

neededClean	false
wordsTotal	205
wordsDialogue	0

68.3860%