Run: 2026-02-25-01KJ94YSGN513YVRBQ3RZRJYHZ

Run Details

Model:: Gemini 2.5 Flash Lite (Reasoning)
Test:: Dialogue tags
Scenario:: Write 200 words with 50% dialogue
Duration:: 9.5s
Cost:: $0.000773
Provider:: Google AI Studio

Evaluation summary

Score	Evaluator	Details Link
60.28%	Matches word count	(details)
0.00%	Dialogue to Total Word Ratio	(details)
30.1378%	Overall
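
The Overall figure is consistent with an unweighted mean of the two evaluator scores. The report does not state how Overall is computed, so the averaging rule below is an assumption, and the small gap from the reported value is presumably because the displayed 60.28% is rounded. A minimal sketch of that check:

```python
# Scores as displayed in the summary table (percent).
scores = {
    "Matches word count": 60.28,
    "Dialogue to Total Word Ratio": 0.00,
}
reported_overall = 30.1378

# Assumption: Overall is the unweighted mean of the evaluator scores.
# The report itself does not document the aggregation rule.
mean_score = sum(scores.values()) / len(scores)
difference = abs(mean_score - reported_overall)

print(f"mean = {mean_score:.4f}%, reported = {reported_overall}%")
print(f"difference = {difference:.4f} percentage points")
```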

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

Molly tapped a smudged diagram drawn on the back of a Pizza Palace menu. "Operation: Birthday Bonanza is a go. Timmy's seventh. Max present accumulation window opens precisely at three PM."

Bill squinted, his perpetually hopeful eyes tracing the greasy lines. "So, we crash the party, snag the loot, and vanish like a fart in the wind?"

"Subtlety, Bill. We're ninjas in disguise, not rampaging toddlers." Molly adjusted her ill-fitting fake glasses. "My intel suggests the prime loot is near the sofa, piled high. We need a distraction. I've procured props."

"Ooh, props!" Bill bounced in his seat. "Am I a giant, menacing bunny rabbit?"

"You're a clown. A balloon-twisting, face-painting terror. While you're busy terrifying the small ones and promising them cheap plastic trinkets, I'll be liberating the wrapped treasures. Focus on anything shimmering." Molly ignored Bill’s sudden attempt to juggle two sugar packets, which promptly exploded. "No juggling. Just clowning."

Bill looked crestfallen. "But what about the cake?"

Molly sighed, already calculating the escape route. "Priorities, Bill. Presents first, cake second. Or maybe not at all. It's a heist, not a five-star buffet."

Evaluation details

Result	Evaluator	Details	Meta Data
60.28%	Matches word count	n/a	neededClean: false, words: 185
0.00%	Dialogue to Total Word Ratio	Ratio: 68.78%, Deviation: 18.78%	neededClean: false, wordsTotal: 189, wordsDialogue: 130
30.1378%	Overall
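
The reported ratio and deviation follow directly from the word counts in the Meta Data. The sketch below reproduces that arithmetic and shows one possible way to count dialogue words. The `dialogue_stats` helper is hypothetical: it assumes dialogue is anything between straight double quotes and that words are whitespace-separated, which may differ from the evaluator's actual tokenization. The mapping from an 18.78% deviation to a 0.00% score is not documented in this report and is not modeled here.

```python
import re


def dialogue_stats(text: str) -> tuple[int, int]:
    """Return (total_words, dialogue_words).

    Hypothetical helper: treats text inside straight double quotes as
    dialogue and splits words on whitespace. The real evaluator may
    tokenize differently.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    return total, dialogue


# Small sample taken from the model response.
sample = 'Bill looked crestfallen. "But what about the cake?"'
sample_total, sample_dialogue = dialogue_stats(sample)

# Counts copied from the Meta Data reported for this run.
words_total = 189
words_dialogue = 130
target_ratio = 0.50  # the scenario asks for 50% dialogue

ratio = words_dialogue / words_total
deviation = abs(ratio - target_ratio)
summary = f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}"
print(summary)
```

With the reported counts, this prints the same ratio and deviation shown in the Details column.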