Run: 2026-05-10-01KR93DHS53NYBAXH4KG1M8GAW

Run Details

Model:: Gemma 4 26B (Reasoning)
Test:: Dialogue tags
Scenario:: Write 200 words with 90% dialogue
Duration:: 2m 33s
Cost:: $0.002612
Provider:: Cloudflare

Evaluation summary

Score	Evaluator	Details Link
43.38%	Matches word count	(details)
80.84%	Dialogue to Total Word Ratio	(details)
62.1075%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

"Listen, Bill," Molly whispered. "The target is the Miller residence. Three o'clock sharp. Theme: dinosaurs."

"Dinosaurs? Is that a code for something? Like, high-stakes prehistoric fossils?" Bill asked.

"No, Bill. It's a six-year-old's birthday party. We move during the juice box lull."

"The juice box lull? That sounds tactical. Highly professional. I love a good lull."

"We hit the pile under the decorated tree. We grab the electronics, the Lego sets, anything with high resale value. Avoid the glitter. It's a tactical nightmare. It gets everywhere. It sticks to the soul, Bill."

"What about the bouncy castle?" Bill squinted. "Can we use it as a high-velocity extraction point?"

"No, Bill! We're not bouncing out of there. We slip in through the sliding glass door, bag the loot, and vanish before the cake is cut. It has to be clean."

"And if the birthday boy sees us?"

"He's five, Bill. He'll think we're the entertainment. Just keep your mask on and for the love of god, do not eat the cupcakes. We are professionals."

"Right. Professionals. No cupcakes. Got it. I am a professional."

Evaluation details

Result

Evaluator

Details

Meta Data

43.38%

Matches word count

n/a

neededClean	false
words	183

80.84%

Dialogue to Total Word Ratio

Ratio: 96.79%, Deviation: 6.79%

neededClean	false
wordsTotal	187
wordsDialogue	181

62.1075%