Run: 2026-05-10-01KR91ZZWJJVNWYP225WN7TF9S

Run Details

Model:: Gemma 4 26B
Test:: Dialogue tags
Scenario:: Write 200 words with 90% dialogue
Duration:: 5.8s
Cost:: $0.000108
Provider:: DeepInfra

Evaluation summary

Score	Evaluator	Details Link
99.74%	Matches word count	(details)
96.08%	Dialogue to Total Word Ratio	(details)
97.9112%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

"Is that a clown nose in your pocket, Bill?" Molly whispered, glaring at him through the binoculars.

"It’s tactical camouflage!" Bill hissed back. "Nobody suspects the birthday clown."

"We are here for the high-end electronics and designer LEGO sets, not to juggle for toddlers."

"I’m just saying, the element of surprise is key. If I honk this horn at the right moment, they’ll be too distracted to notice us emptying the gift table."

"You are a liability. We sneak in through the patio, grab the loot, and vanish before the cake is cut."

"But what about the piñata? It’s a goldmine of premium confectionery."

"No piñata heists, Bill! We are professionals, not sugar-addicts."

"Fine, fine. But if a kid asks me to make a balloon animal, I’m not promising anything."

"Just stay focused. Do you have the heavy-duty gift bags?"

"Got 'em. Extra large, reinforced handles, very discreet."

"And the getaway tricycle?"

"It’s a customized mountain bike with a basket. Very fast. Very stealthy."

"If we get caught because you’re riding a tricycle, I am leaving you to explain the clown nose to the parents."

"Deal. Now, let’s go. I think the bouncy castle is starting."

Evaluation details

Result

Evaluator

Details

Meta Data

99.74%

Matches word count

n/a

neededClean	false
words	196

96.08%

Dialogue to Total Word Ratio

Ratio: 94.47%, Deviation: 4.47%

neededClean	false
wordsTotal	199
wordsDialogue	188

97.9112%