Run: 2026-03-12-01KKHZM11J2EG55S444P22496N

Run Details

Model:: Grok 4.20 (Beta)
Test:: Dialogue tags
Scenario:: Write 200 words with 50% dialogue
Duration:: 2.1s
Cost:: $0.001748
Provider:: xAI

Evaluation summary

Score	Evaluator
97.63%	Matches word count
0.00%	Dialogue to Total Word Ratio
48.8138%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

Molly adjusted her fake mustache and peered through the hedge at the bouncing castle. "This is insane, Bill. We're not stealing from a bank, we're crashing a kid's party. What if they have a piñata guard?"

Bill grinned, checking his clown wig. "Relax, Mol. We go in as entertainers. I juggle, you do face painting. While the little monsters are distracted by cake, we bag the presents. Simple."

"Simple? There's like thirty screaming six-year-olds in there," Molly hissed. "One of them looks like he could take us both. And is that a magician? We hate magicians."

"Exactly why it's perfect," Bill whispered. "Nobody suspects the clowns. We'll blend right in. You grab the gift table, I'll cause a distraction with my balloon animals. Who doesn't love balloon animals?"

"I don't love balloon animals, Bill. And what about the parents? They're not idiots."

"Parents? They'll be too busy filming everything for Instagram. By the time they notice the presents are gone, we'll be halfway to the ice cream truck with enough loot to retire."

Molly sighed. "If we get caught because some kid wants his face painted like a dinosaur, I'm blaming you."

"Deal. Now quit whining and practice your silly voice. Time to steal Christmas... in June."

Evaluation details

Result	Evaluator	Details	Meta Data
97.63%	Matches word count	n/a	neededClean: false; words: 207
0.00%	Dialogue to Total Word Ratio	Ratio: 87.56%, Deviation: 37.56%	neededClean: false; wordsTotal: 209; wordsDialogue: 183
48.8138%	Overall
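
The dialogue figures above can be checked by hand: 183 dialogue words out of 209 total gives a ratio of about 87.56%, which is 37.56 percentage points above the 50% target. The sketch below reproduces that arithmetic and shows one possible way to count dialogue words. The function name `dialogue_ratio`, the rule "words inside straight double quotes count as dialogue", and whitespace tokenization are all assumptions made for illustration; the report does not describe how the evaluator actually counts, and the two evaluators themselves report different totals (207 and 209), so counts from this sketch may not match either.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a scene.

    Assumption: dialogue is any text between straight double quotes,
    and a "word" is any whitespace-separated token. This is NOT the
    benchmark's documented method, only an illustrative stand-in.
    """
    total_words = len(text.split())
    # Collect every "..." span, then count the tokens inside each one.
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return dialogue_words, total_words, ratio


# Reproduce the reported numbers from the meta data in this report.
reported_ratio = 183 / 209                 # wordsDialogue / wordsTotal
deviation = abs(reported_ratio - 0.50)     # distance from the 50% target

print(f"Ratio: {reported_ratio:.2%}")      # Ratio: 87.56%
print(f"Deviation: {deviation:.2%}")       # Deviation: 37.56%
```

For example, `dialogue_ratio('Molly sighed. "Run, Bill."')` returns `(2, 4, 0.5)` under these assumptions. How the 37.56-point deviation maps to a 0.00% score is not stated in the report, so the sketch stops at the ratio and deviation.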