Run: 2026-02-18-01KHQ922A8VRKC33PSHX2ZH1TY

Run Details

Model:: Claude Sonnet 4.6
Test:: Dialogue tags
Scenario:: Write 200 words with 90% dialogue
Duration:: 9.1s
Cost:: $0.005430
Provider:: Anthropic

Evaluation summary

Score	Evaluator	Details Link
51.93%	Matches word count	(details)
93.42%	Dialogue to Total Word Ratio	(details)
72.6704%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

"I still can't believe we're doing this," Molly whispered, peering over the garden fence.

"It's a victimless crime," Bill said.

"They're seven year olds, Bill."

"Who have too many presents. We're basically teaching them about loss. It's character building."

"Is that what we're calling it now?"

"We're philanthropists."

"We're wearing black turtlenecks outside a bouncy castle."

"Camouflage."

"It's a pink bouncy castle."

"Then we're slightly visible philanthropists. Now, the presents are by the clown."

"There's a clown?"

"Don't worry about the clown."

"Bill, I'm absolutely worrying about the clown."

"He's between us and forty wrapped gifts. Stay focused."

"What's in the bags anyway?"

"Lego, mostly. Some dolls. A remote control car that I actually want for myself."

"This is rock bottom."

"Rock bottom was the casino job."

"Fair point."

"On three we go over the fence, grab the presents, ignore the clown, and run."

"And if someone spots us?"

"Tell them we're the entertainment."

"We're not the entertainment."

"Then run faster. Ready?"

"Absolutely not."

"One, two—"

"Wait, is that clown looking at us?"

"He's always looking at us, Molly. That's what clowns do. Go."

Evaluation details

Result

Evaluator

Details

Meta Data

51.93%

Matches word count

n/a

neededClean	false
words	184

93.42%

Dialogue to Total Word Ratio

Ratio: 95.11%, Deviation: 5.11%

neededClean	false
wordsTotal	184
wordsDialogue	175

72.6704%