Run: 2026-02-18-01KHQ94JMGN9JKCFKVRAMJ264P

Run Details

Model:: Claude Sonnet 4.6
Test:: Dialogue tags
Scenario:: Write 200 words with 50% dialogue
Duration:: 8.0s
Cost:: $0.004695
Provider:: Anthropic

Evaluation summary

Score	Evaluator	Details Link
86.38%	Matches word count	(details)
32.20%	Dialogue to Total Word Ratio	(details)
59.2909%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

Molly pressed her back against the garden fence, clutching a clipboard like she was coordinating a military operation rather than robbing a six year old's birthday party.

"The bouncy castle is our biggest obstacle," she whispered, sketching a rough diagram with her pen.

"How is a bouncy castle an obstacle?" Bill asked.

"Because you'll want to go on it. I know you, Bill."

Bill opened his mouth to object, then closed it again. She had a point.

"The presents are stacked by the gazebo," Molly continued, tapping her clipboard. "We go in during musical chairs. Everyone watches musical chairs."

"What about the clown?"

"What about him?"

"He's been staring at me for ten minutes."

Molly glanced over the fence. The clown was indeed staring directly at Bill with an expression of profound suspicion.

"Clowns always look like that," she said.

"That one has a walkie talkie."

Molly looked again. He absolutely did.

"Okay, new plan," she said.

"Run away?"

"Run away," she confirmed.

They both sprinted down the street, Bill somehow still managing to glance longingly back at the bouncy castle as the clown calmly radioed in their description.

Evaluation details

Result

Evaluator

Details

Meta Data

86.38%

Matches word count

n/a

neededClean	false
words	189

32.20%

Dialogue to Total Word Ratio

Ratio: 39.68%, Deviation: 10.32%

neededClean	false
wordsTotal	189
wordsDialogue	75

59.2909%