Run: 2026-04-24-01KQ0G57D0D4W33WRPC16A58FN

Run Details

Model:: Claude Opus 4.7 (Reasoning)
Test:: Dialogue tags
Scenario:: Write 200 words with 90% dialogue
Duration:: 7.2s
Cost:: $0.011965
Provider:: Amazon Bedrock

Evaluation summary

Score	Evaluator	Details Link
81.27%	Matches word count	(details)
95.20%	Dialogue to Total Word Ratio	(details)
88.2372%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

Molly unfolded the blueprint across the hood of the van.

"Okay, Bill. Bouncy castle at eleven o'clock, clown at two o'clock, piñata—dead center."

"What about the kids?"

"What about them?"

"They're, like, everywhere, Molly. They're a swarm. They're a biblical plague in sparkle shoes."

"That's why we wear the costumes."

"I'm not being a clown."

"You're being the clown, Bill. I already called princess."

"Why do you get to be princess?"

"Because I have the tiara, Bill. I have committed to the bit."

"Fine. What's the extraction plan?"

"Simple. You distract with balloon animals, I sweep the gift table, we meet at the minivan."

"I can't make balloon animals."

"Make a snake. A snake is a straight balloon, Bill. A snake is literally the balloon."

"And if someone asks for a giraffe?"

"Tell them the giraffe is extinct."

"Giraffes aren't extinct."

"They are at this party."

"What about the parents?"

"Drunk on boxed wine by one-fifteen. Window closes at two when the sugar crash hits."

"And if we get caught?"

"We cry, Bill. Loudly. Nobody arrests a sobbing princess."

"That's the whole plan?"

"That's the whole plan."

Evaluation details

Result

Evaluator

Details

Meta Data

81.27%

Matches word count

n/a

neededClean	false
words	188

95.20%

Dialogue to Total Word Ratio

Ratio: 94.71%, Deviation: 4.71%

neededClean	false
wordsTotal	189
wordsDialogue	179

88.2372%