Run: 2026-02-08-01KGYYHG2M5VP3HBVQ5XDEKZWR

Run Details

Model:: Claude Opus 4.5
Test:: Dialogue tags
Scenario:: Write 200 words with 90% dialogue
Duration:: 9.2s
Cost:: $0.008675
Provider:: Amazon Bedrock

Evaluation summary

Score	Evaluator	Details Link
27.17%	Matches word count	(details)
99.72%	Dialogue to Total Word Ratio	(details)
63.4413%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

Molly spread the crayon-drawn map across the sticky diner table.

"So the target is Timmy's seventh birthday party. Security is tight."

"How tight are we talking?" Bill leaned forward.

"One grandmother at the front door. She's got cataracts but hearing like a bat."

"Lovely. What about the presents?"

"Stacked in the living room. Behind the bounce house."

"There's a bounce house inside?"

"These people have money, Bill. That's why we're hitting them."

"Fair enough. Entry point?"

"Backyard gate. We go in dressed as clowns."

"I hate clowns."

"Everyone hates clowns. That's why nobody questions them. They just want them gone."

"That's actually brilliant."

"I know. Now, the tricky part is the dog."

"There's a dog?"

"Golden retriever. Name's Biscuit."

"Aggressive?"

"Aggressively friendly. He'll blow our cover with love."

"I'll bring treats."

"Bring many treats. This dog is a bottomless pit."

"What's our exit strategy?"

"The magician arrives at three. During his act, everyone watches. We load the van."

"We have a van?"

"We have my nephew's minivan. It has juice stains and smells like failure."

"Perfect cover."

"That's what I said."

Evaluation details

Result

Evaluator

Details

Meta Data

27.17%

Matches word count

n/a

neededClean	false
words	181

99.72%

Dialogue to Total Word Ratio

Ratio: 92.31%, Deviation: 2.31%

neededClean	false
wordsTotal	182
wordsDialogue	168

63.4413%