Run: 2026-02-08-01KGYXVK0CFRA8P20KRMS8R1B1

Run Details

Model:: Claude Opus 4.6
Test:: Dialogue tags
Scenario:: Write 200 words with 90% dialogue
Duration:: 9.5s
Cost:: $0.008275
Provider:: Amazon Bedrock

Evaluation summary

Score	Evaluator	Details Link
100.00%	Matches word count	(details)
99.99%	Dialogue to Total Word Ratio	(details)
99.9955%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

"So the party starts at two," Molly said, spreading a hand-drawn map across the diner table. "That gives us a thirty-minute window while the clown is doing balloon animals."

"I hate clowns," Bill muttered.

"Everyone hates clowns, Bill. Focus. The presents will be stacked on a folding table near the bouncy castle."

"A bouncy castle," Bill repeated, his eyes lighting up.

"No."

"I didn't say anything."

"You were thinking it. We are not bouncing. We are grabbing and going."

Bill leaned forward. "What's our cover?"

"You're delivering a second cake."

"There's already a cake?"

"It's a birthday party, Bill. There's always a cake."

"So I just walk in with another cake and nobody questions it?"

"Have you ever met a child who questioned more cake?"

"Fair point. What about you?"

"I'm the face painter."

"Can you paint faces?"

"I can paint a butterfly that looks like a butterfly if nobody looks too closely."

"So a moth."

"It's a children's party, not an art gallery. When the kids line up for cake two, I bag the presents and go out the back."

"This is either genius or we're going to prison over a pile of Legos."

"Both seem equally likely."

Evaluation details

Result

Evaluator

Details

Meta Data

100.00%

Matches word count

n/a

neededClean	false
words	199

99.99%

Dialogue to Total Word Ratio

Ratio: 89.05%, Deviation: 0.95%

neededClean	false
wordsTotal	201
wordsDialogue	179

99.9955%