Run: 2026-04-24-01KPZV6VM446Z7VFRH3BX0YVXD

Run Details

Model:: Claude Opus 4.7
Test:: Dialogue tags
Scenario:: Write 200 words with 10% dialogue
Duration:: 7.1s
Cost:: $0.010490
Provider:: Amazon Bedrock

Evaluation summary

Score	Evaluator	Details Link
97.63%	Matches word count	(details)
66.94%	Dialogue to Total Word Ratio	(details)
82.2861%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

Molly spread the blueprints across the sticky diner table, smoothing them with the side of her hand. The paper was covered in juice box rings and a suspicious smear of what appeared to be frosting, which was either reconnaissance or a terrible omen. Bill, hunched over his coffee like a gargoyle nursing a grudge, studied the layout of the Pemberton residence with the grim intensity of a man who had been outsmarted by a bounce house once and never recovered.

The party, according to their intel, would feature seventeen six-year-olds, a magician of questionable licensing, and approximately forty-two wrapped presents stacked in the sunroom like a glittering Everest of capitalism. Molly had already mapped three exits, two blind spots, and the precise location of the piñata, which she'd flagged as a potential weapon. Bill had dressed for the occasion in a clown costume he'd bought online, though the ruffled collar was, in his professional opinion, an affront to dignity.

"I hate children," Bill muttered.

"Focus," Molly said.

He pointed to the cake. She nodded. They had forty minutes until showtime, a duffel bag full of balloons, and absolutely no moral compass between them.

Evaluation details

Result

Evaluator

Details

Meta Data

97.63%

Matches word count

n/a

neededClean	false
words	193

66.94%

Dialogue to Total Word Ratio

Ratio: 2.04%, Deviation: 7.96%

neededClean	false
wordsTotal	196
wordsDialogue	4

82.2861%