Run: 2026-06-01-01KT1HP384CDMTCGPYMSG337YR

Run Details

Model:: Claude Opus 4.8 (Reasoning, Low)
Test:: Dialogue tags
Scenario:: Write 200 words with 50% dialogue
Duration:: 10.0s
Cost:: $0.013315
Provider:: Anthropic

Evaluation summary

Score	Evaluator	Details Link
99.74%	Matches word count	(details)
0.93%	Dialogue to Total Word Ratio	(details)
50.3375%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

Molly spread the crayon-drawn map across the table, smoothing its crumpled edges.

"The bouncy castle is our main obstacle," she said, tapping the wobbly blue rectangle. "Security's tight."

Bill squinted. "Security? They're six-year-olds."

"Six-year-olds with sugar-fueled reflexes, Bill. Don't underestimate them."

He scratched his chin, considering this. "And the presents?"

"Stacked by the gift table. We grab them during the piñata frenzy." She mimed swinging a bat. "Total chaos. Nobody watches the loot."

"Brilliant," Bill whispered. "But what about the clown?"

Molly's face darkened. "Pickles the Clown. He's the wildcard. Trained in balloon animals and, allegedly, hand-to-hand combat."

"You're joking."

"I never joke about clowns."

Bill swallowed nervously. "So what's the plan if Pickles spots us?"

"You distract him." Molly handed Bill a single red nose. "Blend in. Become one of them."

"Become a clown?" He held the nose like it was radioactive. "Absolutely not."

"It's that or face the parents, Bill. Have you ever seen a mother whose child's gift got stolen? They're feral."

He sighed, defeated, and snapped the nose onto his face.

"How do I look?"

Molly studied him seriously. "Terrifying. The children will weep."

"Perfect," Bill muttered. "Let's go ruin a birthday."

Evaluation details

Result

Evaluator

Details

Meta Data

99.74%

Matches word count

n/a

neededClean	false
words	196

0.93%

Dialogue to Total Word Ratio

Ratio: 64.71%, Deviation: 14.71%

neededClean	false
wordsTotal	204
wordsDialogue	132

50.3375%