Run: 2026-04-24-01KQ0PEVFWRJQDNVPNSZVQ5EBG

Run Details

Model:: GPT-5.5
Test:: Dialogue tags
Scenario:: Write 200 words with 10% dialogue
Duration:: 11.3s
Cost:: $0.009330
Provider:: OpenAI

Evaluation summary

Score	Evaluator	Details Link
75.16%	Matches word count	(details)
96.35%	Dialogue to Total Word Ratio	(details)
85.7507%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

Molly crouched behind the hedge, binoculars trained on the church hall where balloons bobbed like suspiciously cheerful sentries. Bill lay beside her in a borrowed clown suit, sweating glitter and regret.

Inside, twenty sugar-charged children circled a mountain of wrapped presents. The cake was taller than most of them and decorated with a pirate ship, which Molly considered insulting to professional criminals.

“Plan?” Bill whispered.

“Enter, distract, exit,” Molly said.

“With presents?”

“Obviously.”

Their getaway wagon waited in the alley, painted with the words FUNTIME PUPPET MINISTRY because Bill had misunderstood “low profile.” Molly adjusted her party hat and checked the inventory: smoke pellets, gift sacks, two juice boxes, and a small plastic kazoo for emergencies.

At precisely two o’clock, the magician arrived, tripped over his rabbit cage, and released four doves into the punch. Molly smiled. Chaos had RSVP’d.

They slipped through the side door. A toddler immediately handed Bill a paper crown and declared him King Banana. Bill bowed, which earned applause and sticky fingerprints.

“Focus,” Molly hissed.

“I’m beloved.”

Molly reached for the nearest present, but it began barking. The tag read: To Emma, Love Grandma. Air holes. Molly froze. Bill lifted another box; it meowed furiously.

Outside, sirens wailed. Molly sighed. Children’s parties, she reflected, had dangerously advanced security.

Evaluation details

Result	Evaluator	Details	Meta Data
75.16%	Matches word count	n/a	neededClean: false, words: 213
96.35%	Dialogue to Total Word Ratio	Ratio: 5.61%, Deviation: 4.39%	neededClean: false, wordsTotal: 214, wordsDialogue: 12
85.7507%	Overall
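
The reported ratio and deviation can be reproduced from the metadata counts. The sketch below is an illustration, not the evaluator's actual code: the function names are hypothetical, the 10% target comes from the scenario, and the deviation is assumed to be the absolute difference between target and ratio. The quote-based counter is likewise an assumption. It treats every span inside curly double quotes as dialogue, which yields 12 words for the response above only if the quoted phrase “low profile” in the narration is included.

```python
import re

TARGET_PCT = 10.0  # dialogue share requested in the scenario


def ratio_and_deviation(words_dialogue, words_total, target_pct=TARGET_PCT):
    """Return (ratio %, deviation %) rounded to two decimals.

    Assumes deviation is |target - ratio|; the report does not state the formula.
    """
    ratio = 100.0 * words_dialogue / words_total
    deviation = abs(target_pct - ratio)
    return round(ratio, 2), round(deviation, 2)


def count_dialogue_words(text):
    """Count whitespace-separated words inside curly double quotes.

    Hypothetical counting rule: any quoted span counts, spoken or not.
    """
    spans = re.findall(r"“([^”]*)”", text)
    return sum(len(span.split()) for span in spans)


if __name__ == "__main__":
    # Counts taken from the Meta Data above: wordsDialogue=12, wordsTotal=214.
    print(ratio_and_deviation(12, 214))
```

With the counts from this run, `ratio_and_deviation(12, 214)` gives 5.61 and 4.39, matching the Details column. How the 96.35% score is derived from that deviation is not stated in the report, so it is not modeled here.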