Run: 2026-04-24-01KQ0PK246TWPDYEY3BGQGGWK5

Run Details

Model:: GPT-5.5 (Reasoning, Low)
Test:: Dialogue tags
Scenario:: Write 200 words with 10% dialogue
Duration:: 18.6s
Cost:: $0.039540
Provider:: OpenAI

Evaluation summary

Score	Evaluator	Details Link
20.19%	Matches word count	(details)
100.00%	Dialogue to Total Word Ratio	(details)
60.0942%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

Molly unfolded the invitation like a map to buried treasure, except it smelled of frosting and had a dinosaur wearing a party hat on the front. Bill peered over her shoulder, crumbs from his emergency biscuit sprinkling onto the carpet. The target was young Percival Plunkett’s seventh birthday, noon, community hall, theme: pirates. Fate had gift-wrapped their opportunity in tissue paper and supervised it with balloon animals.

"Molly, we stroll in as entertainers, juggle badly, then leave with loot."
"Bill, you can't juggle."
"Exactly. Authentic children's entertainment."

They sat in the van, which was painted with the words Definitely Not Suspicious Plumbing, watching parents carry parcels inside. One box was tall enough to contain a pony or, more likely, educational disappointment. Molly adjusted her fake mustache; Bill practiced saying “ahoy” with the confidence of a man who had once been chased by a swan and called it maritime experience.

Their equipment was simple: empty sacks, squeaky shoes, and a moral compass Molly had won in a Christmas cracker. It spun uselessly whenever pointed at cake. Across the street, children shrieked with joy, unaware that two criminal masterminds were about to be defeated by pass-the-parcel, sticky fingers, and one grandmother with handbag reflexes like a cobra. Bill gulped; Molly grinned, already hearing sirens in the kazoo chorus and applauding anyway loudly.

Evaluation details

Result

Evaluator

Details

Meta Data

20.19%

Matches word count

n/a

neededClean	false
words	220

100.00%

Dialogue to Total Word Ratio

Ratio: 9.42%, Deviation: 0.58%

neededClean	false
wordsTotal	223
wordsDialogue	21

60.0942%