Run: 2026-04-24-01KQ09N5A11JWRNKQM6W4PAKFZ

Run Details

Model:: GPT-OSS 120B
Test:: Dialogue tags
Scenario:: Write 200 words with 90% dialogue
Duration:: 1m 39s
Cost:: $0.000544
Provider:: DeepInfra

Evaluation summary

Score	Evaluator	Details Link
99.98%	Matches word count	(details)
63.25%	Dialogue to Total Word Ratio	(details)
81.6188%	Overall
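
The Overall figure looks consistent with an unweighted mean of the two evaluator scores. The report does not say so, which makes this an assumption. The short check below averages the displayed scores and compares the result with the reported value. The names `scores`, `overall_estimate`, and `reported_overall` are illustrative only. The small gap between the two numbers would be explained by the displayed scores being rounded to two decimals.

```python
# Displayed evaluator scores from the summary table, in percent.
scores = {
    "Matches word count": 99.98,
    "Dialogue to Total Word Ratio": 63.25,
}

# ASSUMPTION: Overall is a plain mean of the evaluator scores.
overall_estimate = sum(scores.values()) / len(scores)

# Value reported in the summary table, in percent.
reported_overall = 81.6188

print(f"estimated: {overall_estimate:.4f}%, reported: {reported_overall:.4f}%")
print(f"difference: {abs(overall_estimate - reported_overall):.4f} points")
```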

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

The night sky glimmered above the backyard where the birthday balloons floated like bright planets, awaiting the stealthy duo musically.  
Molly: "We need a plan for the party."  
Bill: "Alright, what's the layout?"  
Molly: "Kids will be in the living room, balloons everywhere, cake table in the corner."  
Bill: "We slip in through the side gate, avoid the garden lights."  
Molly: "I bring the grappling hook, you disable the alarm with the jammer."  
Bill: "How long until the parents leave?"  
Molly: "Thirty minutes after the clown leaves, the kids will be distracted by the piñata."  
Bill: "Piñata—great cover, they’ll be busy swinging."  
Molly: "Then we grab the gift table, dump the boxes, and load them into the van."  
Bill: "What about the presents with stickers? We sort them later."  
Molly: "Exactly, we only need the boxes, the toys, the gadgets."  
Bill: "And the candy stash—big profit."  
Molly: "We split the loot fifty‑fifty, plus a bonus for whoever carries the most."  
Bill: "Deal. I’ll meet you at the trash bin at midnight."  
Molly: "See you then, partner in crime."  
Bill: "I'll bring the night-vision goggles; they hide us like shadows."  
Molly: "And I’ll wear the clown suit, nobody suspects a thief."

Evaluation details

Result	Evaluator	Details	Meta Data
99.98%	Matches word count	n/a	neededClean: false, words: 202
63.25%	Dialogue to Total Word Ratio	Ratio: 81.77%, Deviation: 8.23%	neededClean: false, wordsTotal: 203, wordsDialogue: 166
81.6188%	Overall
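
The Ratio and Deviation figures in the Dialogue to Total Word Ratio details can be reproduced from the metadata counts. The sketch below assumes the target is the 90% stated in the scenario and that the deviation is the target minus the measured ratio. Both assumptions match the reported numbers but are not confirmed by the report. How the evaluator turns that deviation into the 63.25% score is not shown, so it is left out.

```python
# Counts taken from the evaluator metadata.
words_total = 203
words_dialogue = 166

# ASSUMPTION: target ratio comes from the scenario ("90% dialogue").
target_ratio = 0.90

# Share of all words that fall inside dialogue.
ratio = words_dialogue / words_total

# ASSUMPTION: deviation is the shortfall against the target.
deviation = target_ratio - ratio

print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 81.77%, Deviation: 8.23%"
```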