Run: 2026-02-20-01KHYAK7QTC1QBR9A6WWCTG7SH

Run Details

Model:: Gemma 3 27B
Test:: Dialogue tags
Scenario:: Write 200 words with 10% dialogue
Duration:: 5.1s
Cost:: $0.000089
Provider:: Nebius

Evaluation summary

Score	Evaluator
14.30%	Matches word count
0.00%	Dialogue to Total Word Ratio
7.1507%	Overall

Model input

User/Human

You are a author of comedic heist books.

Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents.

Only include the prose of the scene without any markdown or commentary.

Model response

The inflatable bouncy castle loomed, a garish pink and purple monstrosity dominating Mrs. Higgins’ perfectly manicured garden. Molly squinted, adjusting the ridiculous oversized sunglasses. Bill, meanwhile, was meticulously studying a diagram scrawled on a napkin – a tactical overview of the party’s layout, complete with estimations of juice box consumption and potential Lego-related trip hazards. 

“Right,” Bill muttered, circling the gift table in red pen. “Maximum security zone. Probably guarded by a six-year-old with a sticky hand.”

Molly snorted. She’d handled worse. Like that time with the poodle and the diamond collar. “Focus, Bill. We need a diversion.” 

He tapped his chin. “Pinata?”

“Too obvious. Think… subtle chaos.” Molly spotted a small child attempting to scale the bouncy castle. Inspiration struck. 

“You handle the emotional support,” she said, already reaching for her pocketful of bubbles.

Bill blinked. "Emotional support?"

“Just… look concerned while I unleash a bubble storm of epic proportions. It’ll be beautiful. And distracting. They’ll be too busy popping bubbles to notice a couple of present-acquisition specialists.”

He sighed. “This is why they don't let me plan anymore."

Evaluation details

Result	Evaluator	Details	Meta Data
14.30%	Matches word count	n/a	neededClean: false; words: 179
0.00%	Dialogue to Total Word Ratio	Ratio: 42.62%, Deviation: 32.62%	neededClean: false; wordsTotal: 183; wordsDialogue: 78
7.1507%	Overall
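
The reported ratio and deviation can be reproduced from the recorded counts. The sketch below is a minimal illustration, not the evaluator's actual code: the function names, the quote-matching rule in `count_words`, and the whitespace tokenisation are all assumptions. Only the counts (78 dialogue words, 183 total) and the 10% target come from this run.

```python
import re

# Recorded values from this run's metadata.
WORDS_TOTAL = 183
WORDS_DIALOGUE = 78
TARGET_RATIO = 0.10  # the prompt asked for 10% dialogue


def dialogue_ratio(words_dialogue, words_total):
    """Fraction of all words that fall inside dialogue."""
    return words_dialogue / words_total if words_total else 0.0


def deviation_points(ratio, target):
    """Absolute gap between ratio and target, in percentage points."""
    return abs(ratio - target) * 100


# Hypothetical counter (assumption): anything between straight or curly
# double quotes counts as dialogue. The real evaluator's rule is not shown.
QUOTED = re.compile(r'["“]([^"“”]*)["”]')


def count_words(text):
    """Return (dialogue_words, total_words) using whitespace splitting."""
    dialogue = sum(len(m.split()) for m in QUOTED.findall(text))
    return dialogue, len(text.split())


ratio = dialogue_ratio(WORDS_DIALOGUE, WORDS_TOTAL)
deviation = deviation_points(ratio, TARGET_RATIO)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2f} points")
```

With the recorded counts this gives 78 / 183 ≈ 42.62%, which is 32.62 percentage points above the 10% target. The two evaluators report different totals (179 and 183), which may indicate that they tokenise the response differently, so a simple whitespace counter like `count_words` is not guaranteed to reproduce either figure.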