NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2025-05-22-01JVWG38JQV4B660WKFWX64AD9
Run Details
Model:
Claude Sonnet 4
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
7.3s
Cost:
$0.004395
Evaluation summary
Score
Evaluator
Details Link
20.19%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
60.0948%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly adjusted her ridiculous rainbow wig while crouched behind Mrs. Henderson's prize-winning azaleas. The sound of shrieking six-year-olds echoed from the backyard, punctuated by the occasional adult desperately suggesting everyone "use their inside voices." She squinted through her oversized novelty sunglasses at the mountain of wrapped boxes stacked precariously on the patio table. Bill emerged from behind the garden shed wearing a polka-dotted bow tie that had seen better decades and a painted-on smile that looked more serial killer than children's entertainer. His fake mustache drooped pathetically in the afternoon heat. "Remember the plan," Molly whispered, checking her watch. "Right. I distract them with balloon animals while you grab the loot." The birthday girl's mother appeared in the doorway, frantically waving at someone inside. Perfect timing. Molly gave Bill the signal, and he strutted into the chaos with the confidence of a man who had never successfully twisted a balloon in his life. Within seconds, he was surrounded by tiny sugar-fueled maniacs demanding giraffes, elephants, and dinosaurs. His first balloon attempt resulted in what could generously be called abstract art.
total_cost
0.004395
Evaluation details
Result
Evaluator
Details
Meta Data
20.19%
Matches word count
n/a
neededClean
false
words
180
100.00%
Dialogue to Total Word Ratio
Ratio: 10.22%, Deviation: 0.22%
neededClean
false
wordsTotal
186
wordsDialogue
19
60.0948%