NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-11-04-01JBW8X8XVTR7XE47W8D5Q43CB
Run Details
Model:
Claude 3.5 Haiku
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
4.7s
Cost:
$0.001268
Evaluation summary
Score
Evaluator
Details Link
0.21%
Matches word count
(details)
14.02%
Dialogue to Total Word Ratio
(details)
7.1177%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"You're telling me we're really doing this?" Bill whispered, adjusting his fake mustache. Molly rolled her eyes. "It's the most lucrative job this month. Rich suburban kids get insane birthday gifts." "Breaking into a seven-year-old's birthday party feels... wrong," Bill muttered. "Wrong? These parents spend five grand on Lego sets and gaming consoles. We're basically redistribution experts." Bill scratched his chin. "The bouncy castle complicates entry." "That's our advantage," Molly grinned. "We'll blend in with the chaos. Parents are always distracted during these things." "What if a kid recognizes us?" "We're wearing superhero costumes. Who looks suspicious at a kid's party dressed as Batman and Wonder Woman?" Bill sighed. "This plan is ridiculous." "Ridiculous and profitable," Molly countered. "Those wrapped PlayStation 5s aren't going to steal themselves." "Can we at least wait until the cake cutting?" "Absolutely not. Maximum distraction means maximum loot. You ready?" Bill adjusted his utility belt. "I can't believe we're doing this." "Professional thieves, Bill. We adapt." The bouncy castle awaited, full of unsuspecting presents and oblivious children.
total_cost
0.001268
Evaluation details
Result
Evaluator
Details
Meta Data
0.21%
Matches word count
n/a
neededClean
false
words
172
14.02%
Dialogue to Total Word Ratio
Ratio: 78.16%, Deviation: 11.84%
neededClean
false
wordsTotal
174
wordsDialogue
136
7.1177%