NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-11-04-01JBW91S60XTNYHT7JJMD98C4G
Run Details
Model:
Claude 3.5 Haiku
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
5.9s
Cost:
$0.001236
Evaluation summary
Score
Evaluator
Details Link
43.38%
Matches word count
(details)
99.60%
Dialogue to Total Word Ratio
(details)
71.4900%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the kitchen table, her fingers tracing the layout of the suburban home. Bill leaned in, adjusting his thick-rimmed glasses and sipping from a lukewarm coffee mug. "Kids' parties are always chaos," Molly muttered, tapping a marker against the diagram. The living room showed multiple entry points, potential blind spots near the gift table. Streamers and balloons would provide excellent cover. Bill nodded, his mustache twitching with anticipation. They'd cased the joint for weeks—wealthy family, spoiled twelve-year-old's birthday, expensive electronics and collector's items wrapped in shiny paper. Amateur parents would be distracted by screaming children and sugar-fueled mayhem. "We go in during the piñata," Bill whispered. "Maximum distraction." Molly's lips curled into a sly grin. The plan was elegant in its simplicity. Matching catering staff uniforms, a carefully timed diversion, and quick exit strategy. Years of heists had taught them that children's parties were a goldmine of opportunity—parents too overwhelmed to notice two extra "staff" members slipping through the crowd. She rolled up the blueprints, already imagining the haul. Another perfect score was about to go down.
total_cost
0.001236
Evaluation details
Result
Evaluator
Details
Meta Data
43.38%
Matches word count
n/a
neededClean
false
words
183
99.60%
Dialogue to Total Word Ratio
Ratio: 7.49%, Deviation: 2.51%
neededClean
false
wordsTotal
187
wordsDialogue
14
71.4900%