NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-11-04-01JBW95W5VRDPV0SRW82DJJTV5
Run Details
Model:
Claude 3.5 Haiku
Test:
Dialogue tags
Scenario:
Write 200 words with 10% dialogue
Duration:
4.4s
Cost:
$0.001204
Evaluation summary
Score
Evaluator
Details Link
81.27%
Matches word count
(details)
100.00%
Dialogue to Total Word Ratio
(details)
90.6360%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly traced the blueprint of the suburban party venue with her fingertip, her eyes narrowing with calculated precision. The balloon-festooned community center looked deceptively cheerful, but she saw it for what it was: a treasure trove of wrapped gifts just waiting to be liberated. Bill leaned back in the creaky kitchen chair, rotating a lockpick between his fingers. The room smelled of stale coffee and desperation, their makeshift planning headquarters cluttered with surveillance photos and discarded energy drink cans. "Those birthday presents are basically calling my name," Molly muttered. Bill chuckled. "Calling our names. Equal split, remember?" She rolled her eyes. The party was scheduled for Saturday afternoon, a children's birthday bash guaranteed to have expensive electronics, collector's items, and potentially some high-end jewelry gifted by overzealous grandparents. Their intel suggested at least thirty wrapped packages would be present. Molly spread the floor plan, highlighting potential entry and exit points. The ventilation system offered a promising route, though it would require some precise maneuvering. Bill studied the diagram, his fingers drumming a staccato rhythm of anticipation. "We'll need the inflatable dinosaur costume," he said with a wicked grin.
total_cost
0.001204
Evaluation details
Result
Evaluator
Details
Meta Data
81.27%
Matches word count
n/a
neededClean
false
words
188
100.00%
Dialogue to Total Word Ratio
Ratio: 10.53%, Deviation: 0.53%
neededClean
false
wordsTotal
190
wordsDialogue
20
90.6360%