NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2024-10-16-01JAB3N1E4V3S87QPB7FJBE8J8
Run Details
Model:
Ministral 3B
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
2.1s
Cost:
$0.000015
Evaluation summary
Score
Evaluator
Details Link
86.38%
Matches word count
(details)
66.13%
Dialogue to Total Word Ratio
(details)
76.2575%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
In the shadows of a suburban neighborhood, Molly and Bill stood, assessing the target: a sprawling house with a vibrant sign reading "Happy Birthday, Timmy!" Molly adjusted her gloves, her eyes gleaming with mischief. "Alright, Bill, we've got the plan. We get in, we get the presents, and we get out. No more than fifteen minutes." Bill smirked, pulling a small lock-picking tool from his pocket. "You always know how to make a job sound simple." Molly rolled her eyes. "You're telling me. I've seen your 'simple' jobs." Bill chuckled, "Well, we've got to be quick. The parents will be back from the store any minute." Molly nodded, scanning the house. "And we've got to be quiet. That Timmy kid might see us." Bill grinned. "Don't worry, we've practiced this. Quiet as mice." Molly sighed, "You know, I'd rather just steal the presents than deal with his mom's 'open' security system." Bill laughed, "You can't steal from a kid without a heart, Molly. That's not how we roll." Molly looked at him, a smile on her face. "Fine. Let's do this. But if I see that kid, I'm out." Bill winked, "Deal." They moved forward, their steps silent on the grass. The house loomed before them, the presents a tantalizing prize.
total_cost
0.00001524
Evaluation details
Result
Evaluator
Details
Meta Data
86.38%
Matches word count
n/a
neededClean
false
words
211
66.13%
Dialogue to Total Word Ratio
Ratio: 58.02%, Deviation: 8.02%
neededClean
false
wordsTotal
212
wordsDialogue
123
76.2575%