NC Bench
Overview
About
Tests
Models
Model
Claude 3.0 Sonnet
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2024-07-19-01J35TB85YMG55S6WWRG7M1ZPT
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly leaned back in her chair, a mischievous glint in her eyes. "So, Bill, you ready for the biggest score of our lives?" Bill chuckled, taking a sip of his whiskey. "You know it, partner. Stealing presents from a bunch of snot-nosed brats? That's a new low, even for us." "Ah, but that's where the challenge lies," Molly retorted. "Who would suspect a pair of seasoned thieves like us at a children's party?" "Fair point," Bill conceded. "But how do we even get in? I doubt they'll just let us waltz in like we're the clowns." Molly grinned wickedly. "Leave that to me. I've got a foolproof plan involving a giant teddy bear costume and a kazoo." Bill raised an eyebrow skeptically. "A kazoo? Really?" "Trust me, it'll be hilarious," Molly assured him. "Now, let's go over the escape route one more time..." As they meticulously plotted their heist, the two thieves couldn't help but snicker at the sheer audacity of their plan. Who knew stealing from children could be so entertaining?
id
"gen-0zotw4Et99x2jJ8X7WWebGj4oEDE"
total_cost
0.0045
upstream_id
"msg_01DRMckx7MZx1cekRSytsJzX"
created_at
"2024-07-19T15:39:13.995111+00:00"
Friday, July 19th, 2024 at 3:39:13 PM GMT+00:00
model
"anthropic/claude-3-sonnet:beta"
app_id
182717
streamed
false
cancelled
false
provider_name
"Anthropic"
latency
5903
moderation_latency
(null)
generation_time
0
finish_reason
"end_turn"
tokens_prompt
68
tokens_completion
248
native_tokens_prompt
75
native_tokens_completion
285
num_media_prompt
(null)
num_media_completion
(null)
origin
"https://ncbench.com/"
usage
0.0045
Evaluation details
Result
Evaluator
Details
Meta Data
0.4920%
Matches word count
n/a
neededClean
false
words
173
5.7201%
Dialogue to Total Word Ratio
Ratio: 63.01%, Deviation: 13.01%
neededClean
false
wordsTotal
173
wordsDialogue
109
3.1061%