NC Bench
Overview
About
Tests
Models
Model
Claude 3.7 Sonnet
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2025-02-24-01JMX1P92FDTAYW57XGJETFE4G
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Bill scratched his scruffy beard, looking at the blueprints spread across the table. "So the kid's party is tomorrow at three. The presents will be piled near that bay window." "Perfect," Molly said, twirling a lock of red hair. "While the magician has everyone distracted with his rabbit trick, we slip in through the back door." "You sure that'll work? What about the parents?" "Trust me, they'll all be filming their precious angels. Nobody watches the gifts." Bill frowned. "I dunno, Mol. Stealing from kids? Feels... wrong." "Wrong? Last month we robbed a nursing home!" "Yeah, but those old folks had insurance." Bill sighed. "Plus, they weren't celebrating their birthdays." "The Johnson kid is turning six. He won't remember half those presents anyway." "What if someone sees us?" "We'll wear clown costumes. Blend right in." "I hate clowns." "You'll hate prison more. Look, twenty minutes tops, we're in and out with a haul of expensive toys we can flip online by nightfall." "Fine. But if a kid starts crying, I'm out." "Such a softie. That's why we never rob orphanages."
id
"gen-1740436612-rvjxbXZBNs2tQcDErxFG"
upstream_id
"msg_01YBMWwbGbanVVSy78y2CiRt"
total_cost
0.00465
cache_discount
(null)
provider_name
"Anthropic"
created_at
"2025-02-24T22:36:59.107235+00:00"
Monday, February 24th, 2025 at 10:36:59 PM GMT+00:00
model
"anthropic/claude-3-7-sonnet-20250219:beta"
app_id
182717
streamed
true
cancelled
false
latency
350
moderation_latency
(null)
generation_time
5608
tokens_prompt
68
tokens_completion
252
native_tokens_prompt
75
native_tokens_completion
295
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00465
Evaluation details
Result
Evaluator
Details
Meta Data
20.1897%
Matches word count
n/a
neededClean
false
words
180
97.7388%
Dialogue to Total Word Ratio
Ratio: 86.11%, Deviation: 3.89%
neededClean
false
wordsTotal
180
wordsDialogue
155
58.9642%