NC Bench
Overview
About
Tests
Models
Model
Claude 3.7 Sonnet
Test
Dialogue tags
Scenario
Write 200 words with 90% dialogue
Run ID
2025-02-24-01JMWZZJSSE63M3XS5KCDEE5A0
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Bill, you ready to go over the plan?" Molly slid into the booth, dropping a folder on the table. "Born ready. What's the score on this one?" "Little Emily Bergman, turning six. Parents went all out—designer toys, gaming systems, maybe even cash." Bill whistled. "At a kiddie party? Times have changed." "Not just any party," Molly tapped the folder. "Carnival theme, twenty children, fifteen adults, three clowns." "Clowns? Nobody said anything about clowns." "Relax. We go in as party entertainers. You'll be the balloon artist." "I can't make balloon animals!" "Just twist something together and call it a giraffe. Kids don't know the difference." "And you?" "Face painter. While I'm decorating little cheeks, I'll scope the gift table." "What's our exit strategy?" "Bathroom window. It faces the back alley where I'll park the van." "Timing?" "We hit during the cake. All eyes on the birthday girl." "Brilliant. One question though." "What?" "Doesn't it feel... wrong? Stealing from kids?" Molly stared at him. "Remember what happened at the casino job?" "Fair point. I'll practice making balloon dogs."
id
"gen-1740434820-3SPjmGlbw1CQ3JnxCOKz"
upstream_id
"msg_011m1Kge3PDbPVFjrBgPjYZD"
total_cost
0.00495
cache_discount
(null)
provider_name
"Anthropic"
created_at
"2025-02-24T22:07:07.590891+00:00"
Monday, February 24th, 2025 at 10:07:07 PM GMT+00:00
model
"anthropic/claude-3-7-sonnet-20250219:beta"
app_id
182717
streamed
true
cancelled
false
latency
576
moderation_latency
(null)
generation_time
6070
tokens_prompt
68
tokens_completion
265
native_tokens_prompt
75
native_tokens_completion
315
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00495
Evaluation details
Result
Evaluator
Details
Meta Data
3.6234%
Matches word count
n/a
neededClean
false
words
176
99.8608%
Dialogue to Total Word Ratio
Ratio: 88.07%, Deviation: 1.93%
neededClean
false
wordsTotal
176
wordsDialogue
155
51.7421%