NC Bench
Model: Claude 3.7 Sonnet
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Run ID: 2025-02-24-01JMX0H52W6AE4C78W51MAGDCY
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
Molly crouched behind the bush, binoculars trained on the rainbow-colored bounce house. Bill slid beside her, unfolding a crude crayon drawing of the backyard. "Sixteen presents in the gazebo," Molly whispered. "Pink table. Unattended." Bill scratched his stubble. "Guards?" "One clown, three moms, and a photographer." "The clown worries me." Molly lowered her binoculars. "He's making balloon animals. Hardly Ocean's Eleven." "Remember the magician last time? My eyebrows still haven't grown back." "That was different. This is just a five-year-old's birthday." Bill pointed to a spot on his drawing. "We enter through the side gate during the cake cutting. Maximum distraction." "Then we stuff the presents into these princess backpacks," Molly added, patting her disguise. "Isn't this... morally questionable?" "Says the man who robbed a nun." "She wasn't a real nun!" Molly rolled her eyes. "Look, we grab, we go. Twenty seconds max." "What about the pinata?" "Leave it." "But it's shaped like Spider-Man." "Focus, Bill." He sighed, zipping up his stolen Chuck E. Cheese employee vest. "Let's ruin some childhood memories."
id: "gen-1740435396-3JNwMP4HJs11cUAm9Wlc"
upstream_id: "msg_01Ke7CN1Dr3ajTBt8r6Gv1kT"
total_cost: 0.005235
cache_discount: (null)
provider_name: "Anthropic"
created_at: "2025-02-24T22:16:42.694924+00:00" (Monday, February 24th, 2025 at 10:16:42 PM GMT+00:00)
model: "anthropic/claude-3-7-sonnet-20250219:beta"
app_id: 182717
streamed: true
cancelled: false
latency: 394
moderation_latency: (null)
generation_time: 5404
tokens_prompt: 68
tokens_completion: 262
native_tokens_prompt: 75
native_tokens_completion: 334
native_tokens_reasoning: 0
num_media_prompt: (null)
num_media_completion: (null)
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.005235
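The reported total_cost can be checked against the native token counts. The sketch below assumes list prices of 3 USD per million input tokens and 15 USD per million output tokens. Those prices are not part of this record, and `estimate_cost` is a hypothetical helper, so treat the match as a consistency check and not as the provider's billing formula.

```python
# Assumed list prices in USD per million tokens (not stated in the record).
ASSUMED_INPUT_PRICE_PER_MTOK = 3.00
ASSUMED_OUTPUT_PRICE_PER_MTOK = 15.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Return the estimated request cost in USD, rounded to 6 decimals."""
    cost = (
        prompt_tokens * ASSUMED_INPUT_PRICE_PER_MTOK
        + completion_tokens * ASSUMED_OUTPUT_PRICE_PER_MTOK
    ) / 1_000_000
    return round(cost, 6)


# Values copied from the record: native_tokens_prompt and native_tokens_completion.
native_tokens_prompt = 75
native_tokens_completion = 334

print(estimate_cost(native_tokens_prompt, native_tokens_completion))  # 0.005235
```

Under these assumed prices the estimate equals the reported total_cost and usage of 0.005235, which suggests the cost is computed from the native token counts (75 and 334) and not from the normalized ones (68 and 262).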
Evaluation details
Result | Evaluator | Details | Meta Data
3.6234% | Matches word count | n/a | neededClean: false, words: 176
1.7059% | Dialogue to Total Word Ratio | Ratio: 64.20%, Deviation: 14.20% | neededClean: false, wordsTotal: 176, wordsDialogue: 113
2.6646%
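The reported ratio and deviation follow from the word counts: 113 dialogue words out of 176 total is 64.20%, which is 14.20 percentage points away from the 50% target. The sketch below shows one way such counts could be produced. It assumes words are whitespace-separated and that dialogue is any text between straight double quotes. The evaluator's actual tokenization and scoring are not described in this record, so `dialogue_stats` is a hypothetical helper and may not reproduce the counts above exactly.

```python
import re


def dialogue_stats(text: str, target_ratio: float = 0.5) -> dict:
    """Count total words and words inside straight double quotes."""
    words_total = len(text.split())
    # Assumption: dialogue is whatever sits between pairs of straight double quotes.
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = words_dialogue / words_total if words_total else 0.0
    return {
        "wordsTotal": words_total,
        "wordsDialogue": words_dialogue,
        "ratio": ratio,
        "deviation": abs(ratio - target_ratio),
    }


# Small illustrative sample, not the full model response.
sample = 'Molly sighed. "Focus, Bill." He nodded. "Fine."'
stats = dialogue_stats(sample)
print(stats["wordsTotal"], stats["wordsDialogue"])  # 7 3

# Reproducing the reported figures directly from the recorded counts.
reported_ratio = 113 / 176
print(f"{reported_ratio:.2%}")             # 64.20%
print(f"{abs(reported_ratio - 0.5):.2%}")  # 14.20%
```

The same arithmetic applies to the word count check: 176 words against a 200-word target is a shortfall of 24 words. How either deviation maps to the percentage in the Result column is not stated here.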