NC Bench
Run: 2025-02-24-01JMX28CWQFDD0H1R7BC53C9JY
Run Details
Model: Claude 3.7 Sonnet
Test: Dialogue tags
Scenario: Write 200 words with 10% dialogue
Duration: 5.9s
Cost: $0.004695
Provider: Anthropic
Evaluation summary
Score | Evaluator
6.09% | Matches word count
0.00% | Dialogue to Total Word Ratio
3.0453% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly paced the dim garage, her footsteps echoing against the concrete floor. Rain pattered against the windows while Bill hunched over blueprints spread across an old card table. "This is low, even for us," Molly muttered, twirling a lock of hair nervously. Bill shrugged without looking up. "Six-year-olds get better presents than we ever did. Little Timmy Crawford's getting a PlayStation 5." Molly stopped pacing and leaned over the table. The blueprint showed the Crawford house layout, with red X's marking entry points and blue dots indicating present locations. "Parents always congregate in the kitchen," Bill explained, tapping the paper. "We slip in through the back during the magic show." "What if a kid sees us?" "That's why we're dressing as clowns." Molly grimaced. "I hate clowns." "Everyone does. That's the beauty of it." Bill reached into a duffel bag and pulled out a rainbow wig and red nose. Molly snatched them with a groan. "Fine, but we're leaving the cake," she insisted. "Even thieves have standards." Bill grinned. "Deal. Now let's go ruin a birthday party."
id: "gen-1740437206-W7mggicaZNo4b7zTRy7D"
upstream_id: "msg_01QTpwf2ojEmbkRvscxxKHEb"
total_cost: 0.004695
cache_discount: (null)
provider_name: "Anthropic"
created_at: "2025-02-24T22:46:52.495354+00:00"
model: "anthropic/claude-3-7-sonnet-20250219:beta"
app_id: 182717
streamed: true
cancelled: false
latency: 553
moderation_latency: (null)
generation_time: 4919
tokens_prompt: 68
tokens_completion: 249
native_tokens_prompt: 75
native_tokens_completion: 298
native_tokens_reasoning: 0
num_media_prompt: (null)
num_media_completion: (null)
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.004695
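The recorded cost can be cross-checked against the native token counts. The sketch below assumes list prices of $3 per million input tokens and $15 per million output tokens for this model; those prices are not stated on this page, and the helper name `estimate_cost` is hypothetical. It also assumes `generation_time` is in milliseconds, which the page does not confirm.

```python
# Assumed prices in USD per million tokens (not taken from this page).
INPUT_PRICE_PER_MTOK = 3.00
OUTPUT_PRICE_PER_MTOK = 15.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate request cost in USD from native token counts."""
    total_micro_usd = (
        prompt_tokens * INPUT_PRICE_PER_MTOK
        + completion_tokens * OUTPUT_PRICE_PER_MTOK
    )
    return total_micro_usd / 1_000_000


# Values recorded in the metadata: native_tokens_prompt and native_tokens_completion.
cost = estimate_cost(prompt_tokens=75, completion_tokens=298)
print(f"{cost:.6f}")  # 0.004695, matching total_cost

# Rough output throughput, assuming generation_time (4919) is in milliseconds.
tokens_per_second = 298 / (4919 / 1000)
print(f"{tokens_per_second:.1f} tokens/s")
```

Under these assumptions the estimate reproduces the reported `total_cost`, which suggests billing follows the native token counts rather than `tokens_prompt` and `tokens_completion`.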
Evaluation details
Result | Evaluator | Details | Meta Data
6.09% | Matches word count | n/a | neededClean: false, words: 177
0.00% | Dialogue to Total Word Ratio | Ratio: 44.13%, Deviation: 34.13% | neededClean: false, wordsTotal: 179, wordsDialogue: 79
3.0453% | Overall | |
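The ratio and deviation figures follow directly from the recorded word counts. The sketch below reproduces that arithmetic and shows one possible way to count dialogue words, by treating text inside straight double quotes as dialogue. The function `dialogue_ratio` is hypothetical and is not the evaluator's actual implementation. Averaging the two evaluator scores to get the overall score is likewise an assumption, though it is consistent with the numbers shown.

```python
import re


def dialogue_ratio(text: str) -> float:
    """Share of whitespace-separated words that fall inside double quotes.

    Hypothetical counting rule; the real evaluator may tokenize differently.
    """
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    total_words = len(text.split())
    return dialogue_words / total_words if total_words else 0.0


# Arithmetic on the counts recorded above.
words_dialogue, words_total = 79, 179
target_ratio = 0.10  # the scenario asks for 10% dialogue

ratio = words_dialogue / words_total
deviation = abs(ratio - target_ratio)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# Ratio: 44.13%, Deviation: 34.13%

# Assumption: the overall score is the mean of the evaluator scores.
overall = (6.09 + 0.00) / 2
print(f"{overall:.3f}")  # 3.045, close to the reported 3.0453
```

A short usage example of the counting function: `dialogue_ratio('Molly grimaced. "I hate clowns." Bill nodded.')` counts 3 quoted words out of 7, giving roughly 0.43.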