NC Bench
Overview
About
Tests
Models
Model
Claude 3.7 Sonnet
Test
N-Length Sentences
Scenario
Write sentences with 10 words each
Run ID
2025-02-24-01JMWZQ2D3D3BSW3T6MB9GY09G
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 10 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
## The Silent Echo Detective Morgan arrived at the crime scene on a rainy night. The yellow tape fluttered gently in the wind around the house. Raindrops tapped a melancholy rhythm on the wooden porch steps. A single light bulb swayed from the ceiling inside. The victim lay sprawled across the antique Persian living room rug. Blood had seeped into the intricate patterns of the fabric. Detective Morgan knelt down to examine the mysterious golden cufflink. It didn't belong to the victim, a woman named Sarah. Her phone was missing, but her wallet remained untouched nearby. The murderer had been careful, methodical, almost surgically precise perhaps. On the wall hung a family portrait with one face scratched out. Morgan noticed the faint scent of expensive cologne lingering behind. The grandfather clock in the hallway had stopped at midnight exactly. Outside, a neighbor watched from their window with fearful eyes. Something about this scene felt personal, vengeful, and coldly calculated.
id
"gen-1740434541-8zCQwy0yCM5mQcnfV9gD"
upstream_id
"msg_01CeBHemD4uyCtrVU4Zcprz3"
total_cost
0.003462
cache_discount
(null)
provider_name
"Anthropic"
created_at
"2025-02-24T22:02:27.438789+00:00"
Monday, February 24th, 2025 at 10:02:27 PM GMT+00:00
model
"anthropic/claude-3-7-sonnet-20250219:beta"
app_id
182717
streamed
true
cancelled
false
latency
488
moderation_latency
(null)
generation_time
4482
tokens_prompt
42
tokens_completion
198
native_tokens_prompt
44
native_tokens_completion
222
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.003462
Evaluation details
Result
Evaluator
Details
Meta Data
81.2832%
Matches word count
n/a
neededClean
false
sentences
16
wordCounts
0
4
1
11
2
11
3
10
4
9
5
11
6
10
7
10
8
10
9
10
10
10
11
12
12
10
13
11
14
10
15
10