NC Bench
Overview
About
Tests
Models
Model
Claude 3.7 Sonnet
Test
N-Length Sentences
Scenario
Write sentences with 20 words each
Run ID
2025-02-24-01JMX296FXGF646MVDTP6W5YM4
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The rain fell heavily outside the creaking old mansion, casting long shadows across the dimly lit entrance hall. Detective Sarah Morgan shivered as she stepped over the threshold, water dripping from her coat onto marble floors. A grandfather clock ticked ominously in the corner, its metallic rhythm counting down the seconds since the grisly discovery. The victim lay sprawled at the bottom of the grand staircase, eyes frozen open in an expression of terrible surprise. Lady Eleanor Blackwood's lifeless body wore an elegant evening gown, now stained dark crimson where the knife had penetrated. Uniformed officers moved carefully around the scene, photographing evidence and measuring blood spatter with practiced, solemn efficiency. "The housekeeper found her just after midnight," whispered Inspector Collins, his weathered face grave beneath the flickering chandelier. "Everyone present at the dinner party is waiting in the library, and not a single one has a solid alibi."
id
"gen-1740437232-1OzU5erynfb7TJoABYtR"
upstream_id
"msg_01F7H3y9yQL8KRBAvKuiwdqL"
total_cost
0.003387
cache_discount
(null)
provider_name
"Anthropic"
created_at
"2025-02-24T22:47:17.197063+00:00"
Monday, February 24th, 2025 at 10:47:17 PM GMT+00:00
model
"anthropic/claude-3-7-sonnet-20250219:beta"
app_id
182717
streamed
true
cancelled
false
latency
329
moderation_latency
(null)
generation_time
3979
tokens_prompt
42
tokens_completion
186
native_tokens_prompt
44
native_tokens_completion
217
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.003387
Evaluation details
Result
Evaluator
Details
Meta Data
45.3772%
Matches word count
n/a
neededClean
false
sentences
8
wordCounts
0
18
1
18
2
19
3
20
4
19
5
17
6
18
7
20