Model

Claude 3.7 Sonnet

Test

N-Length Sentences

Scenario

Write sentences with 20 words each

Run ID

2025-02-24-01JMX26R7G27H9D6SX5A9D9D9E

Model input

User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.

Model response

AI
The old mansion creaked and settled in the darkness as thunder rolled ominously across the brooding night sky. Detective Sarah Morgan pulled her coat tight against the chill, her footsteps echoing on the rain-slicked cobblestone path. Yellow police tape fluttered in the wind like macabre party decorations around the grand entrance of Ravenscroft Manor. Inside, the victim lay sprawled across an antique Persian rug, one hand reaching toward something only he could see. Blood had pooled beneath the body, seeping into the intricate patterns of the expensive textile like dark artistic intention. The victim's face remained frozen in an expression of surprise, as if death had arrived completely unexpected that evening. "Time of death approximately nine thirty," muttered the medical examiner, looking up from the cooling corpse with practiced detachment. Sarah noticed a crumpled note clutched in the victim's other hand, partially hidden beneath the weight of his body. The room held five suspects, each with secrets darker than the storm outside, each avoiding contact with the others.
id"gen-1740437152-VCmtM7F3bkUclZLl48Uf" 
upstream_id"msg_013c8tYAEu3fBHSK7FkeLDVL" 
total_cost0.003717
cache_discount(null)
provider_name"Anthropic" 
created_at"2025-02-24T22:45:57.355449+00:00" Monday, February 24th, 2025 at 10:45:57 PM GMT+00:00
model"anthropic/claude-3-7-sonnet-20250219:beta" 
app_id182717
streamedtrue
cancelledfalse
latency388
moderation_latency(null)
generation_time3951
tokens_prompt42
tokens_completion211
native_tokens_prompt44
native_tokens_completion239
native_tokens_reasoning0
num_media_prompt(null)
num_media_completion(null)
num_search_results(null)
origin"https://ncbench.com/" 
is_byokfalse
finish_reason"stop" 
native_finish_reason"stop" 
usage0.003717

Evaluation details

Result Evaluator Details Meta Data
50.1821% Matches word count n/a
neededCleanfalse
sentences9
wordCounts
018
119
218
319
419
519
619
719
819