NC Bench
Overview
About
Tests
Models
Model
Claude 3.7 Sonnet
Test
N-Length Sentences
Scenario
Write sentences with 10 words each
Run ID
2025-02-24-01JMX09HP6B8Z7MQAR9D7J1AT6
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 10 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The body lay sprawled across the marble floor, blood pooling around. Detective Morgan arrived at the scene, his face etched with concern. Yellow crime scene tape fluttered in the cold autumn breeze. The victim's eyes stared vacantly at the ornate crystal chandelier. A single bullet hole marked the center of his chest. The mansion's silence was broken only by the detective's footsteps. "Time of death, approximately four hours ago," stated the coroner. Morgan knelt beside the body, studying the man's pale face. "Who found him?" he asked, pulling on latex examination gloves. The young officer consulted his notes with nervous, trembling hands. "His daughter called it in at precisely seven fifteen, sir." A sterling silver watch on the victim's wrist had stopped. Morgan noticed a crumpled note clutched in the dead hand. Carefully, he extracted the paper without disturbing potential evidence. Five words written in elegant script: "The debt is now paid."
id
"gen-1740435147-9DJJa0DcwHXb837wsrpF"
upstream_id
"msg_01MhmgNnMeYmEBfJGmUcnrVW"
total_cost
0.003567
cache_discount
(null)
provider_name
"Anthropic"
created_at
"2025-02-24T22:12:34.93608+00:00"
Monday, February 24th, 2025 at 10:12:34 PM GMT+00:00
model
"anthropic/claude-3-7-sonnet-20250219:beta"
app_id
182717
streamed
true
cancelled
false
latency
592
moderation_latency
(null)
generation_time
4991
tokens_prompt
42
tokens_completion
198
native_tokens_prompt
44
native_tokens_completion
229
native_tokens_reasoning
0
num_media_prompt
(null)
num_media_completion
(null)
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.003567
Evaluation details
Result
Evaluator
Details
Meta Data
93.0885%
Matches word count
n/a
neededClean
false
sentences
15
wordCounts
0
11
1
11
2
10
3
10
4
10
5
10
6
10
7
10
8
10
9
10
10
10
11
10
12
10
13
9
14
11