NC Bench
Overview
About
Tests
Models
Model
Claude Sonnet 4.5
Test
N-Length Sentences
Scenario
Write sentences with 10 words each
Run ID
2026-02-08-01KGYYGADDH032XAQSCH5GED6R
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 10 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The fog rolled thick across the old Victorian mansion's grounds. Detective Sarah Chen stepped from her car into darkness. A butler waited by the door, his face unreadable. "Thank you for coming so quickly," he said nervously today. Inside, the body lay sprawled across the marble entrance hall. Blood pooled beneath the victim's head, still wet, still fresh. Lord Ashworth's eyes stared upward, frozen in permanent shocked surprise. The murder weapon, a candlestick, lay beside him now. Six guests remained trapped inside by the storm outside tonight. One of them was a killer hiding among them.
created_at
"2026-02-08T15:39:16.105296+00:00"
Sunday, February 8th, 2026 at 3:39:16 PM GMT+00:00
model
"anthropic/claude-4.5-sonnet-20250929"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1383
moderation_latency
180
generation_time
4063
tokens_prompt
35
tokens_completion
125
native_tokens_prompt
44
native_tokens_completion
142
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.002262
router
(null)
provider_responses
0
id
"4ee4da87-74e3-4075-9243-63fd9cf365cc"
status
200
is_byok
false
latency
1383
endpoint_id
"d8bee848-f020-454a-b2be-4a7ce868f99f"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-sonnet-20250929"
api_type
"completions"
id
"gen-1770565151-3QKWtKdLI19ZHeJx2ePA"
upstream_id
"4ee4da87-74e3-4075-9243-63fd9cf365cc"
total_cost
0.002262
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
89.6327%
Matches word count
n/a
neededClean
false
sentences
10
wordCounts
0
10
1
9
2
9
3
10
4
10
5
10
6
10
7
9
8
10
9
9