NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8VXHSMJQZPPJYNB0S4RDPR
Run Details
Model:
Claude Sonnet 4.6 (Reasoning)
Test:
N-Length Sentences
Scenario:
Write sentences with 20 words each
Duration:
1m 45s
Cost:
$0.185133
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The old gray manor house had been sitting quietly at the very edge of town for over a hundred years. Its crumbling stone walls were slowly being consumed by the dark green ivy that crept quietly up each shadowed side. That all changed on the cold and rainy Thursday evening when young Thomas Holt was found dead at the door. His pale cold body lay face down across the stone steps, one arm outstretched toward the heavy brass door knocker. Detective Inspector Mara Stone pushed open the creaking gate and swept her flashlight in an arc through the thick fog. She had seen dozens of crime scenes, but something about this one made the hairs on both her arms rise. The victim wore a very fine formal black suit, as though he had been dressed for an important evening event. There was no visible wound or mark upon him, and that fact made the whole terrible scene more deeply disturbing. Mara slowly crouched down beside the dead body and spotted a single dark red rose petal beneath his limp hand. She knew then that this was no ordinary death, and that someone deeply clever and very patient was watching her.
created_at
"2026-02-24T22:22:05.115Z"
2/24/2026, 10:22:05 PM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
661
moderation_latency
203
generation_time
104368
tokens_prompt
46
tokens_completion
1032
native_tokens_prompt
61
native_tokens_completion
12330
native_tokens_completion_images
(null)
native_tokens_reasoning
755
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.185133
router
(null)
provider_responses
0
endpoint_id
"83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
id
"77e33349-52d4-449d-96d0-0ce466b9124b"
is_byok
false
latency
660
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771971725-uoLr4Vadbjz93BOzyLHn"
upstream_id
"77e33349-52d4-449d-96d0-0ce466b9124b"
total_cost
0.185133
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
sentences
10
wordCounts
0
20
1
20
2
20
3
20
4
20
5
20
6
20
7
20
8
20
9
20