NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE932448AWD2Z1TD21VB5TW
Run Details
Model:
Claude Sonnet 5
Test:
N-Length Sentences
Scenario:
Write sentences with 10 words each
Duration:
7.7s
Cost:
$0.003350
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
67.14%
Matches word count
(details)
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 10 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
# The Last Guest Rain hammered the old mansion's windows without any mercy. Detective Sarah Chen arrived precisely at midnight's darkest hour. The butler waited nervously beside the grand oak door. "They're all here," he whispered, glancing back inside quickly. Seven guests sat frozen around the mahogany dining table. Nobody dared move, speak, or even breathe too loudly. The host lay motionless, face down near his chair. Blood pooled slowly beneath him, staining the Persian rug. Sarah scanned each face, searching for signs of guilt. Someone in this room had committed cold-blooded murder. The candles flickered wildly as thunder shook the walls. "No one leaves," Sarah announced, removing her damp coat. She studied the scene, noting details others might miss. A wine glass lay shattered near the victim's hand. Something about this careful setup felt terribly wrong here. The killer, she suspected, was hiding in plain sight.
created_at
"2026-07-01T07:25:27.565Z"
7/1/2026, 7:25:27 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1645
moderation_latency
(null)
generation_time
7619
tokens_prompt
46
tokens_completion
235
native_tokens_prompt
60
native_tokens_completion
323
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.00335
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01MzCabRNrqMZ5D4tAj3eQTd"
is_byok
false
latency
1645
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890727-mz5MUF6S7PW9btw2nadT"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890727-cJZQpxJEUAyHhI6MVq78"
upstream_id
"msg_01MzCabRNrqMZ5D4tAj3eQTd"
total_cost
0.00335
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
67.14%
Matches word count
n/a
neededClean
false
sentences
17
wordCounts
0
4
1
9
2
9
3
9
4
9
5
9
6
9
7
9
8
9
9
9
10
8
11
9
12
9
13
9
14
9
15
9
16
9