NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZTZQ2DDS6SG1YZQZ6A7J34
Run Details
Model:
Claude Opus 4.7
Test:
N-Length Sentences
Scenario:
Write sentences with 20 words each
Duration:
5.8s
Cost:
$0.009900
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
77.28%
Matches word count
(details)
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The rain hammered against the windows of Ravenscroft Manor as Detective Eleanor Hayes stepped carefully across the marble foyer floor. A single candelabra flickered beside the grand staircase, casting long shadows that danced like ghosts upon the walls. The body lay facedown near the fireplace, one pale hand outstretched toward something that was no longer sitting there. Lord Ashworth had been dead for precisely three hours, according to the coroner who stood grimly beside his medical bag. Seven guests remained trapped inside the manor, each wearing expressions of carefully rehearsed shock that Eleanor immediately found deeply suspicious. The storm outside had severed every telephone line, every road, every possible connection between this house and the distant village beyond. Eleanor knelt beside the victim, noting the crystal decanter shattered across the rug and the strange symbol drawn in ash. "Nobody leaves this room," she announced quietly, her voice cutting through the tense silence like a blade through silk. One guest gasped audibly, another reached instinctively toward his jacket pocket, and somewhere upstairs a door slammed shut with finality.
created_at
"2026-04-24T13:31:06.713Z"
4/24/2026, 1:31:06 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1142
moderation_latency
(null)
generation_time
5682
tokens_prompt
46
tokens_completion
293
native_tokens_prompt
65
native_tokens_completion
383
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.0099
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"5454d6e0-f042-49b3-b8cc-09f6f611d33e"
is_byok
false
latency
1142
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037466-pDYDFJtMrdC3d85rq1KQ"
session_id
(null)
api_type
"completions"
id
"gen-1777037466-s3KDfy2DykhPtrRKytdy"
upstream_id
"5454d6e0-f042-49b3-b8cc-09f6f611d33e"
total_cost
0.0099
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
77.28%
Matches word count
n/a
neededClean
false
sentences
9
wordCounts
0
20
1
18
2
19
3
20
4
20
5
21
6
20
7
19
8
20