NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-06-01KK1HA8M4RWHZH9DMVECXZV7P
Run Details
Model:
GPT-5.4 (Reasoning)
Test:
N-Length Sentences
Scenario:
Write sentences with 20 words each
Duration:
3.6s
Cost:
$0.001677
Provider:
OpenAI
Evaluation summary
Score
Evaluator
Details Link
80.33%
Matches word count
(details)
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
Rain needled the manor windows as mourners gathered, whispering beneath chandeliers that trembled like trapped stars above their black umbrellas. At precisely midnight, the grandfather clock groaned, and Lady Vale collapsed beside the hearth, crimson blooming across her pearl gown. No one admitted hearing the shot, yet every face blanched equally when the butler locked the doors without speaking. Inspector Crowe arrived through thunder and mud, studying twelve guests whose careful condolences already sounded rehearsed, brittle, and dangerous.
created_at
"2026-03-06T12:17:48.171Z"
3/6/2026, 12:17:48 PM
model
"openai/gpt-5.4-20260305"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
427
moderation_latency
(null)
generation_time
3592
tokens_prompt
46
tokens_completion
140
native_tokens_prompt
41
native_tokens_completion
105
native_tokens_completion_images
0
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0016775
router
(null)
provider_responses
0
endpoint_id
"9ff5625c-403f-4d7f-b895-58ac7295062c"
id
"resp_0af53a980eb30a880169aac5ec39e48193a1190e31cd96a861"
is_byok
false
latency
152
model_permaslug
"openai/gpt-5.4-20260305"
provider_name
"OpenAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1772799468-xHWH2tdtKMJz2CAIEi5t"
upstream_id
"resp_0af53a980eb30a880169aac5ec39e48193a1190e31cd96a861"
total_cost
0.0016775
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
80.33%
Matches word count
n/a
neededClean
false
sentences
4
wordCounts
0
20
1
20
2
19
3
19