NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-16-01KHK9YA0V635ZWJDG8SN62PRZ
Run Details
Model:
GPT-5.2
Test:
N-Length Sentences
Scenario:
Write sentences with 20 words each
Duration:
15.9s
Cost:
$0.014100
Provider:
OpenAI
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
Rain sheeted over Hawthorn Manor as Inspector Lyle arrived, summoned by a frantic telegram before dawn today from London headquarters. The iron gates stood ajar, and the gravel drive showed only one set of fresh tire tracks inward so far. Inside, servants whispered in tight circles, avoiding the library where Lord Merrin lay facedown beside shattered glass and ink spilled. A fire burned low, yet the room smelled of sea salt, as if a door had opened recently outside again. Lyle noted the clock stopped at 2:17, while a violin string curled around the victim's ringed finger like black hair. On the desk, a half-written letter accused someone called Sparrow, but the signature was carefully sliced away with surgical precision. When Lyle lifted the decanter, he found it warm, though the wine inside looked undisturbed and darkly clear to him. From the hallway, a single note rang, then ceased, and Lyle realized the murderer might still be listening nearby tonight.
created_at
"2026-02-16T13:24:08.959953+00:00"
2/16/2026, 1:24:08 PM
model
"openai/gpt-5.2-20251211"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
4940
moderation_latency
(null)
generation_time
15687
tokens_prompt
46
tokens_completion
622
native_tokens_prompt
41
native_tokens_completion
1002
native_tokens_completion_images
0
native_tokens_reasoning
788
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.01409975
router
(null)
provider_responses
0
id
"resp_07c6ca3a739f71710169931a688ee8819489c07ac6346e44ad"
status
200
is_byok
false
latency
346
endpoint_id
"f00142c2-6a93-49ce-9e36-5593b904ce3b"
provider_name
"OpenAI"
model_permaslug
"openai/gpt-5.2-20251211"
api_type
"completions"
id
"gen-1771248232-mWhchLNeUNkbKf5IphUf"
upstream_id
"resp_07c6ca3a739f71710169931a688ee8819489c07ac6346e44ad"
total_cost
0.01409975
cache_discount
(null)
upstream_inference_cost
0
provider_name
"OpenAI"
Evaluation details
Result
Evaluator
Details
Meta Data
95.0816%
Matches word count
n/a
neededClean
false
sentences
8
wordCounts
0
20
1
20
2
20
3
20
4
20
5
21
6
20
7
20