NC Bench
Run: 2026-02-16-01KHK9KJ4YKZP1WGM91YA4SMNC
Run Details
Model: GPT-5.1
Test: N-Length Sentences
Scenario: Write sentences with 10 words each
Duration: 27.4s
Cost: $0.018651
Provider: OpenAI
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 10 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
Rain hammered the glass as Detective Mara Quinn approached Whitfield. The mansion loomed ahead, windows black, driveway slick with secrets. Inside, the lights flickered like nervous witnesses awaiting cross-examination tonight. A butler opened cautiously, studying Mara's badge with open suspicion. Behind him, voices trembled, clustered around the grand piano's shadow. The call had mentioned screams, then silence, then rushing water. Water still dripped somewhere, irregular as a frightened heartbeat nearby. Mara stepped inside, cataloging expensive art and poorly hidden fear. A shattered champagne flute glittered beneath the chandelier like starlight. Blood marred the ivory keys, drying in hesitant, uneven streaks. On the loveseat, four guests waited, arranged like mismatched portraits. Their clothes whispered money; their eyes screamed something harder, dangerous. Mara noted who stared at the keys, who deliberately didn't. In the adjoining conservatory, paramedics hovered around an unseen body. A woman's laughter echoed faintly, recorded on someone's forgotten phone. Mara exhaled slowly, tasting copper, perfume, and the inevitable tonight.
created_at: "2026-02-16T13:18:28.437225+00:00" (2/16/2026, 1:18:28 PM)
model: "openai/gpt-5.1-20251113"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 25657
moderation_latency: (null)
generation_time: 27176
tokens_prompt: 46
tokens_completion: 308
native_tokens_prompt: 41
native_tokens_completion: 1860
native_tokens_completion_images: 0
native_tokens_reasoning: 1616
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "completed"
usage: 0.01865125
router: (null)
provider_responses:
0:
id: "resp_0b4b6b1445239a2d0169931908775881978beed4cd12e6237d"
status: 200
is_byok: false
latency: 174
endpoint_id: "764eb97f-8bab-4326-b29b-7a8799b00a70"
provider_name: "OpenAI"
model_permaslug: "openai/gpt-5.1-20251113"
api_type: "completions"
id: "gen-1771247880-So4QdkBnrVHhX4I0uEHJ"
upstream_id: "resp_0b4b6b1445239a2d0169931908775881978beed4cd12e6237d"
total_cost: 0.01865125
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "OpenAI"
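The token counts and the cost in the metadata above can be related with a little arithmetic. The sketch below is illustrative only: the two per-million-token prices are assumptions that do not appear anywhere in this run record, and the function name `run_cost` is hypothetical. Under those assumed prices, and assuming reasoning tokens are billed as part of the native completion count, the result happens to equal the reported `total_cost`.

```python
from decimal import Decimal

# ASSUMPTION: per-million-token prices in USD. These are NOT stated in the
# run record; they are placeholders chosen for illustration.
INPUT_USD_PER_MTOK = Decimal("1.25")
OUTPUT_USD_PER_MTOK = Decimal("10")

# Values copied from the run metadata.
native_tokens_prompt = 41
native_tokens_completion = 1860
native_tokens_reasoning = 1616


def run_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Cost in USD for one request under the assumed prices (hypothetical helper)."""
    million = Decimal(1_000_000)
    total = (prompt_tokens * INPUT_USD_PER_MTOK
             + completion_tokens * OUTPUT_USD_PER_MTOK)
    return total / million


cost = run_cost(native_tokens_prompt, native_tokens_completion)

# Completion tokens that were not reasoning tokens, assuming the reasoning
# count is a subset of the native completion count.
non_reasoning_tokens = native_tokens_completion - native_tokens_reasoning

print(cost)
print(non_reasoning_tokens)
```

Decimal arithmetic is used instead of floats so that the comparison against the eight-decimal `total_cost` figure is exact.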
Evaluation details
Result: 98.3801%
Evaluator: Matches word count
Details: n/a
Meta Data:
neededClean: false
sentences: 16
wordCounts (index: count):
0: 10
1: 10
2: 11
3: 10
4: 10
5: 10
6: 10
7: 10
8: 10
9: 10
10: 10
11: 10
12: 10
13: 10
14: 10
15: 10
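A per-sentence word count like the one reported above can be sketched as follows. This is a minimal illustration, not the evaluator's actual code: the function names, the sentence-splitting regex, and the `split_hyphens` option are all assumptions. The sample reuses the first three sentences of the model response. Treating "cross-examination" as two words is one possible explanation for the count of 11 at index 2, but the record does not confirm how the evaluator tokenizes.

```python
import re


def split_sentences(text: str) -> list[str]:
    """Naive splitter: a sentence ends at '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def count_words(sentence: str, split_hyphens: bool = False) -> int:
    """Count whitespace-separated words; optionally treat hyphens as separators."""
    if split_hyphens:
        sentence = sentence.replace("-", " ")
    return len(sentence.split())


# First three sentences of the model response in this run.
sample = (
    "Rain hammered the glass as Detective Mara Quinn approached Whitfield. "
    "The mansion loomed ahead, windows black, driveway slick with secrets. "
    "Inside, the lights flickered like nervous witnesses awaiting "
    "cross-examination tonight."
)

for index, sentence in enumerate(split_sentences(sample)):
    plain = count_words(sentence)
    hyphen_split = count_words(sentence, split_hyphens=True)
    print(index, plain, hyphen_split)
```

With whitespace-only counting all three sentences come out at 10 words; with hyphens treated as separators the third comes out at 11, which is the only place the two counting rules disagree in this sample.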