NC Bench
Run: 2025-04-14-01JRTSQG06PNHVKHBPK6VP63JN
Run Details
Model: GPT-4.1
Test: N-Length Sentences
Scenario: Write sentences with 20 words each
Duration: 4.0s
Cost: $0.001420
Provider: OpenAI
Evaluation summary
Score: 93.44%
Evaluator: Matches word count
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
Heavy rain slapped the rooftop while lightning illuminated the ornate study, revealing six nervous guests whispering secrets among tumbling shadows. A grandfather clock in the corner struck midnight, echoing through the silence, as a scream shattered the fragile peace instantly. Detective Harris sipped cold coffee, notebook ready, eyes darting between anxious faces while a sense of unease prickled against his skin. On the polished mahogany floor, blood spread in a crimson halo around the lifeless host, Arthur Bainbridge, gentleman and collector. Curtains fluttered in the drafty window, and the house creaked with secrets as the suspects glanced uneasily at the corpse. A single gold cufflink glimmered near Arthur’s outstretched hand, while footprints smeared across the rug hinted at a hasty retreat.
id: "gen-1744656123-fxqcCKF0yL8Vf63rTBlb"
upstream_id: "chatcmpl-BMJ2pSXrLz0rotwSNyQjD6vKZrm4F"
total_cost: 0.00142
cache_discount: (null)
provider_name: "OpenAI"
created_at: "2025-04-14T18:42:06.138541+00:00" (4/14/2025, 6:42:06 PM)
model: "openai/gpt-4.1-2025-04-14"
app_id: 182717
streamed: true
cancelled: false
latency: 341
moderation_latency: 112
generation_time: 1482
tokens_prompt: 42
tokens_completion: 166
native_tokens_prompt: 42
native_tokens_completion: 167
native_tokens_reasoning: 0
num_media_prompt: (null)
num_media_completion: (null)
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.00142
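The reported cost can be cross-checked against the native token counts in the metadata above. The sketch below is only an illustration: the per-token rates (2.00 USD per million input tokens, 8.00 USD per million output tokens) are an assumption that does not appear anywhere in this run, and `estimate_cost` is a hypothetical helper, not part of the benchmark. Billing on native rather than normalized token counts is also assumed.

```python
# Assumed prices in USD per one million tokens (not stated in this run).
ASSUMED_INPUT_USD_PER_MTOK = 2.00
ASSUMED_OUTPUT_USD_PER_MTOK = 8.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Return the estimated request cost in USD under the assumed rates."""
    total_micro_usd = (
        prompt_tokens * ASSUMED_INPUT_USD_PER_MTOK
        + completion_tokens * ASSUMED_OUTPUT_USD_PER_MTOK
    )
    return total_micro_usd / 1_000_000


# native_tokens_prompt and native_tokens_completion from the metadata.
cost = estimate_cost(prompt_tokens=42, completion_tokens=167)
print(f"{cost:.6f}")  # prints 0.001420
```

Under these assumed rates the estimate agrees with the `total_cost` and `usage` values of 0.00142 reported for this run.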
Evaluation details
Result: 93.44%
Evaluator: Matches word count
Details: n/a
Meta Data:
neededClean: false
sentences: 6
wordCounts:
0: 20
1: 20
2: 21
3: 20
4: 20
5: 20
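The per-sentence word counts above can be reproduced with a simple count. The following is a minimal sketch, assuming sentences end at `.`, `!`, or `?` followed by whitespace and that words are whitespace-separated tokens. The evaluator's actual splitting rules and the formula behind the 93.44% score are not shown on this page, so `sentence_word_counts` and `mismatched_sentences` are hypothetical helpers rather than the benchmark's implementation.

```python
import re


def sentence_word_counts(text: str) -> list[int]:
    """Count whitespace-separated words in each sentence of text."""
    # Naive split: break after ., ! or ? when followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    return [len(s.split()) for s in sentences]


def mismatched_sentences(counts: list[int], target: int) -> list[int]:
    """Return zero-based indices of sentences that miss the target length."""
    return [i for i, n in enumerate(counts) if n != target]


# Two sentences taken from the model response (indices 2 and 3 above).
sample = (
    "Detective Harris sipped cold coffee, notebook ready, eyes darting "
    "between anxious faces while a sense of unease prickled against his skin. "
    "On the polished mahogany floor, blood spread in a crimson halo around "
    "the lifeless host, Arthur Bainbridge, gentleman and collector."
)

counts = sentence_word_counts(sample)
print(counts)                            # [21, 20]
print(mismatched_sentences(counts, 20))  # [0]
```

A split this naive would miscount text containing abbreviations such as "Dr." or quoted dialogue, which may be why the evaluator records a `neededClean` flag, though that interpretation is a guess.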