NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYA28G9WFHG9VQXZ58BK3K9
Run Details
Model:
Grok 4
Test:
N-Length Sentences
Scenario:
Write sentences with 20 words each
Duration:
14.2s
Cost:
$0.006556
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
77.52%
Matches word count
(details)
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The misty fog enveloped the quaint Victorian mansion as Detective Eliza Thorne arrived at the dimly lit driveway late that evening. She stepped out of her car, noticing the eerie silence broken only by distant owl hoots and crunching gravel underfoot. Inside, the grand hallway smelled of polished wood and faint lavender, with shadows dancing from a flickering chandelier overhead. A butler greeted her solemnly, his face pale and drawn, leading her toward the study where tragedy had just unfolded. There, sprawled on the Persian rug, lay the body of millionaire industrialist Victor Hargrove, a single gunshot wound piercing his chest. Eliza knelt down carefully, observing the scattered papers and overturned lamp, clues whispering secrets of betrayal and hidden motives. Outside, rain began to patter against the windows, as suspects gathered in the parlor, each harboring their own dark suspicions.
created_at
"2026-02-20T19:57:40.755Z"
2/20/2026, 7:57:40 PM
model
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
6965
moderation_latency
(null)
generation_time
14164
tokens_prompt
46
tokens_completion
234
native_tokens_prompt
719
native_tokens_completion
401
native_tokens_completion_images
0
native_tokens_reasoning
226
native_tokens_cached
718
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0065565
router
(null)
provider_responses
0
endpoint_id
"d40bef9d-6936-4c57-9749-bf7399de4a77"
id
"3eb1c949-af41-e852-84af-ea963216c8ec"
is_byok
false
latency
74
model_permaslug
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771617460-98I4T5OkcqzyBbZtOzMo"
upstream_id
"3eb1c949-af41-e852-84af-ea963216c8ec"
total_cost
0.0065565
cache_discount
0.0016155
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
77.52%
Matches word count
n/a
neededClean
false
sentences
7
wordCounts
0
21
1
20
2
19
3
20
4
21
5
19
6
20