NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHYB8AGZVSKMM4NC65FCJ2P6
Run Details
Model:
Grok 4
Test:
N-Length Sentences
Scenario:
Write sentences with 10 words each
Duration:
12.1s
Cost:
$0.005267
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
74.08%
Matches word count
(details)
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 10 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The fog shrouded the old mansion on the hilltop. A single light flickered in the upstairs window pane. Detective Ellis arrived, his coat damp from the mist. He knocked firmly on the heavy oak front door. No answer came, only echoes in the silent night. Pushing the door open, he stepped into the foyer. A chilling scream pierced the air from somewhere upstairs. Ellis drew his revolver, heart pounding in his chest. He climbed the creaking stairs, shadows dancing around him. At the landing, a body lay sprawled in blood.
created_at
"2026-02-20T20:18:28.011Z"
2/20/2026, 8:18:28 PM
model
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
6912
moderation_latency
(null)
generation_time
12027
tokens_prompt
46
tokens_completion
146
native_tokens_prompt
719
native_tokens_completion
315
native_tokens_completion_images
0
native_tokens_reasoning
204
native_tokens_cached
718
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0052665
router
(null)
provider_responses
0
endpoint_id
"d40bef9d-6936-4c57-9749-bf7399de4a77"
id
"9cec6f38-e403-0d8e-bf77-83cc22da9a30"
is_byok
false
latency
81
model_permaslug
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771618708-cC2EgXzFTl0FpQrxscqS"
upstream_id
"9cec6f38-e403-0d8e-bf77-83cc22da9a30"
total_cost
0.0052665
cache_discount
0.0016155
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
74.08%
Matches word count
n/a
neededClean
false
sentences
10
wordCounts
0
9
1
9
2
9
3
9
4
9
5
9
6
9
7
9
8
9
9
9