NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY7YH6EPS93J9R78VFTC16C
Run Details
Model:
Grok 4
Test:
N-Length Sentences
Scenario:
Write sentences with 20 words each
Duration:
16.6s
Cost:
$0.007949
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
63.81%
Matches word count
(details)
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The fog-shrouded mansion loomed on the hill, its windows like eyes watching the guests arrive one by one that night. Eleanor stepped inside first, her elegant gown whispering secrets, while the host greeted her with a smile hiding unknown motives. Detective Harlan followed closely behind, invited unexpectedly, his keen eyes scanning the room for any signs of impending trouble. The butler announced dinner promptly at eight, as thunder rumbled outside, setting a tense atmosphere among the assembled strangers. Seated around the antique table, conversations flowed uneasily, with whispers of old grudges surfacing like ghosts from the past. Suddenly, a scream pierced the air from upstairs, freezing everyone in place as footsteps hurried toward the source of alarm. They found Lord Harrington sprawled on the floor, blood pooling beneath him, a dagger clutched in his lifeless hand. Eleanor gasped in horror at the sight, while Harlan knelt down, examining clues that might reveal the killer's identity. The storm raged on outside the windows, trapping them all inside with suspicions mounting higher than the howling winds. Who among these guests held the motive, Harlan wondered silently, as alibis began crumbling under his piercing gaze.
created_at
"2026-02-20T19:20:41.433Z"
2/20/2026, 7:20:41 PM
model
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
7197
moderation_latency
(null)
generation_time
16542
tokens_prompt
46
tokens_completion
323
native_tokens_prompt
719
native_tokens_completion
488
native_tokens_completion_images
0
native_tokens_reasoning
243
native_tokens_cached
679
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00794925
router
(null)
provider_responses
0
endpoint_id
"d40bef9d-6936-4c57-9749-bf7399de4a77"
id
"a8ae87bf-2a69-785f-6dce-1dd62c56af1d"
is_byok
false
latency
382
model_permaslug
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771615241-tr5d70YRsYD98K5ZK15K"
upstream_id
"a8ae87bf-2a69-785f-6dce-1dd62c56af1d"
total_cost
0.00794925
cache_discount
0.00152775
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
63.81%
Matches word count
n/a
neededClean
false
sentences
10
wordCounts
0
21
1
20
2
19
3
19
4
19
5
20
6
19
7
19
8
19
9
18