NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-20-01KHY8C7MH2102V6JW5VRDP1CQ
Run Details
Model:
Grok 4
Test:
N-Length Sentences
Scenario:
Write sentences with 20 words each
Duration:
15.3s
Cost:
$0.008911
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
55.16%
Matches word count
(details)
Model input
User/Human
Please write the opening scene of a murder mystery where each sentence has exactly 20 words. Please make sure that each sentence has the same length. Only return the opening itself.
Model response
Messages
Meta
AI
The fog-shrouded manor loomed on the hilltop, its ancient stone walls whispering secrets of long-forgotten scandals and hidden betrayals. Detective Elena Voss approached the wrought-iron gates, her trench coat billowing in the chill autumn wind that carried distant thunder. Inside, candlelight flickered in the grand hall, casting eerie shadows on portraits of stern ancestors staring down accusingly. A butler with trembling hands opened the heavy oak door, his face pale as if he'd seen a ghost. Elena stepped into the opulent drawing room, where guests in formal attire murmured nervously around a velvet-draped table. Suddenly, a scream pierced the tension, echoing from the upstairs library where the wealthy host was expected to appear. Rushing up the spiral staircase, Elena found the door ajar, revealing a scene of chaos and spilled crimson blood. The host lay motionless on the Persian rug, a antique dagger protruding from his chest, his eyes frozen in terror. Clues scattered like puzzle pieces: a torn letter, a shattered vase, and footprints leading to the open window. Elena knelt to examine the body, knowing this murder would unravel a web of lies among the assembled suspects.
created_at
"2026-02-20T19:28:10.397Z"
2/20/2026, 7:28:10 PM
model
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
8610
moderation_latency
(null)
generation_time
15244
tokens_prompt
46
tokens_completion
315
native_tokens_prompt
719
native_tokens_completion
558
native_tokens_completion_images
0
native_tokens_reasoning
322
native_tokens_cached
718
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.0089115
router
(null)
provider_responses
0
endpoint_id
"d40bef9d-6936-4c57-9749-bf7399de4a77"
id
"805035db-bd28-79bd-984b-ecd59a41eb88"
is_byok
false
latency
1557
model_permaslug
"x-ai/grok-4-07-09"
4/7/2009, 12:00:00 AM
provider_name
"xAI"
status
200
api_type
"completions"
id
"gen-1771615690-u7v7OQGgW90ybsRDC71W"
upstream_id
"805035db-bd28-79bd-984b-ecd59a41eb88"
total_cost
0.0089115
cache_discount
0.0016155
upstream_inference_cost
0
provider_name
"xAI"
Evaluation details
Result
Evaluator
Details
Meta Data
55.16%
Matches word count
n/a
neededClean
false
sentences
10
wordCounts
0
21
1
21
2
18
3
19
4
19
5
19
6
19
7
20
8
18
9
19