NC Bench
Run: 2026-05-01-01KQHS8M03GJVZ66D6PSE9CZF8
Run Details
Model: Grok 4.20
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 5.6s
Cost: $0.000792
Provider: xAI
Evaluation summary
Score: 100.00%
Evaluator: Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door tinkled as warm cinnamon air wrapped around her like a hug. Flour dusted the wooden counter where two women stood shoulder to shoulder, rolling out pie crusts. “These blackberries are practically bursting. Mrs. Henderson’s bushes must have had a good year.” “Too good. I heard she’s been out there at dawn with a bucket, scaring off the birds with that ridiculous straw hat of hers.” A soft chuckle filled the space between the clatter of rolling pins. Sunlight slanted through the lace curtains, catching on sugar crystals scattered like fresh snow. “Think we should add a lattice top or just keep them open-faced this time?” “Open-faced. Let those berries shine. Besides, Mr. Pickering prefers to see what he’s getting. Reminds him of his late wife’s recipe, poor man.” The dough sighed under their palms, stretching into perfect circles. Outside, a dog barked once and children laughed past the window. “Still can’t believe she used cornstarch instead of tapioca. Ruined three batches before I convinced her.” “Some people have to learn the hard way. Just like life, I suppose.” The scent of vanilla and melted butter drifted between them, sweeter than any secret.
created_at
"2026-05-01T12:47:21.097Z"
5/1/2026, 12:47:21 PM
model
"x-ai/grok-4.20-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
450
moderation_latency
(null)
generation_time
5466
tokens_prompt
81
tokens_completion
327
native_tokens_prompt
183
native_tokens_completion
252
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00079155
router
(null)
provider_responses
0
endpoint_id
"3ecee37f-b217-4093-87fb-aaf0afe307af"
id
"8f935057-8032-0eea-5578-7ad68fabc99f"
is_byok
false
latency
142
model_permaslug
"x-ai/grok-4.20-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777639641-OHQzQQSpHYI1CvjAjrtR"
session_id
(null)
api_type
"completions"
id
"gen-1777639641-bEUgVGqU7KaKYhUlkPlK"
upstream_id
"8f935057-8032-0eea-5578-7ad68fabc99f"
total_cost
0.00079155
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result: 100.00%
Evaluator: Count dialogue tags
Details: n/a
Meta Data:
neededClean: false
dialogueTags: (empty)
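
The report does not show how the "Count dialogue tags" evaluator works. As a rough illustration only, the sketch below shows one way such a check could be approximated: strip the quoted speech, then look for attribution phrases such as "she said" or "said Mary" next to a quote. The verb list, the function names `find_dialogue_tags` and `score`, and the all-or-nothing scoring are assumptions made for this example, not NC Bench's actual implementation.

```python
import re

# Hypothetical, deliberately short verb list; a real evaluator would likely use more.
SPEECH_VERBS = ("said", "asked", "replied", "answered",
                "whispered", "muttered", "shouted", "added")
_VERBS = "|".join(SPEECH_VERBS)

# Quoted speech in curly or straight double quotes.
QUOTED = re.compile(r'“[^”]*”|"[^"]*"')

# After quoted speech is replaced by the marker <Q>, a tag is either
#   <Q> she said / <Q> said Mary   (attribution after the quote), or
#   She said, <Q>                  (attribution before the quote).
TAG = re.compile(
    rf"<Q>\s*(?:(?:\w+\s+){{1,2}}(?:{_VERBS})|(?:{_VERBS})\s+\w+)\b"
    rf"|\b\w+\s+(?:{_VERBS})[,:]?\s*<Q>",
    re.IGNORECASE,
)


def find_dialogue_tags(text: str) -> list[str]:
    """Return the attribution phrases found next to quoted speech."""
    narration = QUOTED.sub("<Q>", text)
    return [m.group(0) for m in TAG.finditer(narration)]


def score(text: str) -> float:
    """Assumed scoring rule: 100.0 when no tags are found, otherwise 0.0."""
    return 100.0 if not find_dialogue_tags(text) else 0.0


if __name__ == "__main__":
    untagged = "“Open-faced. Let those berries shine.” The dough sighed under their palms."
    tagged = "“Hello,” she said. “How are you?”"
    print(find_dialogue_tags(untagged), score(untagged))
    print(find_dialogue_tags(tagged), score(tagged))
```

This heuristic misses tags with longer subjects ("the old baker said") and verbs outside the list, so it is only a starting point for reasoning about what an empty dialogueTags result means.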