NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-01-01KQHRYHW47P7SD22RGM9MV5QK
Run Details
Model:
Grok 4.20 (Reasoning)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
31.6s
Cost:
$0.006589
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
In the heart of Maplewood, the Sweet Whisk Bakery hummed with the scent of cinnamon rolls and fresh coffee. Morning light spilled across the flour-dusted counter where two old friends worked side by side, one shaping dough, the other arranging glistening fruit tarts. "You won't believe what I found tucked under the doormat this morning." "What was it? Another one of those anonymous letters?" "Precisely. It said the secret ingredient in the award-winning pie isn't what everyone thinks." "That's mysterious. Do you think it's referring to the blueberry pie that won at the fair?" "Has to be. Old Mr. Jenkins was so proud of it, but now with him missing..." "Missing? I thought he was just on vacation." "That's what they want us to think. But I saw his truck parked behind the old mill last night." "Should we tell the sheriff?" "Not yet. Let's see if we can piece this together ourselves. These things always have a way of unraveling in the most unexpected ways." "Like that time with the missing cake stand?" "Exactly. Hand me the cinnamon, would you? These rolls need just a touch more." The morning light streamed in, illuminating flour motes dancing in the air as the women continued their whispered conversation, the town’s quiet mystery lingering like sugar on the tongue.
created_at
"2026-05-01T12:41:51.242Z"
5/1/2026, 12:41:51 PM
model
"x-ai/grok-4.20-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
29548
moderation_latency
(null)
generation_time
31483
tokens_prompt
81
tokens_completion
394
native_tokens_prompt
185
native_tokens_completion
2570
native_tokens_completion_images
(null)
native_tokens_reasoning
2304
native_tokens_cached
64
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00658905
router
(null)
provider_responses
0
endpoint_id
"3ecee37f-b217-4093-87fb-aaf0afe307af"
id
"87cfcd10-aac0-975d-b324-52fd7ea75be0"
is_byok
false
latency
216
model_permaslug
"x-ai/grok-4.20-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777639311-ojAlEjDXqawBNkJZqfel"
session_id
(null)
api_type
"completions"
id
"gen-1777639311-7bHp1gmWIez3rBN0h7q1"
upstream_id
"87cfcd10-aac0-975d-b324-52fd7ea75be0"
total_cost
0.00658905
cache_discount
0.0000672
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)