NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-01-01KQHRRTWDNAM4Q3E6RB9XMA05
Run Details
Model:
Grok 4.20
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
4.2s
Cost:
$0.000677
Provider:
xAI
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the bakery door gave a cheerful tinkle as Harriet stepped inside, the warm scent of cinnamon and yeast wrapping around her like a hug. Sunlight slanted through the gingham curtains, catching on the flour-dusted counter where Agnes was sliding a tray of golden scones from the oven. “Still warm from the gossip mill, I see.” “Would you expect anything less in Maplewood? That new librarian was in here at dawn, buying two bear claws and asking the strangest questions about the old mill.” Harriet selected a lemon poppy seed muffin, turning it slowly. “Questions about the mill… or about the night Everett’s tractor mysteriously rolled into the pond?” “Both. She wanted to know if the mill wheel had ever been used to grind anything besides grain.” A soft laugh escaped as Agnes wiped her hands on her apron. “I told her the only thing ground at that mill lately is reputations.” Harriet broke off a piece of muffin, savoring the bright citrus burst. “Then we’d better keep our eyes open. Small towns have big secrets, and I suspect this one is about to rise like your sourdough.”
created_at
"2026-05-01T12:38:43.86Z"
5/1/2026, 12:38:43 PM
model
"x-ai/grok-4.20-20260309"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
386
moderation_latency
(null)
generation_time
4117
tokens_prompt
81
tokens_completion
296
native_tokens_prompt
183
native_tokens_completion
233
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
128
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"completed"
usage
0.00067685
router
(null)
provider_responses
0
endpoint_id
"3ecee37f-b217-4093-87fb-aaf0afe307af"
id
"9483f6ee-c47d-8401-ec96-293898e28391"
is_byok
false
latency
89
model_permaslug
"x-ai/grok-4.20-20260309"
provider_name
"xAI"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777639123-bBAfWJLrZPEMbuHYA5AI"
session_id
(null)
api_type
"completions"
id
"gen-1777639123-RE5loG06Kxt7yo2twXS2"
upstream_id
"9483f6ee-c47d-8401-ec96-293898e28391"
total_cost
0.00067685
cache_discount
0.0001344
upstream_inference_cost
0
provider_name
"xAI"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)