NC Bench
Run: 2026-05-31-01KSYXES3MAVHN8AJ7755B0B1Q
Run Details
Model: Claude Opus 4.8 (Reasoning, High)
Test: Dialogue tags
Scenario: Write unattributed dialogue
Duration: 9.4s
Cost: $0.011510
Provider: Anthropic
Evaluation summary
Score: 100.00%
Evaluator: Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door chimed as warm air rushed out to greet the morning. "You're here early. Couldn't sleep again?" "How could I? After what they found behind the post office, nobody in this town is sleeping." "Keep your voice down. Mrs. Pennyworth is just by the muffins." "She's half deaf, and besides, she was there last night too. I saw her car." "You're imagining things. Here, try the cinnamon rolls. Fresh out." "Don't change the subject. That brooch they found—I'd swear it belonged to Eleanor." "Eleanor moved to Florida six months ago." "Did she? Or is that just what we were told?" A long pause, broken only by the scrape of a spatula against the pan. "You think too much. It's bad for the digestion." "And you think too little. Doesn't it bother you, the way she just vanished? No goodbye, no forwarding address?" "People leave. It happens." "Not Eleanor. She never missed a Sunday here. Said my sourdough was the only thing keeping her in this town." The spatula stopped. "She said that?" "The week before she disappeared. Word for word." The two women looked at each other across the flour-dusted counter, the silence thick as rising dough. "Lock the door," one whispered. "We need to talk."
created_at: "2026-05-31T11:42:08.513Z" (5/31/2026, 11:42:08 AM)
model: "anthropic/claude-4.8-opus-20260528"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1077
moderation_latency: (null)
generation_time: 9326
tokens_prompt: 81
tokens_completion: 335
native_tokens_prompt: 107
native_tokens_completion: 439
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
service_tier: "default"
usage: 0.01151
router: (null)
provider_responses: 0
endpoint_id: "b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id: "msg_0116R8HKiAThnLCcL2a8XzV5"
is_byok: false
latency: 1077
model_permaslug: "anthropic/claude-4.8-opus-20260528"
provider_name: "Anthropic"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer: (null)
request_id: "req-1780227728-gcmP0Pxq757fPIQbzrVU"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1780227728-mOlPUbTxd0d8j3WgzRaN"
upstream_id: "msg_0116R8HKiAThnLCcL2a8XzV5"
total_cost: 0.01151
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Anthropic"
response_cache_source_id: (null)
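The reported total_cost can be cross-checked against the native token counts. The sketch below is only an illustration: the per-million-token rates (5 USD for input, 25 USD for output) are assumptions that do not appear anywhere on this page, chosen because they reproduce the reported figure, and the function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# ASSUMED rates in USD per million tokens; the page does not state pricing.
INPUT_USD_PER_MTOK = Decimal("5")
OUTPUT_USD_PER_MTOK = Decimal("25")


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate request cost in USD from native token counts (hypothetical helper)."""
    input_cost = prompt_tokens * INPUT_USD_PER_MTOK
    output_cost = completion_tokens * OUTPUT_USD_PER_MTOK
    return (input_cost + output_cost) / Decimal(1_000_000)


# native_tokens_prompt and native_tokens_completion from the metadata above
estimated = estimate_cost(107, 439)
print(estimated)
```

Under these assumed rates, 107 input tokens contribute 0.000535 USD and 439 output tokens contribute 0.010975 USD, which sums to the 0.01151 shown as total_cost.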
Evaluation details
Result: 100.00%
Evaluator: Count dialogue tags
Details: n/a
Meta Data:
neededClean: false
dialogueTags: (empty)
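The page does not show how the "Count dialogue tags" evaluator works. As a rough illustration of the idea only, a dialogue tag can be approximated as a closing quotation mark followed by at most two words and then a speech verb. Everything in the sketch below is an assumption: the verb list, the regular expression, and the name `find_dialogue_tags` are hypothetical, and the heuristic may well disagree with the benchmark's own evaluator on a given text.

```python
import re

# Hypothetical verb list; the benchmark's actual rules are not shown on the page.
SPEECH_VERBS = ("said", "asked", "replied", "whispered", "muttered", "shouted")

# Closing punctuation and quote, whitespace, up to two words (a name or
# pronoun), then one of the speech verbs as a whole word.
_TAG_RE = re.compile(
    r'[,?!.]["\u201d]\s+(?:\w+\s+){0,2}(?:%s)\b' % "|".join(SPEECH_VERBS),
    re.IGNORECASE,
)


def find_dialogue_tags(text: str) -> list[str]:
    """Return the substrings that look like dialogue tags under this heuristic."""
    return [match.group(0) for match in _TAG_RE.finditer(text)]


attributed = '"You\'re here early," she said. "Couldn\'t sleep?" asked Tom.'
unattributed = '"You\'re here early." "How could I sleep?"'

print(len(find_dialogue_tags(attributed)))
print(len(find_dialogue_tags(unattributed)))
```

A simple pattern like this misses tags placed before the quotation and can flag narration that happens to contain a listed verb, so it is a starting point for experimentation rather than a substitute for the evaluator.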