NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-29-01KSSK1C2XG6QAP4GNTAGQ2DR5
Run Details
Model:
Claude Opus 4.8 (Reasoning)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
9.5s
Cost:
$0.011760
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door chimed as warm air rushed in, thick with cinnamon and vanilla. "You're here early. The scones aren't even out yet." "I'm not here for scones. I'm here because Margaret Ableton was found in her garden this morning." A pause. The soft thud of a flour sack hitting the counter. "Found? You mean—" "Dead. Among her prize-winning roses, no less. The whole town's buzzing." "That's terrible. She was just in here yesterday, buying her usual lemon tart." "Was she alone?" "Now what kind of question is that?" "The kind I ask when someone dies the day after visiting the last place they were seen happy." The scrape of a chair against tile. "She wasn't alone, if you must know. She was arguing with someone. Couldn't hear about what, but she left without her tart. First time in fifteen years." "Who was she with?" "I didn't get a good look. Tall fellow, dark coat. Slipped out the back when she stormed off." "The back? Through the kitchen?" "He said he was a health inspector." "Margaret was arguing with a health inspector?" "That's what I thought was strange. We weren't due for inspection until spring." "Then who, exactly, were you letting wander through your kitchen?" The oven timer rang, sharp and sudden.
created_at
"2026-05-29T10:03:51.271Z"
5/29/2026, 10:03:51 AM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1149
moderation_latency
(null)
generation_time
9456
tokens_prompt
81
tokens_completion
342
native_tokens_prompt
107
native_tokens_completion
449
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.01176
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id
"msg_01PmPZo3xhv8sgmbXK9ns1oe"
is_byok
false
latency
1147
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780049031-HV3HZOyDY5qZFXH1gjX9"
session_id
(null)
api_type
"completions"
id
"gen-1780049031-4Q4pp93rBnoRwoZyKou9"
upstream_id
"msg_01PmPZo3xhv8sgmbXK9ns1oe"
total_cost
0.01176
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)