NC Bench
Run: 2026-06-01-01KT1JA7RMYW3SYQA5Z5Q77G3G
Run Details
Model:
Claude Opus 4.8 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
10.8s
Cost:
$0.011885
Provider:
Anthropic
Evaluation summary
Score:
100.00%
Evaluator:
Count dialogue tags
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door chimed as warm air rushed out, thick with cinnamon and butter. "You're here early. Couldn't sleep either?" "How could anyone sleep? Poor Mr. Whitfield, found cold as yesterday's scones behind the post office." "Lower your voice. Mrs. Pruitt's by the window, and you know she repeats everything." "Let her. Everyone's already whispering. They say it was his heart." "His heart was fine. I saw him jogging past here Tuesday, fit as a fiddle and twice as cheerful." "Then what? You don't think someone—" "I think someone wanted his property, that's what I think. That developer's been sniffing around for months, and Whitfield was the only holdout." "You can't go accusing people based on a hunch." "It's not a hunch. It's a pattern. First the Hendersons sold after their barn fire, then the Coles after that 'accident' with the tractor." "That's just bad luck." "Bad luck doesn't wear a tailored suit and drive a silver sedan. Hand me that rolling pin, would you?" "You're baking? Now?" "I think better with flour on my hands. Besides, the funeral's tomorrow, and somebody has to bring something to the reception." "And while you're there—" "While I'm there, I'll listen. People always talk over cake."
created_at
"2026-06-01T12:25:08.633Z"
6/1/2026, 12:25:08 PM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1651
moderation_latency
(null)
generation_time
10728
tokens_prompt
81
tokens_completion
334
native_tokens_prompt
107
native_tokens_completion
454
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011885
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id
"msg_016bukkS5fgHe7FuTRpWHjTA"
is_byok
false
latency
1651
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780316708-rJDyqLdwTVEi6D33VqSq"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780316708-FWvUpxwiVfHWdPWSRd9x"
upstream_id
"msg_016bukkS5fgHe7FuTRpWHjTA"
total_cost
0.011885
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
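The reported cost can be cross-checked against the native token counts listed above. The following is a minimal sanity-check sketch, not the billing logic of the provider or of this site. The token counts, generation time, and total cost are copied from the fields above; the two per-million-token rates are an assumption (no rates are stated on this page), chosen only because they reproduce the reported total_cost.

```python
from decimal import Decimal

# Values copied from the Meta fields above.
native_tokens_prompt = 107
native_tokens_completion = 454
generation_time_ms = 10728
reported_total_cost = Decimal("0.011885")

# ASSUMPTION: USD per million tokens. These rates are not stated on this page;
# they are illustrative values that happen to reproduce the reported cost.
ASSUMED_INPUT_USD_PER_MTOK = Decimal("5")
ASSUMED_OUTPUT_USD_PER_MTOK = Decimal("25")


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the request cost in USD under the assumed rates."""
    micro_usd = (
        prompt_tokens * ASSUMED_INPUT_USD_PER_MTOK
        + completion_tokens * ASSUMED_OUTPUT_USD_PER_MTOK
    )
    return micro_usd / Decimal(1_000_000)


estimated = estimate_cost(native_tokens_prompt, native_tokens_completion)

# Completion throughput: native completion tokens per second of generation time.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"estimated cost: ${estimated}")
print(f"matches reported total_cost: {estimated == reported_total_cost}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
```

Decimal is used instead of float so that the comparison with the reported figure is exact. The check uses the native token counts (107 and 454) rather than the normalized tokens_prompt and tokens_completion values (81 and 334).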
Evaluation details
Result:
100.00%
Evaluator:
Count dialogue tags
Details:
n/a
Meta Data:
neededClean
false
dialogueTags
(empty)
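The evaluator's implementation is not shown on this page; only its output fields (neededClean, dialogueTags) are. As a rough illustration of what counting dialogue tags could look like, here is a hypothetical sketch that searches for a speech verb next to a quotation mark. The verb list, the two patterns, and the function name find_dialogue_tags are all assumptions made for this example, and a real evaluator would likely handle many more cases.

```python
import re

# ASSUMPTION: a small, illustrative verb list; the real evaluator's rules are unknown.
VERBS = r"said|asked|replied|whispered|muttered|shouted"

# Attribution after a quote: '..." she said' or '..." said Mary'.
TAG_AFTER_QUOTE = re.compile(
    r'["”]\s+(?:[A-Za-z.]+\s+){0,2}(?P<verb>' + VERBS + r')\b',
    re.IGNORECASE,
)

# Attribution before a quote: 'Mary asked, "...'.
TAG_BEFORE_QUOTE = re.compile(
    r'\b(?P<verb>' + VERBS + r')\s*[,:]\s*["“]',
    re.IGNORECASE,
)


def find_dialogue_tags(text: str) -> list[str]:
    """Return matched attribution snippets, one per speech verb found."""
    found: dict[int, str] = {}
    for pattern in (TAG_AFTER_QUOTE, TAG_BEFORE_QUOTE):
        for match in pattern.finditer(text):
            # Key on the verb's position so a verb sitting between two quotes
            # is counted once even though both patterns match it.
            found.setdefault(match.start("verb"), match.group(0).strip())
    return [found[pos] for pos in sorted(found)]


unattributed = '"You\'re baking? Now?" "I think better with flour on my hands."'
attributed = '"Lower your voice," she said. "Mrs. Pruitt is by the window."'

print(find_dialogue_tags(unattributed))
print(len(find_dialogue_tags(attributed)))
```

Under this sketch, an empty result would correspond to the empty dialogueTags field reported above. How the actual evaluator turns its tag count into a percentage score is not documented here.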