NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-31-01KSYX8XKEQ6MFMQ1MBAZ44A8T
Run Details
Model:
Claude Opus 4.8 (Reasoning, High)
Test:
Dialogue tags
Scenario:
Write unattributed dialogue
Duration:
9.4s
Cost:
$0.011285
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
100.00%
Count dialogue tags
(details)
Model input
User/Human
You are an expert novel writer in the cozy mystery genre. Write a scene of about 200 words, with dialogue between two characters. The dialogue should be unattributed, meaning the speaker is not explicitly mentioned. The scene should be set in a small town bakery. Only return the story, nothing else.
Model response
Messages
Meta
AI
The bell above the door chimed as the scent of cinnamon drifted through the warm air. "You're here early. The cinnamon rolls aren't even out of the oven yet." "I'm not here for the rolls. I heard about Mr. Pemberton." "Terrible business, that. Found him slumped over his desk, they say. Heart, probably." "That's what everyone keeps saying. But did you notice his teacup?" "His teacup? Can't say I was invited to the scene, dear. What about it?" "Margaret mentioned it was still full. Stone cold and full to the brim." "So the man didn't finish his tea. Hardly a scandal." "Henry Pemberton drank three cups every morning before he'd say a word to anyone. Never left a drop. You know that as well as I do." "I do, now that you mention it. He'd come in here demanding his scones piping hot and complain if I'd let them sit a minute." "Exactly. So why would a man who couldn't bear cold tea leave a full cup untouched?" "Unless something stopped him before he could drink it." "Or someone." The oven timer buzzed, sharp and sudden in the quiet. "I'll get the rolls. You get the kettle on. If we're going to puzzle this through, we'll need proper tea."
created_at
"2026-05-31T11:38:56.508Z"
5/31/2026, 11:38:56 AM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
999
moderation_latency
(null)
generation_time
9318
tokens_prompt
81
tokens_completion
312
native_tokens_prompt
107
native_tokens_completion
430
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011285
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id
"msg_01KX3PXkzEmqzbXsbZx5JGB9"
is_byok
false
latency
999
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780227536-nUfuo7tiPprdhKFreWzt"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780227536-5X1UTPxMUChemBlTT8RD"
upstream_id
"msg_01KX3PXkzEmqzbXsbZx5JGB9"
total_cost
0.011285
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Count dialogue tags
n/a
neededClean
false
dialogueTags
(empty)