NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-18-01KHQ947ZATG2WECSB1JN1XQK6
Run Details
Model:
Claude Sonnet 4.6
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
10.6s
Cost:
$0.005130
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
99.38%
Matches word count
(details)
99.72%
Dialogue to Total Word Ratio
(details)
99.5469%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Right, so the clown is our biggest problem," Molly said, spreading the crayon-drawn map across the kitchen table. "The clown?" "He sees everything, Bill. Those painted eyes never close." "Molly, those are fake painted eyes. He has real eyes underneath." "We don't know that for certain." Bill rubbed his forehead. "What's our entry point?" "Bouncy castle. We blend in during the chaos, grab the presents, and exit through the garden gate before anyone sings happy birthday." "We're both forty-three years old." "Hence the disguises." "I'm not wearing a dinosaur costume." "It was either that or fairy princess." "Give me the dinosaur costume." "Smart choice. Now the presents are stacked near the kitchen, which means we pass through the jelly and sandwiches zone." "The what?" "High traffic, Bill. Sticky fingers, screaming children, someone always crying. It's essentially a war zone." "Why are we stealing from a seven year old again?" "Her father owes us twelve hundred pounds." "Right." "Besides, she's getting another party on Sunday. Different grandparents." "She's living better than us." "Considerably. Now focus. The magician arrives at three and everything changes." "Why?" "Nobody watches anything during the magician, Bill. That's our window."
created_at
"2026-02-18T02:26:47.192041+00:00"
2/18/2026, 2:26:47 AM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
809
moderation_latency
(null)
generation_time
10088
tokens_prompt
77
tokens_completion
353
native_tokens_prompt
75
native_tokens_completion
327
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00513
router
(null)
provider_responses
0
id
"msg_016t6WSW7NzHukxX25F5FZTG"
status
200
is_byok
false
latency
809
endpoint_id
"6d408764-7dd7-4626-bb87-a6cc1589bc86"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
api_type
"completions"
id
"gen-1771381596-UU5FkxJRctz9zTZBPLdI"
upstream_id
"msg_016t6WSW7NzHukxX25F5FZTG"
total_cost
0.00513
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
99.38%
Matches word count
n/a
neededClean
false
words
195
99.72%
Dialogue to Total Word Ratio
Ratio: 92.31%, Deviation: 2.31%
neededClean
false
wordsTotal
195
wordsDialogue
180
99.5469%