NC Bench
Run: 2026-02-18-01KHQ94JMGN9JKCFKVRAMJ264P
Run Details
Model: Claude Sonnet 4.6
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Duration: 8.0s
Cost: $0.004695
Provider: Anthropic
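The reported cost can be cross-checked against the native token counts recorded in the run metadata (75 prompt tokens, 298 completion tokens). The sketch below assumes per-token pricing of $3 per million input tokens and $15 per million output tokens. Those rates are not stated on this page; they are an assumption that happens to reproduce the reported figure. The function name `estimate_cost` is illustrative only.

```python
# Assumed prices in USD per million tokens (NOT stated on the run page).
INPUT_USD_PER_MTOK = 3.00
OUTPUT_USD_PER_MTOK = 15.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    total_micro_usd = (
        prompt_tokens * INPUT_USD_PER_MTOK
        + completion_tokens * OUTPUT_USD_PER_MTOK
    )
    return total_micro_usd / 1_000_000


# native_tokens_prompt and native_tokens_completion from the run metadata.
cost = estimate_cost(75, 298)
print(f"${cost:.6f}")
```

Under these assumed rates the estimate agrees with the `usage` and `total_cost` values of 0.004695. Note that the normalized counts (`tokens_prompt` 77, `tokens_completion` 313) would give a different total, which suggests billing is based on the native counts.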
Evaluation summary
| Score | Evaluator | Details Link |
|---|---|---|
| 86.38% | Matches word count | (details) |
| 32.20% | Dialogue to Total Word Ratio | (details) |
| 59.2909% | Overall | |
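The overall score appears consistent with an unweighted mean of the two evaluator scores. The page does not document the aggregation rule, so the sketch below is an assumption checked only against the displayed numbers; `overall_score` is a hypothetical helper name.

```python
# Evaluator scores as displayed on the run page (rounded to two decimals).
scores = {
    "Matches word count": 86.38,
    "Dialogue to Total Word Ratio": 32.20,
}


def overall_score(evaluator_scores: dict) -> float:
    """Unweighted mean of evaluator scores (assumed aggregation rule)."""
    return sum(evaluator_scores.values()) / len(evaluator_scores)


overall = overall_score(scores)
print(f"{overall:.2f}%")
```

The mean of the displayed scores is 59.29%, which matches the reported 59.2909% to two decimals. The extra digits in the reported value suggest the page averages unrounded evaluator scores.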
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her back against the garden fence, clutching a clipboard like she was coordinating a military operation rather than robbing a six year old's birthday party. "The bouncy castle is our biggest obstacle," she whispered, sketching a rough diagram with her pen. "How is a bouncy castle an obstacle?" Bill asked. "Because you'll want to go on it. I know you, Bill." Bill opened his mouth to object, then closed it again. She had a point. "The presents are stacked by the gazebo," Molly continued, tapping her clipboard. "We go in during musical chairs. Everyone watches musical chairs." "What about the clown?" "What about him?" "He's been staring at me for ten minutes." Molly glanced over the fence. The clown was indeed staring directly at Bill with an expression of profound suspicion. "Clowns always look like that," she said. "That one has a walkie talkie." Molly looked again. He absolutely did. "Okay, new plan," she said. "Run away?" "Run away," she confirmed. They both sprinted down the street, Bill somehow still managing to glance longingly back at the bouncy castle as the clown calmly radioed in their description.
created_at: "2026-02-18T02:26:55.457517+00:00" (2/18/2026, 2:26:55 AM)
model: "anthropic/claude-4.6-sonnet-20260217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 714
moderation_latency: (null)
generation_time: 7958
tokens_prompt: 77
tokens_completion: 313
native_tokens_prompt: 75
native_tokens_completion: 298
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.004695
router: (null)
provider_responses:
  0:
    id: "msg_01XjF6qZxGtoJ82UYjGqxyFQ"
    status: 200
    is_byok: false
    latency: 713
    endpoint_id: "6d408764-7dd7-4626-bb87-a6cc1589bc86"
    provider_name: "Anthropic"
    model_permaslug: "anthropic/claude-4.6-sonnet-20260217"
api_type: "completions"
id: "gen-1771381607-0xNkgtgCBu4ZKbrW02Pr"
upstream_id: "msg_01XjF6qZxGtoJ82UYjGqxyFQ"
total_cost: 0.004695
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Anthropic"
Evaluation details
| Result | Evaluator | Details | Meta Data |
|---|---|---|---|
| 86.38% | Matches word count | n/a | neededClean: false, words: 189 |
| 32.20% | Dialogue to Total Word Ratio | Ratio: 39.68%, Deviation: 10.32% | neededClean: false, wordsTotal: 189, wordsDialogue: 75 |
| 59.2909% | Overall | | |
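The ratio and deviation in the details follow directly from the word counts: 75 dialogue words out of 189 total gives 39.68%, which is 10.32 percentage points below the 50% target. The sketch below shows one way such counts could be produced. It assumes words are whitespace-separated tokens and that dialogue is any text between straight double quotes. The evaluator's actual tokenization and its mapping from deviation to the 32.20% score are not documented here, so `dialogue_stats` is a hypothetical helper and its counts may differ from the evaluator's.

```python
import re


def dialogue_stats(text: str, target_ratio: float = 0.5):
    """Return (total_words, dialogue_words, ratio, deviation) for a scene.

    Assumptions: words are whitespace-separated tokens, and dialogue is
    any text enclosed in straight double quotes.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue / total if total else 0.0
    return total, dialogue, ratio, abs(target_ratio - ratio)


# Small sample in the style of the model response.
sample = '"Run away?" Bill asked. "Run away," she confirmed.'
total, dialogue, ratio, deviation = dialogue_stats(sample)
print(total, dialogue, f"{ratio:.2%}", f"{deviation:.2%}")

# Reproduce the reported figures from the reported counts.
reported_ratio = 75 / 189
reported_deviation = abs(0.5 - reported_ratio)
print(f"{reported_ratio:.2%}", f"{reported_deviation:.2%}")
```

The second calculation uses only the counts shown in the Meta Data column, so it holds regardless of how the evaluator tokenizes the text.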