NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-18-01KHQ91S44HXCGAKEGAK21JYH2
Run Details
Model:
Claude Sonnet 4.6
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
17.5s
Cost:
$0.010545
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
5.76%
Dialogue to Total Word Ratio
(details)
52.8802%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain came down in sheets outside the van, drumming against the roof in a way that Molly found deeply motivating. She spread the hand-drawn map across the dashboard, knocking an empty coffee cup onto Bill's lap in the process. Bill yelped and dabbed at his trousers with a napkin that had seen better days. He peered at the map through his thick glasses, tilting his head sideways as though the floor plan of a suburban backyard might reveal more secrets from a different angle. "The bounce house is here," Molly said, stabbing the paper with her finger. "Which means the present table has to be somewhere near the back fence." Bill squinted. "That's a flower. You drew a flower there." "That's the bounce house." "Why does the bounce house have petals?" Molly snatched the map back and stared at it. The bounce house did, somewhat undeniably, have petals. She had been distracted when she drew it, watching a documentary about decorative garden features. She folded the map with great dignity and stuffed it into the glove compartment. Their target was the Pemberton residence on Sycamore Close, where one Brandon Pemberton was celebrating his seventh birthday with approximately twenty of his closest screaming associates. The intelligence they had gathered, which consisted primarily of an invitation Molly had found blowing down the street, suggested the party ran from two until five. It was currently two forty-seven. Peak chaos. Peak distraction. Every adult within fifty meters would be locked in the thousand-yard stare of parents watching musical chairs unfold in real time. Bill pulled two large canvas tote bags from the back seat. They were from a farmers market and featured a cheerful illustration of a carrot on the side. He handed one to Molly with the solemnity of a general distributing weapons before battle. "We go in through the side gate," she said, "move along the fence line, keep low, grab the presents from the table, and we are back in the van in four minutes." "What if someone sees us?" "We are adults at a children's party. Nobody questions adults at a children's party. We look like we belong and we belong entirely." "I'm wearing a balaclava," Bill said. Molly looked at him. He was, in fact, wearing a balaclava. She had not noticed this until now, which said something about her powers of observation that she preferred not to examine closely. The balaclava was black and rolled up partially so it sat on his head like a very committed hat. "Take that off immediately," she said. "I thought it added professionalism." "It adds prison sentences." Bill removed the balaclava with the wounded expression of a man whose creative contributions are consistently undervalued. He flattened his hair with one hand and picked up his carrot bag with the other. From inside the Pemberton garden, the sound of twenty children singing happy birthday rose above the rain like a chaotic, slightly flat battle cry. It was time.
created_at
"2026-02-18T02:25:33.685773+00:00"
2/18/2026, 2:25:33 AM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
534
moderation_latency
(null)
generation_time
17431
tokens_prompt
77
tokens_completion
771
native_tokens_prompt
75
native_tokens_completion
688
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.010545
router
(null)
provider_responses
0
id
"msg_01UStK9Tw5TLNJzpnH9kUQyL"
status
200
is_byok
false
latency
528
endpoint_id
"6d408764-7dd7-4626-bb87-a6cc1589bc86"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
api_type
"completions"
id
"gen-1771381515-pSWx1WE1RrHqOBL2Rdrm"
upstream_id
"msg_01UStK9Tw5TLNJzpnH9kUQyL"
total_cost
0.010545
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
499
5.76%
Dialogue to Total Word Ratio
Ratio: 22.44%, Deviation: 7.56%
neededClean
false
wordsTotal
499
wordsDialogue
112
52.8802%