NC Bench
Run: 2026-02-18-01KHQ9546EWJH075XRWQJ5PVB4
Run Details
Model: Claude Sonnet 4.6
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 14.8s
Cost: $0.010290
Provider: Amazon Bedrock
Evaluation summary
Matches word count: 99.98%
Dialogue to Total Word Ratio: 0.00%
Overall: 49.9927%
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain had been falling sideways for forty minutes, which Bill considered a personal insult from the universe. He hunched over the hand-drawn map spread across the dashboard of the van, the paper slowly absorbing the humidity of his breathing and curling at the corners like a sad letter nobody wanted to read. Molly sat beside him eating a bag of crisps with the enthusiasm of someone who had not just spent three hours in a hedge conducting surveillance on a six-year-old's birthday party. Bill tapped the map with a pen that had run out of ink, which made the gesture purely theatrical. The target was a semi-detached house on Primrose Close, currently occupied by approximately fourteen children aged between five and seven, two tired parents, one exhausted grandmother, and a man in a clown costume who had arrived on a bicycle and hadn't seemed terribly happy about it. "The presents are stacked in the hallway," Bill said. "I counted at least twenty. Maybe twenty-five." Molly crunched loudly. "Did you count the clown as a threat or a distraction?" "Both," he said grimly. The plan, as much as it could be called that, involved waiting until the pass-the-parcel portion of the festivities reached its musical climax, at which point the hallway would be entirely unoccupied and the side gate would be accessible through a latch that Bill had already established was broken. He had established this by accidentally walking into it while pretending to be someone who lived on the street, a performance he felt deserved considerably more credit than it received. Molly folded the crisp packet into a neat square and tucked it into her pocket, which was the most organised thing she had done all week. "The wrapping paper alone is worth something," she said. "People go mad for nice wrapping paper." Bill stared at her. "We are not stealing the wrapping paper, Molly." "I'm just saying it was expensive looking." "We are professionals." 
The word professionals hung between them in the foggy van with considerably less dignity than Bill had intended. They were, by any reasonable assessment, two adults in waterproof jackets planning to rob a child's birthday party in the rain on a Tuesday afternoon. The moral architecture of this particular enterprise was not something Bill allowed himself to examine too closely, on the grounds that close examination had a way of leading to lifestyle changes he wasn't ready for. Through the windscreen, the house glittered with fairy lights. A burst of high-pitched screaming erupted from somewhere inside, followed immediately by the unmistakable sound of a kazoo being played badly. "That'll be pass the parcel starting," Bill said, reaching for the door handle. "Should we feel bad about this?" Molly asked. He paused, hand on the handle, rain drumming on the roof like impatient fingers. "The presents are insured," he said, though he had no possible basis for knowing this. They got out of the van.
created_at: "2026-02-18T02:27:20.47634+00:00" (2/18/2026, 2:27:20 AM)
model: "anthropic/claude-4.6-sonnet-20260217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1095
moderation_latency: 130
generation_time: 14620
tokens_prompt: 77
tokens_completion: 769
native_tokens_prompt: 75
native_tokens_completion: 671
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.01029
router: (null)
provider_responses:
  0:
    id: "3c157daf-7f16-4103-ac51-4b0a662d33ce"
    status: 200
    is_byok: false
    latency: 1095
    endpoint_id: "83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
    provider_name: "Amazon Bedrock"
    model_permaslug: "anthropic/claude-4.6-sonnet-20260217"
api_type: "completions"
id: "gen-1771381625-bdF88OWmE8rLMgfjvxZV"
upstream_id: "3c157daf-7f16-4103-ac51-4b0a662d33ce"
total_cost: 0.01029
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
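As a rough sanity check on the recorded cost, the token counts above can be multiplied by per-token prices. The sketch below is only illustrative. The rates of $3 per million input tokens and $15 per million output tokens are assumptions that do not appear on this page, and treating generation_time as milliseconds is likewise an assumption. With those values, the native token counts happen to reproduce the recorded total_cost.

```python
# Assumed prices in USD per million tokens; these are NOT stated on this page.
ASSUMED_INPUT_PRICE_PER_MTOK = 3.00
ASSUMED_OUTPUT_PRICE_PER_MTOK = 15.00

# Values copied from the run metadata.
native_tokens_prompt = 75
native_tokens_completion = 671
generation_time_ms = 14620  # assumed to be in milliseconds

# Cost estimate from the native token counts and the assumed prices.
estimated_cost = (
    native_tokens_prompt * ASSUMED_INPUT_PRICE_PER_MTOK
    + native_tokens_completion * ASSUMED_OUTPUT_PRICE_PER_MTOK
) / 1_000_000

# Completion throughput over the generation window.
tokens_per_second = native_tokens_completion / (generation_time_ms / 1000)

print(f"estimated cost: ${estimated_cost:.6f}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
```

Under these assumptions the estimate comes to $0.010290, matching the Cost field, and the throughput is roughly 46 completion tokens per second.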
Evaluation details
Matches word count
Result: 99.98%
Details: n/a
Meta Data: neededClean: false, words: 498

Dialogue to Total Word Ratio
Result: 0.00%
Details: Ratio: 15.06%, Deviation: 14.94%
Meta Data: neededClean: false, wordsTotal: 498, wordsDialogue: 75

Overall: 49.9927%
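The ratio and deviation reported above follow from the recorded counts: 75 dialogue words out of 498 total is 15.06%, which falls 14.94 percentage points short of the 30% target. The sketch below shows one way such a check could be computed. It is not NC Bench's actual evaluator. The function name `dialogue_stats`, the whitespace tokenization, and the rule that dialogue is any text between straight double quotes are all assumptions made for illustration, and a different tokenizer may produce slightly different counts. How the deviation is mapped to the 0.00% score is not shown on this page and is not modelled here.

```python
import re


def dialogue_stats(text: str, target_ratio: float = 0.30) -> dict:
    """Hypothetical helper: compare quoted words against all words."""
    # Assumption: words are whitespace-separated tokens.
    total_words = len(text.split())
    # Assumption: dialogue is any span between straight double quotes.
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return {
        "words_total": total_words,
        "words_dialogue": dialogue_words,
        "ratio": ratio,
        "deviation": abs(ratio - target_ratio),
    }


# Small illustrative sample, not the full model response.
sample = '"We are professionals." The word hung between them.'
stats = dialogue_stats(sample)
print(stats["words_dialogue"], "of", stats["words_total"], "words are dialogue")

# Reproducing the reported figures from the recorded counts.
recorded_ratio = 75 / 498
recorded_deviation = 0.30 - recorded_ratio
print(f"ratio {recorded_ratio:.2%}, deviation {recorded_deviation:.2%}")
```

The overall figure of 49.9927% is consistent with an unweighted mean of the two evaluator scores, although the page does not state how it is aggregated.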