NC Bench
Run: 2026-02-18-01KHQ915338MEM25VEK2RG2GYN
Run Details
Model: Claude Sonnet 4.6
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 16.1s
Cost: $0.010560
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator | Details Link
99.92% | Matches word count | (details)
0.07% | Dialogue to Total Word Ratio | (details)
49.9929% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the windshield of Bill's rusted Pontiac as he spread the hand-drawn map across the steering wheel. It had taken him three full days to complete, and Molly had to admit, for a man who still used MapQuest, it was impressively detailed. He had color-coded the entry points with highlighters and drawn tiny cartoon children near what he was calling the threat zones. "You put googly eyes on the clown," Molly said. "For morale," Bill replied, completely serious. The target was a birthday party for one Ethan Prescott, turning seven years old this Saturday at the Prescott family home on Sycamore Drive. According to Bill's intelligence, which he had gathered primarily by standing near the school fence looking suspicious, there would be approximately fourteen children in attendance. The gift table, positioned beneath the kitchen window based on previous parties held at this address, would contain an estimated forty-seven wrapped presents. Molly studied the map and tapped the area marked BOUNCY CASTLE with her fingernail. Getting through or around the inflatable structure would be their biggest logistical challenge. Children treated bouncy castles the way generals treated fortified positions, chaotically and with enormous enthusiasm. "We go in through the side gate," Molly said, "while the magician has their attention." "The magician starts at two. We have a seven minute window before the balloon animals begin." "Why does balloon animal time matter?" "Kids scatter during balloon animals," Bill explained. "Every direction. No pattern, no logic. It's chaos theory with juice boxes." Molly pressed her lips together and nodded slowly. This was unfortunately accurate. She had once attended a birthday party as a legitimate guest and still had mild trauma from the balloon animal segment. The plan itself was elegantly simple, which meant it had roughly a forty percent chance of going catastrophically wrong. 
They would pose as late-arriving relatives, navigate past the mother stationed at the gift table, load the presents into a canvas bag disguised as a diaper bag, and exit before the birthday cake arrived. Children under seven, Bill had theorized, paid almost no attention to adults moving purposefully with bags. They were too busy screaming. "What about the dog?" Molly asked. Bill went very quiet. "There is no dog," he said carefully. "Bill. I saw a dog door." He folded the map with the focused concentration of a man trying to delay a conversation indefinitely. The rain intensified. A passing car threw a wave of gutter water across the windshield and both of them watched it slide down in silence. "His name is apparently Biscuit," Bill finally admitted. "You named the dog." "I did some thorough reconnaissance." Molly stared at him for a long moment before returning her gaze to the foggy window and the dark suburban street beyond it. Somewhere out there, forty-seven presents sat wrapped and waiting beneath a kitchen window. Somewhere closer than she would like, Biscuit was presumably also waiting. She really needed the money.
created_at: "2026-02-18T02:25:11.683495+00:00" (2/18/2026, 2:25:11 AM)
model: "anthropic/claude-4.6-sonnet-20260217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 652
moderation_latency: 134
generation_time: 15779
tokens_prompt: 77
tokens_completion: 806
native_tokens_prompt: 75
native_tokens_completion: 689
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.01056
router: (null)
provider_responses:
  0:
    id: "dfc739df-c144-4d18-bddb-310e6426e657"
    status: 200
    is_byok: false
    latency: 651
    endpoint_id: "83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
    provider_name: "Amazon Bedrock"
    model_permaslug: "anthropic/claude-4.6-sonnet-20260217"
api_type: "completions"
id: "gen-1771381495-P7ncLDmZIbgfposepzWb"
upstream_id: "dfc739df-c144-4d18-bddb-310e6426e657"
total_cost: 0.01056
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
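The reported cost can be cross-checked against the native token counts in the metadata. The sketch below assumes rates of $3 per million input tokens and $15 per million output tokens. These rates are not stated anywhere in the run record; they are an assumption that happens to reproduce the reported total. The function name is hypothetical.

```python
# Assumed rates in USD per million tokens. These are NOT part of the run
# record; they are an assumption that reproduces the reported total.
INPUT_RATE_PER_MTOK = 3.00
OUTPUT_RATE_PER_MTOK = 15.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate the request cost in USD from token counts (hypothetical helper)."""
    total_micro_usd = (
        prompt_tokens * INPUT_RATE_PER_MTOK
        + completion_tokens * OUTPUT_RATE_PER_MTOK
    )
    return total_micro_usd / 1_000_000


# native_tokens_prompt = 75, native_tokens_completion = 689 (from the metadata)
cost = estimate_cost(75, 689)
print(f"{cost:.6f}")  # prints 0.010560
```

Under these assumed rates, the native counts (75 and 689) rather than the normalized counts (77 and 806) are the ones that match the reported cost of $0.010560.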
Evaluation details
Result | Evaluator | Details | Meta Data
99.92% | Matches word count | n/a | neededClean: false, words: 497
0.07% | Dialogue to Total Word Ratio | Ratio: 17.91%, Deviation: 12.09% | neededClean: false, wordsTotal: 497, wordsDialogue: 89
49.9929% | Overall
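The ratio and deviation reported by the dialogue evaluator follow directly from the word counts in its metadata and the 30% target in the scenario. The sketch below reproduces that arithmetic only. How the evaluator turns a 12.09% deviation into a 0.07% score is not shown on this page, so it is not modeled here. The final lines check that a plain average of the two displayed scores lands near the reported overall figure, which is an assumption about the aggregation, not a documented rule.

```python
# Values taken from the evaluation metadata and the scenario description.
TARGET_DIALOGUE_RATIO = 0.30  # "30% dialogue" from the scenario
words_total = 497
words_dialogue = 89

# Share of words inside dialogue, and its distance from the target.
ratio = words_dialogue / words_total
deviation = abs(TARGET_DIALOGUE_RATIO - ratio)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints: Ratio: 17.91%, Deviation: 12.09%

# Assumption: the overall score is a simple mean of the evaluator scores.
# Using the rounded scores shown on the page gives roughly 49.995, close to
# the reported 49.9929; the small gap would be consistent with the page
# averaging unrounded values.
displayed_scores = [99.92, 0.07]
mean_of_displayed = sum(displayed_scores) / len(displayed_scores)
```

At a 30% target, a 497-word scene would need about 149 words of dialogue, so the 89 counted here fall well short.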