NC Bench
Overview
About
Tests
Models
Vendors
Run: 2026-02-18-01KHQ918CKNMH1FXWHVVDBVPGG
Run Details
Model:
Claude Sonnet 4.6
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
15.0s
Cost:
$0.011205
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
81.27%
Matches word count
(details)
29.84%
Dialogue to Total Word Ratio
(details)
55.5559%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain tapped against the windshield of Bill's rusted Pontiac like an impatient landlord. Molly spread the hand-drawn map across the dashboard, knocking an empty coffee cup onto Bill's lap. "Watch it," Bill said. "Focus," Molly said. "Now, the birthday boy goes in for cake at three o'clock. That gives us a twelve minute window." Bill squinted at the map. "You drew the bouncy castle in incredible detail for something that has nothing to do with the job." "I was bored," Molly said. "The point is, the presents are stacked on the gift table, which is here, next to the clown." "I don't like clowns," Bill said. "Nobody asked you, Bill." "I'm just saying, as a professional consideration." Molly folded the corner of the map down and stared at him. Outside, a minivan full of small children pulled into the driveway of the house across the street. Streamers had been tied to the mailbox and were immediately destroyed by the wind. "There she is," Molly said. "Seven years old today. You know what she's getting for her birthday?" "Nothing," Bill said. "Nothing," Molly agreed, and she smiled in a way that suggested she had practiced it. Bill drummed his fingers on the steering wheel. He was wearing a fake mustache that he had not been asked to wear and had not mentioned to Molly, who had been pretending not to notice it for forty minutes. "The clown is the variable," Bill said. "Clowns are unpredictable. They have big shoes. You can't hear them coming." "We are not going to be outwitted by a party clown," Molly said. "You say that now." "I say that always. That's my consistent position on clowns." Bill peeled the mustache off, examined it briefly, and pressed it back on. Molly watched this happen and decided not to pursue it. "What about the parents?" Bill asked. "Inside, running the cake situation," Molly said. "There's some kind of ice cream emergency, according to my source." "Your source is the neighbor kid." "My source has eyes and a bicycle and very few ethical commitments. He's perfect." Bill looked at the house again. A balloon escaped from somewhere in the backyard and floated up past their windshield with quiet dignity. They both watched it go. "We grab the presents, we're back in the car in nine minutes," Molly said. "Clean and simple." "Clean and simple," Bill repeated, like a man who had heard those exact words before and had the scar tissue to prove it. "Don't do that voice," Molly said. "What voice?" "The voice where you repeat what I say but somehow make it sound like a funeral." "That's just my voice," Bill said. Molly rolled up the map and tucked it into her jacket. She checked her watch. She looked at the house. She looked at Bill and his mustache and his funeral voice and briefly reconsidered her entire career. "Let's go," she said.
created_at
"2026-02-18T02:25:13.821067+00:00"
2/18/2026, 2:25:13 AM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1105
moderation_latency
243
generation_time
14720
tokens_prompt
77
tokens_completion
750
native_tokens_prompt
75
native_tokens_completion
732
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.011205
router
(null)
provider_responses
0
id
"df41dd35-f3b9-45fd-985e-72c396fc9b6a"
status
200
is_byok
false
latency
1103
endpoint_id
"83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
api_type
"completions"
id
"gen-1771381498-5ZtPeGquIUZQETTwWYel"
upstream_id
"df41dd35-f3b9-45fd-985e-72c396fc9b6a"
total_cost
0.011205
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
81.27%
Matches word count
n/a
neededClean
false
words
488
29.84%
Dialogue to Total Word Ratio
Ratio: 45.08%, Deviation: 4.92%
neededClean
false
wordsTotal
488
wordsDialogue
220
55.5559%