NC Bench
Run: 2026-02-24-01KJ8V7YVX28Y1TABQGHPBS2NN
Run Details
Model: Claude Sonnet 4.6 (Reasoning)
Test: Dialogue tags
Scenario: Write 200 words with 50% dialogue
Duration: 49.4s
Cost: $0.076071
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator
93.65% | Matches word count
99.54% | Dialogue to Total Word Ratio
96.5954% | Overall
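The Overall score is consistent with an unweighted mean of the two evaluator scores. That relationship is inferred from the displayed numbers, not stated by the page, so the sketch below is a sanity check under that assumption. It uses the rounded percentages shown in the summary, which is why it agrees with the reported 96.5954% only to within rounding.

```python
# Rounded evaluator scores as displayed in the summary (percent).
scores = {
    "Matches word count": 93.65,
    "Dialogue to Total Word Ratio": 99.54,
}

# Assumption (inferred, not documented): the overall score is the plain
# arithmetic mean of the individual evaluator scores.
overall = sum(scores.values()) / len(scores)

reported_overall = 96.5954  # value shown on the page
print(f"computed mean: {overall:.3f}%")
print(f"difference from reported: {abs(overall - reported_overall):.4f}")
```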
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the hood of their battered van, carefully smoothing out the crayon marks she had made during three days of meticulous and largely cold surveillance. "The bouncy castle goes up at noon," she said, tapping the map with a stubby finger. "That's when we move." Bill squinted suspiciously at the drawing. "Is that supposed to be a fence or a rainbow?" "It's a fence, Bill." "Why is it purple?" "Because I ran out of brown." Molly pinched the bridge of her nose hard. "Focus. Twelve kids, minimum. All the presents will be stacked by the garden bench while the parents are distracted by the birthday song." Bill stroked his chin thoughtfully, which always made him look vaguely wise despite never actually being so. "What about the clown?" "The clown is our problem to worry about," Molly admitted grimly. "I really don't like clowns, Molly." "Nobody likes clowns, Bill. That's precisely what makes them so dangerous." She rolled up the map decisively. "We go in during musical chairs. Nobody ever pays attention during musical chairs." Bill nodded slowly, a rare smile spreading across his weathered face. "Genius."
created_at: "2026-02-24T22:10:17.605Z" (2/24/2026, 10:10:17 PM)
model: "anthropic/claude-4.6-sonnet-20260217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 768
moderation_latency: 111
generation_time: 49152
tokens_prompt: 77
tokens_completion: 717
native_tokens_prompt: 92
native_tokens_completion: 5053
native_tokens_completion_images: (null)
native_tokens_reasoning: 412
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.076071
router: (null)
provider_responses:
  0:
    endpoint_id: "83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
    id: "9efd3402-d7aa-4a46-a957-91a2dff5601b"
    is_byok: false
    latency: 768
    model_permaslug: "anthropic/claude-4.6-sonnet-20260217"
    provider_name: "Amazon Bedrock"
    status: 200
api_type: "completions"
id: "gen-1771971017-l1O8FgAc9WU1cUpcQII2"
upstream_id: "9efd3402-d7aa-4a46-a957-91a2dff5601b"
total_cost: 0.076071
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
Evaluation details
Result | Evaluator | Details | Meta Data
93.65% | Matches word count | n/a | neededClean: false, words: 191
99.54% | Dialogue to Total Word Ratio | Ratio: 52.60%, Deviation: 2.60% | neededClean: false, wordsTotal: 192, wordsDialogue: 101
96.5954% | Overall
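The reported ratio and deviation follow directly from the two word counts in the metadata. The sketch below reproduces that arithmetic and also shows one plausible way to count dialogue words, by treating everything between straight double quotes as dialogue. The helper `dialogue_ratio` and its whitespace tokenization are assumptions made for illustration; the page does not describe how the evaluator splits words, and the two evaluators above already disagree on the total (191 versus 192), so exact counts may differ.

```python
import re

def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio).

    Hypothetical helper: dialogue is any span between straight double
    quotes, and words are whitespace-separated tokens. The real
    evaluator's rules are not documented on the page.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio

# Small excerpt in the style of the model response.
sample = '"It\'s a fence, Bill." "Why is it purple?" Molly sighed.'
print(dialogue_ratio(sample))

# Reproduce the reported figures from the counts in the metadata.
words_total = 192      # wordsTotal
words_dialogue = 101   # wordsDialogue
target_ratio = 0.50    # the scenario asks for 50% dialogue

ratio = words_dialogue / words_total
deviation = abs(ratio - target_ratio)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 52.60%, Deviation: 2.60%"
```

How the 2.60% deviation maps to the 99.54% score is not shown on the page, so the sketch stops at the deviation.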