NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8WNY07MAH6HZ8WB01MCSAM
Run Details
Model:
Claude Opus 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
10.6s
Cost:
$0.010010
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
99.92%
Matches word count
(details)
0.14%
Dialogue to Total Word Ratio
(details)
50.0285%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the dashboard, which appeared to be crayon on construction paper. She had stolen it from the birthday boy's cubby at preschool, and it smelled faintly of juice boxes. "The bounce house is here," she said, tapping a purple blob. "That's our primary obstacle. Kids go feral in those things." Bill squinted at the map and turned it upside down. He was wearing all black, which seemed excessive for a Tuesday afternoon in the suburbs. "What about the clown?" he asked. "The clown is a wildcard. We avoid the clown at all costs." Bill nodded slowly, cracking his knuckles with the gravity of a man about to defuse a bomb rather than rob a six-year-old. "I'll create a distraction near the cake table," he said. "You grab the presents from the folding table by the fence." "And if the mother spots us?" "We tell her we're from the balloon company." Molly pulled two helium tanks from the backseat, grinning with the pride of someone who had thought of everything. The party started in twenty minutes. Somewhere inside that backyard sat a mountain of wrapped gifts just begging to be liberated. "Let's go steal from some children," she whispered.
created_at
"2026-02-24T22:35:24.047Z"
2/24/2026, 10:35:24 PM
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1466
moderation_latency
171
generation_time
10350
tokens_prompt
77
tokens_completion
382
native_tokens_prompt
92
native_tokens_completion
382
native_tokens_completion_images
(null)
native_tokens_reasoning
68
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.01001
router
(null)
provider_responses
0
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
id
"905f58dc-56fb-44fa-ab23-703273a8ee3b"
is_byok
false
latency
1425
model_permaslug
"anthropic/claude-4.6-opus-20260205"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771972524-UngVoDw6RULPaUyytE5i"
upstream_id
"905f58dc-56fb-44fa-ab23-703273a8ee3b"
total_cost
0.01001
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
99.92%
Matches word count
n/a
neededClean
false
words
203
0.14%
Dialogue to Total Word Ratio
Ratio: 33.98%, Deviation: 16.02%
neededClean
false
wordsTotal
206
wordsDialogue
70
50.0285%