NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.6
Test
Dialogue tags
Scenario
Write 200 words with 10% dialogue
Run ID
2026-02-08-01KGYZN61WMA6W0V1JCPR8M6WG
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the sticky diner table, smoothing out the crayon-smudged edges with her fingertips. The blueprint of the Henderson household had cost them two fruit roll-ups and a favor from a kindergartner named Dustin who couldn't keep a secret to save his juice box. Bill squinted at the diagram, turning it sideways, then upside down. "That's either the back door or a dinosaur," he said. "It's the back door. Dustin is an artist." The plan was elegant in its simplicity. Twenty-six children hopped up on cake and sugar would create the perfect cover. Nobody watches the gift table when a clown is making balloon animals and a dozen six-year-olds are screaming at frequencies that could shatter crystal. Molly had timed the piñata window down to the second. When that papier-mâché unicorn burst open, every pair of eyes would turn skyward, and she and Bill would have exactly ninety seconds to load the wagon. Bill drummed his fingers nervously on the table. The last job had gone sideways when he'd accidentally joined a game of musical chairs and gotten eliminated in the second round. The humiliation still stung. "No games this time," Molly warned. "Not even freeze tag?" "Especially not freeze tag."
created_at
"2026-02-08T15:59:29.193588+00:00"
Sunday, February 8th, 2026 at 3:59:29 PM GMT+00:00
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1857
moderation_latency
189
generation_time
9216
tokens_prompt
61
tokens_completion
260
native_tokens_prompt
75
native_tokens_completion
307
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00805
router
(null)
provider_responses
0
id
"bf59e3bc-a0af-4fc3-8e44-25f288c3cd2c"
status
200
is_byok
false
latency
1856
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.6-opus-20260205"
api_type
"completions"
id
"gen-1770566359-ndyaDf7xEcO0XDLGEziB"
upstream_id
"bf59e3bc-a0af-4fc3-8e44-25f288c3cd2c"
total_cost
0.00805
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
81.2727%
Matches word count
n/a
neededClean
false
words
212
98.9471%
Dialogue to Total Word Ratio
Ratio: 13.21%, Deviation: 3.21%
neededClean
false
wordsTotal
212
wordsDialogue
28
90.1099%