NC Bench
Overview
About
Tests
Models
Model
Claude Opus 4.6
Test
Dialogue tags
Scenario
Write 200 words with 50% dialogue
Run ID
2026-02-08-01KGYZYA8C4FB68MAQMRAC8PP7
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed the binoculars against her face and peered through the hedge at the bouncy castle wobbling in the afternoon sun. Streamers dangled from every fence post, and a piƱata shaped like a dinosaur swung lazily from the oak tree. "Seventeen kids," she whispered. "Maybe eighteen. Hard to count when they keep moving." Bill crouched beside her, sketching a crude map on a napkin he'd stolen from the taco truck down the street. His brow furrowed with the intensity of a man planning something far more consequential than what they were actually doing. "Where are the presents?" he asked. "Folding table by the back door. Massive pile. I'm talking gift bags, wrapped boxes, the works." "Security?" "One dad grilling hot dogs. He looks sleepy." Bill nodded slowly. "We go during the cake. Every kid on the planet loses their mind when cake appears. Total chaos. Nobody watches anything except frosting." "That's actually brilliant," Molly admitted. "What about the mom?" "Which mom?" "The one who keeps counting heads like a prison warden." Bill chewed his pen cap. "Distraction. I'll release the bubbles early." "You brought the bubble machine?" "Molly, I'm a professional." She stared at him. He absolutely was not.
created_at
"2026-02-08T16:04:28.448517+00:00"
Sunday, February 8th, 2026 at 4:04:28 PM GMT+00:00
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1514
moderation_latency
239
generation_time
9197
tokens_prompt
61
tokens_completion
274
native_tokens_prompt
75
native_tokens_completion
317
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0083
router
(null)
provider_responses
0
id
"af7350ff-d37e-4eff-8c38-9cba0bff8df9"
status
200
is_byok
false
latency
1514
endpoint_id
"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.6-opus-20260205"
api_type
"completions"
id
"gen-1770566658-bUnrxe7IjDzIbJnMp44D"
upstream_id
"af7350ff-d37e-4eff-8c38-9cba0bff8df9"
total_cost
0.0083
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
99.9990%
Matches word count
n/a
neededClean
false
words
199
99.9751%
Dialogue to Total Word Ratio
Ratio: 48.74%, Deviation: 1.26%
neededClean
false
wordsTotal
199
wordsDialogue
97
99.9870%