NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-08-01KGYZ2DFA0G2884G26QBQTKD6
Run Details
Model:
Claude Opus 4.6
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
11.4s
Cost:
$0.009000
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
99.92%
Dialogue to Total Word Ratio
(details)
99.9570%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the sticky diner table and tapped the big red X. "The bouncy castle is here. That's our primary obstacle." "A bouncy castle," Bill said flatly. "Don't underestimate it. Last job, remember the laser grid in Monaco? This is worse. Kids hopped up on sugar, Bill. They're unpredictable. They bite." "So how do we get past them?" "Distraction. You go in dressed as a clown." "Absolutely not." "You'd make a wonderful clown." "I said no." "Fine. Magician?" "I'm not wearing a costume, Molly." "Then what do you suggest? We just waltz in and grab thirty wrapped presents off the table while a dozen six-year-olds watch?" "I was thinking we go through the back. Kitchen entrance." "That's where the mom will be. She's cutting the cake at two fifteen. We have a four-minute window while she carries it out and they sing Happy Birthday." "Four minutes to grab everything?" "I've got duffel bags in the van." "What about the dad?" "He's been on the margarita machine since noon. He's not a factor." Bill rubbed his temples. "We're robbing children." "We're robbing children's parents. There's a distinction." "There really isn't." "Are you in or not?" "Obviously I'm in."
created_at
"2026-02-08T15:49:16.245149+00:00"
2/8/2026, 3:49:16 PM
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1619
moderation_latency
(null)
generation_time
11292
tokens_prompt
61
tokens_completion
287
native_tokens_prompt
75
native_tokens_completion
345
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.009
router
(null)
provider_responses
0
id
"msg_01U99Yk8UJCRpjCjqTvTMwAR"
status
200
is_byok
false
latency
1619
endpoint_id
"c926c048-563e-44b0-8434-95965bee924c"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.6-opus-20260205"
api_type
"completions"
id
"gen-1770565744-lQeTtXvHx4MZ4ZiL0uA9"
upstream_id
"msg_01U99Yk8UJCRpjCjqTvTMwAR"
total_cost
0.009
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
201
99.92%
Dialogue to Total Word Ratio
Ratio: 88.29%, Deviation: 1.71%
neededClean
false
wordsTotal
205
wordsDialogue
181
99.9570%