NC Bench
Run: 2026-02-24-01KJ8WQQBFPB79BAP24KYD0Q8M
Run Details
Model: Claude Sonnet 4.6 (Reasoning)
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Duration: 9.7s
Cost: $0.006321
Provider: Anthropic
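The reported cost can be reproduced from the native token counts in the response metadata (92 prompt tokens, 403 completion tokens). The sketch below assumes per-million-token rates of $3.00 for input and $15.00 for output. These rates are not stated anywhere in the run and are an assumption, as is the helper name `estimate_cost`.

```python
# Assumed pricing in USD per million tokens; not stated in the run itself.
INPUT_USD_PER_MTOK = 3.00
OUTPUT_USD_PER_MTOK = 15.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Estimate request cost in USD from token counts (hypothetical helper)."""
    total_micro_usd = (
        prompt_tokens * INPUT_USD_PER_MTOK
        + completion_tokens * OUTPUT_USD_PER_MTOK
    )
    return total_micro_usd / 1_000_000


# native_tokens_prompt and native_tokens_completion from the run metadata.
cost = estimate_cost(92, 403)
print(f"${cost:.6f}")
```

Under these assumed rates the estimate agrees with the reported cost to six decimal places, which suggests billing is based on the native token counts rather than the normalized `tokens_prompt` and `tokens_completion` values.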
Evaluation summary
Score      Evaluator
43.38%     Matches word count
100.00%    Dialogue to Total Word Ratio
71.6892%   Overall
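The overall figure is consistent with an unweighted mean of the two evaluator scores. A minimal check, assuming that aggregation rule (the page does not state how the overall score is computed):

```python
from statistics import mean

# Evaluator scores as displayed in the summary, in percent.
scores = {
    "Matches word count": 43.38,
    "Dialogue to Total Word Ratio": 100.00,
}

# Assumption: overall = unweighted mean of the evaluator scores.
overall = mean(list(scores.values()))
print(f"{overall:.2f}%")
```

The result matches the reported 71.6892% to within the rounding of the displayed word count score.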
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"The bouncy castle is our biggest problem," said Molly. "Why?" "Because you'll want to go in it." "I won't want to go in it." "Bill, you went in the one at the Henderson job." "That was recon." "You were in there for forty minutes." "It was a very good bouncy castle." Molly unrolled the hand-drawn map across the hood of their van. "Focus. The presents are stacked behind the gift table, here. The clown is stationed near the entrance, here. And the birthday kid patrols the perimeter like a tiny angry general." "How old is this kid?" "Six." "Terrifying." "Exactly. Now, I'll create a distraction by loosening the string on the helium balloons. Every child within twenty feet will lose their mind completely." "Genius." "While that happens, you load the presents into the bags and we're out before anyone sings happy birthday." "What if the clown spots us?" "Nobody looks directly at a clown, Bill. It's a human instinct." "What about the cake?" Molly stared at him. "We are not stealing the cake." "I'm just saying it looked really good in the photos."
created_at: "2026-02-24T22:36:22.776Z" (2/24/2026, 10:36:22 PM)
model: "anthropic/claude-4.6-sonnet-20260217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 779
moderation_latency: (null)
generation_time: 9688
tokens_prompt: 77
tokens_completion: 387
native_tokens_prompt: 92
native_tokens_completion: 403
native_tokens_completion_images: (null)
native_tokens_reasoning: 90
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.006321
router: (null)
provider_responses:
  0:
    endpoint_id: "6d408764-7dd7-4626-bb87-a6cc1589bc86"
    id: "msg_01EUwVjhHTR7rosT9nGb1Rdw"
    is_byok: false
    latency: 776
    model_permaslug: "anthropic/claude-4.6-sonnet-20260217"
    provider_name: "Anthropic"
    status: 200
api_type: "completions"
id: "gen-1771972582-jSG8phfwRmWw2316kTXh"
upstream_id: "msg_01EUwVjhHTR7rosT9nGb1Rdw"
total_cost: 0.006321
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Anthropic"
Evaluation details
Result     Evaluator                      Details                           Meta Data
43.38%     Matches word count             n/a                               neededClean: false, words: 183
100.00%    Dialogue to Total Word Ratio   Ratio: 90.22%, Deviation: 0.22%   neededClean: false, wordsTotal: 184, wordsDialogue: 166
71.6892%   Overall
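The dialogue ratio reported above follows from the two word counts: 166 dialogue words out of 184 total. The sketch below shows one way such a ratio could be measured, assuming that dialogue means text inside straight double quotes and that words are whitespace-separated tokens. The benchmark's actual tokenization is not documented here, and `dialogue_ratio` is a hypothetical helper, not the evaluator's code.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Return (dialogue_words, total_words, ratio) for a passage.

    Assumes dialogue is wrapped in straight double quotes and that
    words are whitespace-separated tokens.
    """
    total = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    spoken = sum(len(span.split()) for span in quoted_spans)
    ratio = spoken / total if total else 0.0
    return spoken, total, ratio


sample = '"Focus," said Molly. "We are not stealing the cake."'
spoken, total, ratio = dialogue_ratio(sample)
print(spoken, total, round(ratio, 4))

# Reproducing the reported figures from the evaluator's own counts.
reported_ratio = round(166 / 184 * 100, 2)    # percent
deviation = round(abs(reported_ratio - 90), 2)  # distance from the 90% target
print(reported_ratio, deviation)
```

The second part uses only the counts from the evaluation details, so it does not depend on the quoting assumption.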