NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8WYJ8WA24H1VVBHV7AQVMN
Run Details
Model:
Claude Sonnet 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
20.6s
Cost:
$0.012411
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
97.63%
Matches word count
(details)
95.08%
Dialogue to Total Word Ratio
(details)
96.3561%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the crayon-drawn map across the hood of the rusted van and smoothed out the wrinkles with both palms. It had been drawn by her nephew three years ago and was originally meant to depict a dinosaur, but it served its current purpose well enough. "The bounce house is our biggest problem," she said, tapping a lopsided pink rectangle near the center of the page. Bill squinted at the map and then squinted at Molly. "That's a dinosaur." "It's a bounce house. Focus. The presents will be stacked on the gift table, which is here." She pointed to what was clearly labeled, in a child's handwriting, as a volcano. "Why does the volcano say gifts on it?" "Because I wrote that. Will you please concentrate?" Bill pulled his collar up against the autumn chill and leaned closer. He smelled like the gas station hotdog he had eaten thirty minutes ago and showed no remorse about it. "Okay, so we go in through the back garden gate, avoid the bounce house, grab the presents, and walk out like we own the place." "Walk out like we're catering staff," Molly corrected. "Which is why we're wearing the aprons." "Mine has a duck on it." "Ducks are professional." She rolled the map up and tucked it under her arm. "The party starts at two. The parents will be distracted by seventeen screaming six-year-olds and a magician named the Magnificent Gerald." Bill brightened considerably. "Is Gerald actually magnificent?" "Gerald does a trick where a scarf comes out of his pocket. That is the whole trick. Gerald is not magnificent." "So we have a window." "A solid forty-five minutes before the cake comes out and everyone reassembles. By then we load the presents into the van and we're gone before anyone notices we aren't holding juice boxes." Bill scratched the back of his neck and gazed across the quiet suburban street toward number fourteen, where balloons were already being tied to the front gate in aggressive quantities. A hand-painted banner read HAPPY BIRTHDAY ZOEY with a unicorn that appeared to be screaming. "I feel a little bad," he said quietly. "Taking presents from a kid." Molly turned to look at him with the weary patience of someone who had been partners with Bill for eleven years. "We are donating them to the community centre raffle. Every single present goes to charity. We have discussed this four times." "Right, right." He nodded with fresh conviction. "We're basically Robin Hood." "We are exactly Robin Hood. Except Robin Hood probably never wore a duck apron." "I think it gives me an air of authority." Molly climbed into the van, and the suspension groaned under the weight of stolen catering equipment and two large empty sacks. Bill followed, settling into the passenger seat and buckling his seatbelt with ceremonial clicking. "Operation Birthday Bandit," he announced to no one in particular. Molly started the engine. "Please never say that again," she said.
created_at
"2026-02-24T22:40:06.949Z"
2/24/2026, 10:40:06 PM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1152
moderation_latency
(null)
generation_time
20552
tokens_prompt
77
tokens_completion
864
native_tokens_prompt
92
native_tokens_completion
809
native_tokens_completion_images
(null)
native_tokens_reasoning
99
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.012411
router
(null)
provider_responses
0
endpoint_id
"6d408764-7dd7-4626-bb87-a6cc1589bc86"
id
"msg_01XJsAxkoBfXVWZtPspzvwwS"
is_byok
false
latency
1148
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
provider_name
"Anthropic"
status
200
api_type
"completions"
id
"gen-1771972806-RWwnUDO2qqovfY4OXkha"
upstream_id
"msg_01XJsAxkoBfXVWZtPspzvwwS"
total_cost
0.012411
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result
Evaluator
Details
Meta Data
97.63%
Matches word count
n/a
neededClean
false
words
493
95.08%
Dialogue to Total Word Ratio
Ratio: 49.00%, Deviation: 1.00%
neededClean
false
wordsTotal
498
wordsDialogue
244
96.3561%