NC Bench
Run: 2026-02-18-01KHQ93XX8AWZMJCB39YEP6ZRB
Run Details
Model: Claude Sonnet 4.6
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 19.2s
Cost: $0.010545
Provider: Anthropic
Evaluation summary
Score | Evaluator | Details Link
95.99% | Matches word count | (details)
0.49% | Dialogue to Total Word Ratio | (details)
48.2380% | Overall |
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain came down in sheets across the parking lot of Funzone Party Palace, which was decorated outside with approximately forty-seven soggy balloons and a banner that read HAPPY BIRTHDAY TREVOR in letters so aggressively cheerful they bordered on threatening. Molly sat low in the passenger seat of their stolen minivan, which they had stolen specifically because it was the least suspicious vehicle imaginable, and which now smelled overwhelmingly of goldfish crackers. She studied the building through a pair of binoculars she had borrowed from her neighbor Gerald, who believed she was birdwatching. Gerald was a trusting man. Gerald deserved better friends. Bill drummed his fingers on the steering wheel in a rhythm that had no discernible pattern and was driving Molly quietly insane. He was wearing a fleece vest, which he believed made him look inconspicuous and which actually made him look like a substitute geography teacher. "The presents will be piled near the gift table," Molly said, "which based on the window layout is somewhere in the northeast corner." Bill stopped drumming. "How do you know there's a gift table?" "There is always a gift table, Bill. That is the law of children's parties. It is as reliable as gravity." He nodded slowly, absorbing this with the careful reverence he usually reserved for weather forecasts and soup recommendations. The front doors burst open and a stream of small children poured out, chasing each other around the parking lot with the unhinged energy of creatures who had consumed an amount of sugar that would hospitalize a medium-sized adult. A clown followed them out, looking absolutely haunted. "That clown has seen things," Bill said quietly. Molly lowered the binoculars. "Focus." Their plan was straightforward, which was precisely what made Molly nervous. Simple plans had a way of becoming complicated the moment Bill got involved. 
Last March, a simple plan to lift wallets at a farmer's market had somehow concluded with Bill accidentally purchasing a goat named Henderson, who now lived in their garage and had eaten three of Molly's shoes. The party guests would be occupied with cake in approximately eleven minutes. They had studied the schedule, which the extremely organized parents of Trevor had posted publicly on a community Facebook group, because people simply posted everything now and Molly chose not to question her good fortune. "We walk in, we look like we belong, we load the presents into the trolley, and we walk out," she said. "What if someone asks who we are?" "We are Aunt Susan and Uncle Dave." Bill considered this. "I don't look like a Dave." "You absolutely look like a Dave." He seemed wounded by this in a way that was disproportionate and deeply personal, pressing his lips together and staring out at the rain with the expression of a man reconsidering many of his life choices. Molly checked her watch. Nine minutes until cake. She cracked her knuckles. Henderson had eaten her good shoes and somebody was going to pay for that, even if that somebody was technically Trevor.
created_at: "2026-02-18T02:26:45.456127+00:00" (2/18/2026, 2:26:45 AM)
model: "anthropic/claude-4.6-sonnet-20260217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 477
moderation_latency: (null)
generation_time: 19138
tokens_prompt: 77
tokens_completion: 801
native_tokens_prompt: 75
native_tokens_completion: 688
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.010545
router: (null)
provider_responses:
  0:
    id: "msg_01WeMPeszRgaW5E3a4uv1gfJ"
    status: 200
    is_byok: false
    latency: 477
    endpoint_id: "6d408764-7dd7-4626-bb87-a6cc1589bc86"
    provider_name: "Anthropic"
    model_permaslug: "anthropic/claude-4.6-sonnet-20260217"
api_type: "completions"
id: "gen-1771381585-07T740HUIaEzT7c7sHjK"
upstream_id: "msg_01WeMPeszRgaW5E3a4uv1gfJ"
total_cost: 0.010545
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Anthropic"
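The cost and timing fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the per-token rates ($3 per million input tokens, $15 per million output tokens) are an assumption not stated on this page, and generation_time is assumed to be in milliseconds. Under those assumptions, the native token counts (75 prompt, 688 completion) reproduce the reported total_cost of 0.010545.

```python
from decimal import Decimal

# ASSUMPTION: these rates are not given on this page; they are
# plugged in only to see whether they reproduce the reported cost.
INPUT_USD_PER_MTOK = Decimal("3")
OUTPUT_USD_PER_MTOK = Decimal("15")


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the request cost in USD from native token counts."""
    million = Decimal(1_000_000)
    total = (prompt_tokens * INPUT_USD_PER_MTOK
             + completion_tokens * OUTPUT_USD_PER_MTOK)
    return total / million


def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Output throughput, ASSUMING generation_time is in milliseconds."""
    return completion_tokens / (generation_time_ms / 1000)


# Values taken from the metadata above.
cost = estimate_cost(prompt_tokens=75, completion_tokens=688)
rate = tokens_per_second(completion_tokens=688, generation_time_ms=19138)
print(f"estimated cost: ${cost}")
print(f"throughput: {rate:.1f} tokens/s")
```

Decimal is used instead of float so that the comparison with the reported six-decimal cost is exact.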
Evaluation details
Result | Evaluator | Details | Meta Data
95.99% | Matches word count | n/a | neededClean: false, words: 508
0.49% | Dialogue to Total Word Ratio | Ratio: 19.69%, Deviation: 10.31% | neededClean: false, wordsTotal: 508, wordsDialogue: 100
48.2380% | Overall | |
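The ratio and deviation figures above follow directly from the reported word counts. The sketch below is a rough reconstruction, not the benchmark's actual evaluator: it assumes dialogue is any text inside straight double quotes, that words are whitespace-separated tokens, and that the deviation is the absolute gap from the 30% target in the scenario. The function names are hypothetical. How each evaluator converts its measurement into a score (for example 10.31% deviation into 0.49%) is not shown on this page and is not reproduced here; the overall figure is merely consistent with a plain average of the two evaluator scores.

```python
import re

TARGET_RATIO = 0.30  # from the scenario: "30% dialogue"


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (assumed word definition)."""
    return len(text.split())


def count_dialogue_words(text: str) -> int:
    """Count words inside straight double quotes (assumed dialogue rule)."""
    quoted_spans = re.findall(r'"([^"]*)"', text)
    return sum(len(span.split()) for span in quoted_spans)


def dialogue_stats(words_dialogue: int, words_total: int,
                   target: float = TARGET_RATIO) -> tuple[float, float]:
    """Return (dialogue ratio, absolute deviation from the target)."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Word counts taken from the evaluation details above.
ratio, deviation = dialogue_stats(words_dialogue=100, words_total=508)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# ASSUMPTION: the overall score is the unweighted mean of the two scores.
overall = (95.99 + 0.49) / 2
print(f"Overall (mean of displayed scores): {overall:.2f}%")
```

The mean of the displayed scores comes to about 48.24%, which agrees with the reported 48.2380% if the page averages unrounded values.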