NC Bench
Run: 2026-05-31-01KSYXG87K0N7G1Y6Q4X3HRBPG
Run Details
Model: Claude Opus 4.8 (Reasoning, High)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 22.7s
Cost: $0.028290
Provider: Anthropic
Evaluation summary
Score | Evaluator | Details Link
97.63% | Matches word count | (details)
0.00% | Dialogue to Total Word Ratio | (details)
48.8138% | Overall |
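The overall figure appears consistent with an unweighted mean of the two evaluator scores. This is an inference from the numbers shown, not something the page states. The minimal sketch below checks it; `overall_score` is a hypothetical helper, and the small gap from the reported value is what one would expect if the displayed 97.63% is a rounded figure.

```python
def overall_score(scores):
    """ASSUMPTION: overall = unweighted mean of evaluator scores (not documented on the page)."""
    return sum(scores) / len(scores)

# Scores as displayed in the evaluation summary (percent).
displayed_scores = [97.63, 0.00]
reported_overall = 48.8138

overall = overall_score(displayed_scores)

# Working backwards: the unrounded word-count score implied by the reported overall.
implied_word_count_score = 2 * reported_overall - displayed_scores[1]

print(f"mean of displayed scores: {overall:.4f}")
print(f"implied unrounded word-count score: {implied_word_count_score:.4f}")
```

If the mean assumption holds, the implied word-count score rounds to the displayed 97.63%.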
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the table, smoothing its wrinkles with theatrical reverence. "Behold. The Hendersons' backyard. Site of our greatest triumph." "It's a drawing of a swing set," Bill said. "It's a drawing of *opportunity*, Bill." She tapped the page. "Tommy Henderson turns seven on Saturday. Do you know what that means?" "That we've hit rock bottom?" "It means presents. Mountains of them. Wrapped, ribboned, and completely unguarded except by adults who'll be distracted by a clown." Bill leaned back, arms crossed. "There's a clown?" "There's always a clown." "I hate clowns." "Everyone hates clowns. That's why nobody watches them. We slip past during the magic show, grab the gift table, and vanish before they cut the cake." Bill picked up a juice box she'd inexplicably placed on the table and studied it like evidence. "And how exactly do two grown adults blend into a seven-year-old's birthday party?" Molly grinned and produced two enormous bunny costumes from a duffel bag. "We don't blend in. We become entertainment." "Absolutely not." "Bill—" "I have my dignity." "You stole a piggy bank from a Chuck E. Cheese last month." "That was different. The piggy bank had it coming." He held up the costume by one floppy ear, frowning. "These smell like regret and cotton candy." "They smell like *success*." She stuffed his into his arms. "Look, you wear the suit, you wave, you hand out balloons. The parents think you're hired help. Meanwhile, I'm loading presents into the van shaped like an ice cream truck." "We have an ice cream truck now?" "Rented. Plays a delightful little tune." Bill rubbed his temples. "This is the worst plan you've ever proposed, and last year you wanted to rob a bank using carrier pigeons." "The pigeons would have worked if they were loyal." Molly waved this away. "Focus. The kids are the real threat here. They're small, they're fast, and they have zero respect for personal space. 
If one of them figures out we're not real bunnies, it's over." "So we're afraid of children now." "You should always be afraid of children. They smell fear. And jelly." Bill sighed, the sound of a man who had run out of better options sometime around his thirties. "What's the take? Realistically." "Action figures. A bicycle, probably. Some cash in envelopes from grandparents who never know what to buy." She leaned forward, eyes gleaming. "And rumor has it, Tommy's getting a *gaming console*." "A console." Bill's resistance visibly cracked. "Sealed? In the box?" "Pristine." He was quiet a moment. Then he picked up the bunny head and turned it slowly in his hands like a tragic prince contemplating a skull. "If I do this," he said, "I'm not hopping." "You'll have to hop a little." "I will not hop, Molly." "The character is *defined* by the hopping." Bill jammed the bunny head on with grim finality, his voice now muffled and full of doom. "I'm going to hate every second of this." "That's the spirit," Molly beamed. "Now let's go ruin a child's birthday."
created_at: "2026-05-31T11:42:56.766Z" (5/31/2026, 11:42:56 AM)
model: "anthropic/claude-4.8-opus-20260528"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1118
moderation_latency: (null)
generation_time: 22620
tokens_prompt: 77
tokens_completion: 812
native_tokens_prompt: 93
native_tokens_completion: 1113
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
service_tier: "default"
usage: 0.02829
router: (null)
provider_responses: 0
endpoint_id: "b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id: "msg_01Sa33uyYxfBTcL6bZ65XAy4"
is_byok: false
latency: 1118
model_permaslug: "anthropic/claude-4.8-opus-20260528"
provider_name: "Anthropic"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer: (null)
request_id: "req-1780227776-o9yi9ceF6xLmFCnKSBb3"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1780227776-OPPzNKoTCL2QNyo33jHf"
upstream_id: "msg_01Sa33uyYxfBTcL6bZ65XAy4"
total_cost: 0.02829
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Anthropic"
response_cache_source_id: (null)
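The reported total_cost can be related to the native token counts with simple arithmetic. The sketch below is illustrative only: the per-million-token rates are assumptions that do not appear anywhere on this page, chosen because they reproduce the reported figure, and `estimate_cost` is a hypothetical helper.

```python
# Values taken from the run metadata.
NATIVE_TOKENS_PROMPT = 93
NATIVE_TOKENS_COMPLETION = 1113
REPORTED_TOTAL_COST = 0.02829

# ASSUMPTION: USD per million tokens; these rates are not stated on the page.
ASSUMED_INPUT_USD_PER_MTOK = 5.0
ASSUMED_OUTPUT_USD_PER_MTOK = 25.0


def estimate_cost(prompt_tokens, completion_tokens, input_rate, output_rate):
    """Estimate cost in USD from token counts and per-million-token rates."""
    return (prompt_tokens * input_rate + completion_tokens * output_rate) / 1_000_000


estimated_cost = estimate_cost(
    NATIVE_TOKENS_PROMPT,
    NATIVE_TOKENS_COMPLETION,
    ASSUMED_INPUT_USD_PER_MTOK,
    ASSUMED_OUTPUT_USD_PER_MTOK,
)
print(f"{estimated_cost:.6f}")  # prints 0.028290
```

Under these assumed rates the completion tokens account for nearly all of the cost, so the estimate is far more sensitive to the output rate than to the input rate.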
Evaluation details
Result | Evaluator | Details | Meta Data
97.63% | Matches word count | n/a | neededClean: false; words: 507
0.00% | Dialogue to Total Word Ratio | Ratio: 70.33%, Deviation: 20.33% | neededClean: false; wordsTotal: 509; wordsDialogue: 358
48.8138% | Overall | |
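The ratio and deviation above follow directly from the word counts: 358 dialogue words out of 509 total, compared against the 50% target. The sketch below shows one way such a measurement could be made. It assumes dialogue is whatever sits between straight double quotes and that words are whitespace-separated; the benchmark's actual tokenization and scoring rules are not shown on this page, and `dialogue_stats` is a hypothetical name.

```python
import re


def dialogue_stats(text, target=0.5):
    """Count whitespace-separated words inside straight double quotes vs. all words.

    ASSUMPTION: this approximates the evaluator; its real rules are not documented here.
    """
    total_words = len(text.split())
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    ratio = dialogue_words / total_words if total_words else 0.0
    return {
        "wordsTotal": total_words,
        "wordsDialogue": dialogue_words,
        "ratio": ratio,
        "deviation": abs(ratio - target),
    }


# Small illustrative sample, not the full model response.
sample = 'Molly grinned. "We become entertainment." Bill sighed. "Absolutely not."'
stats = dialogue_stats(sample)

# Reproduce the reported figures from the counts in the table above.
reported_ratio = 358 / 509
reported_deviation = abs(reported_ratio - 0.5)
print(f"Ratio: {reported_ratio:.2%}, Deviation: {reported_deviation:.2%}")
```

Counting only quoted spans means dialogue tags such as "Bill said" count toward the total but not toward dialogue, which is one plausible reading of how the evaluator separates the two.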