NC Bench
Run: 2026-06-01-01KT1H3EJM5Q8GYWE09PDTWV6Q
Run Details
Model: Claude Opus 4.8 (Reasoning, Low)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 19.9s
Cost: $0.028440
Provider: Anthropic
Evaluation summary
Score      Evaluator
99.98%     Matches word count
19.54%     Dialogue to Total Word Ratio
59.7625%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the kitchen table, smoothing out the creases with the reverence of a surgeon preparing for a delicate operation. The crayon drawing depicted a backyard, a bouncy castle, and what appeared to be a lopsided gazebo. "Where did you even get this?" Bill asked, squinting at the artwork. "Reconnaissance," Molly said. "I borrowed it from the birthday boy's older sister. Told her I was an art critic." Bill picked up the paper and turned it sideways, then upside down. A glittery sticker fell off and landed in his coffee. He fished it out, examined it, and decided not to ask. The plan was elegant in its simplicity. Saturday afternoon, two o'clock, the Hendersons would host approximately fifteen seven-year-olds for their son Timmy's birthday extravaganza. According to Molly's intelligence, the present table would groan under the weight of wrapped boxes, gift bags, and at least three remote-control vehicles she'd already promised to a fence named Antonio. "Here's how it goes," Molly said, tapping the blueprint. "I create a distraction at the piñata. You slip in through the side gate, grab the loot, and we rendezvous at the getaway minivan." "A minivan." "Nobody suspects a minivan, Bill. It's the perfect crime." Bill leaned back in his chair, arms crossed. He'd been in this business for twenty years. He'd robbed banks, casinos, and once, memorably, a yacht belonging to a minor European prince. This felt different. This felt wrong. "We're stealing from children," he said. "We're stealing from children's *parents*," Molly corrected. "There's a difference. Those parents bought overpriced plastic garbage that'll be forgotten by Tuesday. We're practically doing them a favor. Decluttering, really." She had a point, Bill thought, though he hated admitting it. He'd seen the inside of those gift bags. Last month they'd practically given away a perfectly good drone at a six-year-old's party in the suburbs. "What about the cake?" Bill asked. 
"What about it?" "Do we get cake?" Molly stared at him for a long moment. "We are professional thieves, Bill. We do not stop for cake." "I'm just saying, if there's cake right there—" "No cake." Bill sighed and returned his attention to the blueprint. He noticed something circled in red marker near the gazebo. "What's this?" "The clown," Molly said gravely. "Goes by the name Bubbles. Word on the street is he's an off-duty cop moonlighting for extra cash. We avoid Bubbles at all costs." "A clown cop." "The most dangerous kind. Now, synchronize your watch." Bill looked down at his wrist, where he wore no watch. He'd pawned it last week to cover the minivan rental. Molly noticed, rolled her eyes, and pulled a child's plastic timepiece from her pocket, sliding it across the table. It featured a cartoon dinosaur and lit up when pressed. "Found it in the recon materials," she explained. Bill strapped it on, pressed the button, and watched the dinosaur glow. "We're going to prison," he muttered. "Probably," Molly agreed cheerfully. "But we'll have presents."
created_at
"2026-06-01T12:03:57.657Z"
6/1/2026, 12:03:57 PM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
983
moderation_latency
(null)
generation_time
19862
tokens_prompt
77
tokens_completion
811
native_tokens_prompt
93
native_tokens_completion
1119
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.02844
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id
"msg_01Vhu7xeBXZfNwgdWyUDeZBM"
is_byok
false
latency
977
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780315437-HlgAOxUxy08MbPTwYIqF"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780315437-CW9aUcwNaLDj9qdXjY4k"
upstream_id
"msg_01Vhu7xeBXZfNwgdWyUDeZBM"
total_cost
0.02844
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
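The page does not state per-token prices. As a hedged sanity check, the sketch below shows that assumed rates of $5 per million prompt tokens and $25 per million completion tokens reproduce the reported total_cost from the native token counts. The rates and all variable names are assumptions chosen to match the figures above, not quoted pricing. The same block derives an approximate throughput from generation_time, assuming that field is in milliseconds (it agrees with the 19.9s duration).

```python
# Values copied from the run metadata above.
NATIVE_TOKENS_PROMPT = 93
NATIVE_TOKENS_COMPLETION = 1119
GENERATION_TIME_MS = 19862

# ASSUMPTION: USD per million tokens. These rates are not shown on the page;
# they are simply values that reproduce the reported total_cost.
PROMPT_RATE_USD_PER_M = 5.00
COMPLETION_RATE_USD_PER_M = 25.00


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Return the estimated cost in USD for one request."""
    return (
        prompt_tokens * PROMPT_RATE_USD_PER_M
        + completion_tokens * COMPLETION_RATE_USD_PER_M
    ) / 1_000_000


estimated_cost = estimate_cost(NATIVE_TOKENS_PROMPT, NATIVE_TOKENS_COMPLETION)

# Completion tokens generated per second of generation time.
tokens_per_second = NATIVE_TOKENS_COMPLETION / (GENERATION_TIME_MS / 1000)

print(f"estimated cost: ${estimated_cost:.6f}")
print(f"throughput: {tokens_per_second:.1f} tokens/s")
```

Note that the estimate uses the native token counts (93 and 1119) rather than tokens_prompt and tokens_completion (77 and 811); only the native counts match the reported cost under these assumed rates.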
Evaluation details
Result     Evaluator                      Details                            Meta Data
99.98%     Matches word count             n/a                                neededClean: false; words: 498
19.54%     Dialogue to Total Word Ratio   Ratio: 35.71%, Deviation: 5.71%    neededClean: false; wordsTotal: 504; wordsDialogue: 180
59.7625%   Overall
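The ratio and deviation figures follow directly from the reported word counts: 180 dialogue words out of 504 total, measured against the 30% target from the scenario. The sketch below reproduces that arithmetic. The `count_words` helper is a hypothetical illustration of how dialogue might be counted (text inside straight double quotes, whitespace-separated words); the evaluator's actual tokenization is not shown on the page, and the mapping from the deviation to the 19.54% score is not stated, so it is not modelled here.

```python
import re

TARGET_RATIO_PCT = 30.0  # from the scenario: "30% dialogue"


def count_words(text: str) -> tuple[int, int]:
    """Hypothetical counter: return (dialogue_words, total_words).

    ASSUMPTION: dialogue is any text between straight double quotes and
    words are whitespace-separated. The real evaluator may differ.
    """
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue_words = sum(len(span.split()) for span in quoted_spans)
    total_words = len(text.split())
    return dialogue_words, total_words


def dialogue_ratio_pct(dialogue_words: int, total_words: int) -> float:
    """Dialogue words as a percentage of all words."""
    return 100.0 * dialogue_words / total_words


# Reported metadata: wordsDialogue = 180, wordsTotal = 504.
ratio = dialogue_ratio_pct(180, 504)
deviation = abs(ratio - TARGET_RATIO_PCT)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")

# The overall score appears to be the mean of the two evaluator scores.
# The displayed scores are rounded, so this only approximates 59.7625%.
overall = (99.98 + 19.54) / 2
print(f"Overall (from rounded scores): {overall:.2f}%")

# Small usage example for the hypothetical counter.
sample = 'Molly sighed. "No cake." Bill nodded.'
sample_counts = count_words(sample)
```

The two evaluators report different totals for the same response (498 words versus 504), which suggests they tokenize words differently; a counter like the one above should not be expected to match either number exactly.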