NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1H7GAX2EDF9XJKXTZYHAX6
Run Details
Model:
Claude Opus 4.8 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
21.6s
Cost:
$0.028565
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
99.74%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.8722%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the crumpled birthday invitation across the table like it was a treasure map. "Okay, Bill. Operation Sugar Rush. The party starts at two." "I still don't understand why we're stealing presents from a child," Bill said, scratching his head. "Because, you beautiful idiot, rich kids get the good stuff. We're talking unopened tablets, designer sneakers, gift cards." She tapped the invitation. "Little Timmy Bradford is turning eight, and his daddy owns half the dealerships in town." Bill leaned back in his chair, which groaned under his considerable weight. "And how exactly do two grown adults blend into a kid's party?" "Disguises," Molly said, grinning. "You're going to be the entertainment." "The entertainment?" "A clown." Bill's face fell. "Absolutely not. You know I have a fear of clowns." "You're afraid of *being* a clown?" "It's the same fear, Molly. It transfers." She rolled her eyes and pulled a wrinkled costume from a duffel bag. "It's a rainbow wig and a red nose. You'll be fine. Meanwhile, I'll pose as a parent. While you're juggling or whatever, I sneak into the gift room and load everything into garbage bags." "Juggling?" Bill held up his hands. "I can barely catch a cold." "Then make balloon animals." "I made a balloon dog once. It looked like a tumor with ears." Molly massaged her temples. "The kids won't care. They're eight. They think dirt is delicious." She handed him the costume. "Just keep them distracted for fifteen minutes. That's all I need." Bill held the wig at arm's length like it was radioactive. "And if something goes wrong?" "What could possibly go wrong at a children's birthday party?" "That's exactly the kind of thing people say right before everything goes wrong." "We have an exit strategy," she said, ignoring him. "Side gate, behind the bouncy castle. I'll have the van running." "You're parking the getaway van behind a bouncy castle." "It's brilliant. Nobody suspects the bouncy castle." Bill sighed and tugged the rainbow wig over his head. He looked into the small mirror on the wall and immediately recoiled. "I hate everything about this." "You look adorable. Terrifying, but adorable." Molly clipped the red nose onto his face. "Now, the magic words if we need to bail. If I yell 'piñata,' you drop everything and run." "Why piñata?" "Because it'll blend in. Everyone yells piñata at these things." Bill honked the nose experimentally. "This is the dumbest plan you've ever cooked up, and you once tried to rob a bank using a kazoo." "That kazoo created a *diversion*." "It got us arrested." "It got *you* arrested. I escaped through the air vents." She slung the empty duffel bags over her shoulder. "Now come on, Bozo. We've got presents to liberate." Bill stood, his enormous shoes flopping against the floor. "If I get attacked by a swarm of sugar-crazed children, I'm telling them you're my accomplice." "Deal," Molly said, opening the door. "Just remember—smile, juggle, and whatever you do, don't cry in front of the birthday boy."
created_at
"2026-06-01T12:06:10.529Z"
6/1/2026, 12:06:10 PM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1134
moderation_latency
725
generation_time
20834
tokens_prompt
77
tokens_completion
810
native_tokens_prompt
93
native_tokens_completion
1124
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.028565
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id
"msg_017DpYaeNzNFbsDCBGzSPbRR"
is_byok
false
latency
1122
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780315570-q3iZpTiZKUblVNEzjknL"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780315570-zwMYKIisxUAb0NvetsEO"
upstream_id
"msg_017DpYaeNzNFbsDCBGzSPbRR"
total_cost
0.028565
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.74%
Matches word count
n/a
neededClean
false
words
504
0.00%
Dialogue to Total Word Ratio
Ratio: 71.88%, Deviation: 21.88%
neededClean
false
wordsTotal
505
wordsDialogue
363
49.8722%