NC Bench
Run: 2026-05-29-01KSSJNBMXGHR5R4C85ZANFV4N
Run Details
Model: Claude Opus 4.8 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 50% dialogue
Duration: 20.3s
Cost: $0.029490
Provider: Anthropic
Evaluation summary
Score      Evaluator                       Details Link
99.98%     Matches word count              (details)
0.00%      Dialogue to Total Word Ratio    (details)
49.9937%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the kitchen table, smoothing out the wrinkles with the reverence of a surgeon prepping for an operation. Bill leaned over her shoulder, chewing a toothpick. "Okay," Molly said. "The Hendersons' backyard. Saturday. Two o'clock. Timmy's seventh birthday." "Seven," Bill repeated. "That's a lot of presents." "That's the *point*, Bill. Seven-year-olds are spoiled rotten these days. We're talking iPads, LEGO sets, maybe a drone." Bill straightened up. "A drone? I've always wanted a drone." "Focus." Molly tapped the paper. "Here's the bounce castle. Here's the snack table. And here—" she circled an X in red marker, "—is the present pile. Mount Treasure." "What about security?" "Twenty children and four adults. The adults will be distracted by the clown." Bill's toothpick dropped from his mouth. "There's a clown?" "There's always a clown." "I hate clowns, Molly. You know I hate clowns." "You robbed an armored truck last spring." "Armored trucks don't have giant red noses and squeaky shoes." Molly pinched the bridge of her nose. "The clown is *our friend*, Bill. While he's twisting balloon animals, nobody's watching the presents. We walk in, we grab the loot, we walk out. Five minutes, tops." Bill paced the kitchen. "And how exactly do two grown adults walk into a child's birthday party without anyone noticing?" "That," Molly said, grinning, "is the genius part." She reached under the table and produced a plastic bag, dumping its contents out. Two costumes tumbled free—one a sparkly fairy outfit, the other a foam dinosaur suit. Bill stared. "No." "You're the entertainment. I'm the face-painter." "I am not wearing that." "Then you're not getting the drone." Bill picked up the dinosaur head and turned it over in his hands like it was a dead animal. "Why do I have to be the dinosaur?" "Because you've got the build for it. Now, the getaway. Our van is parked here, by the fence. 
We toss the presents over, hop in, and we're gone before the cake's even cut." "What if a kid sees us?" Molly shrugged. "Who's a kid gonna believe? Themselves, or a magical dinosaur and a beautiful fairy?" Bill had to admit, grudgingly, that the logic was airtight. He tugged the dinosaur head on and his voice came out muffled. "This smells like feet." "You look adorable. Now repeat the plan." "Walk in. Distract the children. Grab Mount Treasure. Toss it over the fence. Don't get caught by the clown." "And?" Bill sighed inside the costume. "And under no circumstances do I eat any of the cake, no matter how good it looks." "Exactly. We are professionals." "Professionals dressed as a fairy and a dinosaur." Molly slipped on her wings and admired herself in the toaster's reflection. "The greatest heist crew this neighborhood has ever seen." Bill waddled toward the door, tail dragging behind him. "If I trip over these feet and faceplant in front of toddlers, I'm quitting crime forever." "Sure you will," Molly said, grabbing the van keys. "Right after the drone."
created_at
"2026-05-29T09:57:17.608Z"
5/29/2026, 9:57:17 AM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1256
moderation_latency
(null)
generation_time
20225
tokens_prompt
77
tokens_completion
811
native_tokens_prompt
93
native_tokens_completion
1161
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.02949
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id
"msg_01ScVk8bZX468TV4J2g3hF2n"
is_byok
false
latency
1256
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780048637-4FMivlarG7lptHOefWtV"
session_id
(null)
api_type
"completions"
id
"gen-1780048637-QCQ2SXL8pR76J1YH3pe9"
upstream_id
"msg_01ScVk8bZX468TV4J2g3hF2n"
total_cost
0.02949
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
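The cost and timing fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not the billing logic of the provider or the router: the per-million-token rates are assumptions that do not appear on this page, chosen only because they reproduce the reported total_cost from native_tokens_prompt and native_tokens_completion. The function names are hypothetical.

```python
# ASSUMED rates in USD per million tokens; they are not stated in this report.
ASSUMED_PROMPT_USD_PER_MTOK = 5.00
ASSUMED_COMPLETION_USD_PER_MTOK = 25.00


def estimate_cost(native_prompt_tokens: int, native_completion_tokens: int) -> float:
    """Estimate the request cost in USD from native token counts."""
    prompt_cost = native_prompt_tokens * ASSUMED_PROMPT_USD_PER_MTOK / 1_000_000
    completion_cost = native_completion_tokens * ASSUMED_COMPLETION_USD_PER_MTOK / 1_000_000
    return prompt_cost + completion_cost


def tokens_per_second(native_completion_tokens: int, generation_time_ms: int) -> float:
    """Average output throughput over the reported generation time."""
    return native_completion_tokens / (generation_time_ms / 1000.0)


# Values copied from the metadata above.
cost = estimate_cost(native_prompt_tokens=93, native_completion_tokens=1161)
throughput = tokens_per_second(native_completion_tokens=1161, generation_time_ms=20225)

print(f"estimated cost: ${cost:.6f}")            # estimated cost: $0.029490
print(f"throughput: {throughput:.1f} tokens/s")  # throughput: 57.4 tokens/s
```

Under these assumed rates the estimate agrees with the reported total_cost of 0.02949, which suggests billing is based on the native token counts (93 and 1161) rather than the normalized ones (77 and 811).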
Evaluation details
Result     Evaluator                       Details                             Meta Data
99.98%     Matches word count              n/a                                 neededClean: false; words: 502
0.00%      Dialogue to Total Word Ratio    Ratio: 64.36%, Deviation: 14.36%    neededClean: false; wordsTotal: 505; wordsDialogue: 325
49.9937%   Overall
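The reported ratio and deviation follow directly from the wordsDialogue and wordsTotal counts. Here is a minimal sketch of that arithmetic, assuming the deviation is the absolute difference from the 50% target named in the scenario. The function names are hypothetical, and the evaluator's actual word-splitting rules and its mapping from deviation to a score are not shown on this page.

```python
def dialogue_ratio_pct(words_dialogue: int, words_total: int) -> float:
    """Share of words that fall inside dialogue, as a percentage."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_pct(ratio_pct: float, target_pct: float = 50.0) -> float:
    """Absolute distance from the target ratio (ASSUMED to be 50%)."""
    return abs(ratio_pct - target_pct)


# Counts copied from the evaluation details above.
ratio = dialogue_ratio_pct(words_dialogue=325, words_total=505)
deviation = deviation_pct(ratio)

print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# Ratio: 64.36%, Deviation: 14.36%
```

The overall figure of 49.9937% is consistent with a plain average of the two evaluator scores (roughly 99.9874% and 0.00%), though the page does not state how the overall score is aggregated.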