NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-05-29-01KSSJVG253RE3KB6PBEHA7F2N
Run Details
Model:
Claude Opus 4.8 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
20.8s
Cost:
$0.029340
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
47.9934%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the table, smoothing it with the reverence of a surgeon prepping for transplant. Bill leaned over her shoulder, chewing a toothpick that had long since lost its flavor. "So this is it," he said. "The Hendersons' backyard. Ground zero." "Keep your voice down," Molly hissed. "We're in a library." "Nobody's here but us and that old guy asleep in the biography section." Molly tapped the paper. "Focus. The party starts at two. Little Timmy turns seven. There will be presents stacked on a table approximately here, beneath a banner that reads 'Happy Birthday Champ.'" "How do you know about the banner?" "I cased the joint. Posed as a clown." Bill straightened up. "You? A clown?" "I contain multitudes, William." He held up his hands. "Okay, okay. So we waltz in, grab the loot, and waltz out. What's the catch?" "The catch," Molly said gravely, "is forty children hopped up on sugar and a bounce castle the size of a small embassy." Bill paled. "Forty?" "At minimum. Plus parents. Plus a magician named Mr. Wiggles who, according to my reconnaissance, does not take kindly to competition." "I hate magicians." "Everyone hates magicians, Bill. That's not strategy." She produced two laminated badges. "We go in as entertainment. You're the balloon artist. I'm the face painter." Bill examined his badge. "It says my name is 'Sparkles McGee.'" "Embrace it." "I don't know how to make balloon animals." Molly stared at him. "We've been planning this for three weeks. When were you going to mention that?" "It felt like a now problem." She pinched the bridge of her nose. "Just twist them and call them abstract art. Children are idiots. They'll love it." Bill nodded slowly, gaining confidence. "Abstract art. Yeah. I can do menacing shapes." "Not menacing. Whimsical." "Whimsically menacing." "Bill." "Fine." He pocketed the badge. "So once we're inside, how do we move the presents without anybody noticing two grown adults hauling away a child's entire birthday haul?" Molly smiled, the slow smile of a woman who had thought of everything. "The piñata." "The piñata?" "At three o'clock sharp, they hang a piñata shaped like a unicorn. Everyone gathers in the front yard, blindfolds get distributed, and chaos ensues. That's our window. Ninety seconds of pure, candy-fueled pandemonium." "And we just—load up the presents during the piñata?" "Into the duffel bags disguised as bounce-castle equipment. We're gone before the unicorn's head hits the ground." Bill sat back, genuinely impressed. "Molly, that's beautiful. That might be the most elegant thing you've ever planned." "I know." "There's just one thing." "What?" He leaned in. "Are we really doing this? Stealing presents from a seven-year-old?" Molly was quiet for a long moment. Then she rolled up the blueprint. "No," she sighed. "No, we're not. I've gotten soft." Bill exhaled in relief. "Thank God. Want to go rob the mall like normal people?" "Yeah." She stood. "But I'm keeping the clown costume."
created_at
"2026-05-29T10:00:38.734Z"
5/29/2026, 10:00:38 AM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1157
moderation_latency
(null)
generation_time
20537
tokens_prompt
77
tokens_completion
822
native_tokens_prompt
93
native_tokens_completion
1155
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.02934
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
is_byok
false
latency
194
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
429
1
endpoint_id
"dfc0e5bd-d703-4fe2-a7bb-655eb95d5441"
id
"msg_0198PuuArPG3m7y9K1dtbyrJ"
is_byok
false
latency
1157
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780048838-WqTCzodocmOfsKHZqSGe"
session_id
(null)
api_type
"completions"
id
"gen-1780048838-h0lCc0OCINmti4gxryfc"
upstream_id
"msg_0198PuuArPG3m7y9K1dtbyrJ"
total_cost
0.02934
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
492
0.00%
Dialogue to Total Word Ratio
Ratio: 74.80%, Deviation: 24.80%
neededClean
false
wordsTotal
496
wordsDialogue
371
47.9934%