NC Bench
Run: 2026-06-01-01KT1HJ7YFYWGZ7XAS5JJJXV9S
Run Details
Model:
Claude Opus 4.8 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
20.1s
Cost:
$0.026715
Provider:
Anthropic
Evaluation summary
Score       Evaluator
99.74%      Matches word count
0.00%       Dialogue to Total Word Ratio
49.8723%    Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the kitchen table, smoothing out the wrinkles with the side of her hand. It was, technically, a birthday party invitation, but she'd circled the important parts in red marker. "Okay," she said. "The target is one Timmy Henderson, age seven. The objective is the present table." Bill leaned over her shoulder, squinting. "Why is there a clown drawn on the invitation?" "That's not on the invitation. I drew that." "Why?" "Because clowns are scary, Bill. It sets the tone." Bill pulled up a chair and sat down heavily. "I just want it on the record that I think robbing a child is the lowest we've ever gone." "Lower than the retirement home?" "We gave the dentures back." "After we lost the bet." Molly tapped the paper. "Focus. The party starts at two. Cake at three. That's our window." "Why during cake?" "Because, my dear Bill, when there is cake, all eyes are on the cake. Nobody watches the presents during cake. It's basic psychology." Bill rubbed his chin. "And how do we get in? Last I checked, two adults showing up to a kid's party without a kid looks suspicious." "That's why," Molly said, reaching under the table, "we have these." She produced two enormous cardboard heads. One was a purple dinosaur. The other was something that may have once been a cat. Bill stared. "Absolutely not." "You'll be Mr. Whiskers. I'll be Dino." "I am a forty-six-year-old man, Molly." "And Mr. Whiskers is timeless." She shoved the cat head into his arms. "Parents love entertainers. They'll practically throw the kids at us. We blend in, we work the crowd, and when the cake comes out, we load the presents into the magic bag." Bill turned the head over in his hands. "There's a magic bag?" "It's a garbage bag. But the kids don't know that." "This is a terrible plan." "It's a beautiful plan. Now, the getaway." She slid a photo across the table. It showed a small pink bicycle with tassels and a basket. Bill blinked. "That's our getaway vehicle?" 
"It was the only thing in the garage that wouldn't draw attention at a children's party." "A grown man on a child's bike is the most attention a person can draw." "Not if he's wearing a cat head." Bill set the head down very slowly. "Molly. I love you. We've been partners for eleven years. But I need you to hear me when I say this is the dumbest scheme you have ever cooked up, and you once tried to rob a bank using a sock puppet." "The sock puppet almost worked." "The teller called the puppet 'adorable' and gave it a lollipop." Molly stood, pulling the dinosaur head down over her own. Her voice came out muffled and triumphant. "Mr. Whiskers," she boomed, "we leave at thirteen hundred hours." Bill sighed, looked at the cat head, and looked at the tiny pink bicycle leaning against the wall. "I'm going to need a bigger bag," he muttered.
created_at
"2026-06-01T12:12:02.389Z"
6/1/2026, 12:12:02 PM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1234
moderation_latency
(null)
generation_time
20018
tokens_prompt
77
tokens_completion
763
native_tokens_prompt
93
native_tokens_completion
1050
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.026715
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id
"msg_016bNVA4FzAV3s6FaCx3rqe3"
is_byok
false
latency
1234
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780315922-y18Ymwlvn3dBBafKZAIH"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780315922-DiLLyaEJzueUxn9mLHwU"
upstream_id
"msg_016bNVA4FzAV3s6FaCx3rqe3"
total_cost
0.026715
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
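The reported cost can be sanity-checked against the native token counts in the metadata above. The following is a minimal sketch, assuming rates of $5 per million prompt tokens and $25 per million completion tokens. These rates are not stated anywhere in this run record; they are an assumption chosen because they reproduce the reported total_cost of 0.026715 from 93 native prompt tokens and 1050 native completion tokens. The function name estimate_cost is hypothetical.

```python
from decimal import Decimal

# ASSUMPTION: per-token USD rates inferred from the reported figures,
# not taken from the run record or from any published price list.
ASSUMED_INPUT_RATE = Decimal("5") / Decimal("1000000")    # USD per prompt token
ASSUMED_OUTPUT_RATE = Decimal("25") / Decimal("1000000")  # USD per completion token


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the cost of one request from its native token counts."""
    return (prompt_tokens * ASSUMED_INPUT_RATE
            + completion_tokens * ASSUMED_OUTPUT_RATE)


# native_tokens_prompt = 93, native_tokens_completion = 1050 (from the metadata)
cost = estimate_cost(93, 1050)
print(cost)  # 0.026715
```

Decimal is used instead of float so that the six-decimal cost compares exactly against the reported value.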
Evaluation details
Result      Evaluator                       Details                             Meta Data
99.74%      Matches word count              n/a                                 neededClean: false; words: 504
0.00%       Dialogue to Total Word Ratio    Ratio: 66.27%, Deviation: 16.27%    neededClean: false; wordsTotal: 507; wordsDialogue: 336
49.8723%    Overall
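The ratio and deviation figures in the evaluation details follow directly from the reported word counts. Below is a minimal sketch of that arithmetic, assuming the target ratio is the 50% named in the scenario and that the deviation is the absolute difference from that target. The function names are hypothetical, and the evaluator's actual scoring formula (how a deviation maps to a 0.00% score) is not shown in this record. The last part assumes the overall score is an unweighted mean of the two evaluator scores, which is consistent with the displayed values but not confirmed by the source.

```python
TARGET_RATIO = 0.50  # from the scenario: "Write 500 words with 50% dialogue"


def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Fraction of all words that appear inside dialogue."""
    return words_dialogue / words_total


def unweighted_mean(scores: list[float]) -> float:
    """ASSUMPTION: the overall score is a plain average of evaluator scores."""
    return sum(scores) / len(scores)


# wordsDialogue = 336, wordsTotal = 507 (from the evaluation meta data)
ratio = dialogue_ratio(336, 507)
deviation = abs(ratio - TARGET_RATIO)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# Ratio: 66.27%, Deviation: 16.27%

# Using the rounded scores as displayed (99.74% and 0.00%):
print(f"{unweighted_mean([99.74, 0.0]):.2f}%")  # 49.87%
```

The mean of the rounded scores gives 49.87%, which agrees with the reported 49.8723% to two decimal places; the remaining digits would come from the unrounded word-count score.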