NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-06-01-01KT1HSKWXSFTVXKYY9EB5H6XX
Run Details
Model:
Claude Opus 4.8 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
20.8s
Cost:
$0.027490
Provider:
Anthropic
Evaluation summary
Score
Evaluator
Details Link
99.74%
Matches word count
(details)
0.03%
Dialogue to Total Word Ratio
(details)
49.8855%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the rickety table, smoothing out the crayon-stained paper she'd swiped from the community center's bulletin board. The floor plan of the Henderson residence looked less like a fortress and more like a place where dreams went to be smeared with frosting. "Are you absolutely certain about this?" Bill asked, peering over her shoulder while nervously twisting his fake mustache. "Robbing a six-year-old feels like a new low, even for us." "It's not robbing a child. It's strategically relocating gifts that haven't even been opened yet." Molly tapped the diagram. "Think of it as preventing spoiled behavior. We're practically heroes." Bill remained unconvinced, his eyes drifting to the crude drawing of a unicorn someone had scribbled in the corner of the page. He'd never been good at justifying their crimes, which was precisely why Molly handled the moral gymnastics. She circled the backyard with a red marker. "The bounce house is here. That's our entry point. The presents will be stacked on this table, near the piƱata." "And the parents?" "Distracted. There's a magician scheduled for two o'clock." She grinned, clearly proud of her reconnaissance. "Nobody watches the gifts when there's a guy pulling rabbits out of hats. Basic human psychology." Bill paced the length of their cramped hideout, stepping over empty pizza boxes and yesterday's regrets. The plan was technically sound, but something gnawed at him beneath his cheap thrift-store blazer. "What about the dog?" he finally asked. Molly's confident expression flickered. "What dog?" "The Hendersons have a golden retriever named Biscuit. I saw it in their holiday photos online. Friendly-looking thing, but you never know." She waved him off dismissively. "Biscuit won't be a problem. We'll bring bacon. Every dog has a price, and that price is usually breakfast meat." Bill sank into a folding chair that groaned in protest. He'd been a thief for fifteen years, pilfered diamonds from galas and yanked wallets from oblivious tourists, but never once had his target list included balloon animals and party favors. Somehow, their criminal empire had shrunk to this pathetic dimension. "We've really hit rock bottom, haven't we?" he murmured. "Speak for yourself. I see opportunity." Molly began packing a duffel bag with suspicious enthusiasm. "Do you know what kids get these days? Tablets. Gaming consoles. Premium action figures still in their original packaging. We could clear three grand, easy." That number caught his attention. Three grand would cover rent, plus the gambling debt he hadn't mentioned to her yet. "Fine," Bill conceded, rising to his feet with renewed determination. "But if a toddler starts crying, we abort. I have limits, Molly. I have a soul somewhere in here." "Your soul can buy us groceries after we pull this off." She tossed him a clown nose, the centerpiece of their disguise. "Now put this on. We're going to a party." He stared at the red foam ball in his palm, contemplating every life decision that had led to this exact moment, then sighed and snapped it onto his face.
created_at
"2026-06-01T12:16:04.003Z"
6/1/2026, 12:16:04 PM
model
"anthropic/claude-4.8-opus-20260528"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1206
moderation_latency
101
generation_time
20669
tokens_prompt
77
tokens_completion
810
native_tokens_prompt
93
native_tokens_completion
1081
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.02749
router
(null)
provider_responses
0
endpoint_id
"b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
id
"msg_01Q2Bccx6RcVfLgkYJGoJLpX"
is_byok
false
latency
1205
model_permaslug
"anthropic/claude-4.8-opus-20260528"
provider_name
"Anthropic"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer
(null)
request_id
"req-1780316164-v1JK3mXepaRSFTHcIYOa"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1780316164-F3tzohWBRpZugyTQiqTp"
upstream_id
"msg_01Q2Bccx6RcVfLgkYJGoJLpX"
total_cost
0.02749
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.74%
Matches word count
n/a
neededClean
false
words
504
0.03%
Dialogue to Total Word Ratio
Ratio: 42.83%, Deviation: 12.83%
neededClean
false
wordsTotal
509
wordsDialogue
218
49.8855%