NC Bench
Run: 2026-05-31-01KSYXB2B41BBZ8YWZMQ8YYFXQ
Run Details
Model: Claude Opus 4.8 (Reasoning, High)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 20.0s
Cost: $0.027415
Provider: Anthropic
Evaluation summary
Score | Evaluator
99.92% | Matches word count
0.11% | Dialogue to Total Word Ratio
50.0166% | Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the crayon-drawn map across the table and tapped it with one finger. "Okay, Bill. The party starts at two. We hit it at two-fifteen, while they're distracted by the clown." Bill leaned closer, squinting. "There's a clown?" "There's always a clown." "I hate clowns." "Everyone hates clowns, Bill. That's not relevant to the operation." Bill straightened up and crossed his arms. "It's relevant to me. What if he sees us?" "He's a children's entertainer named Mr. Boppo. He's not going to alert Interpol." "You don't know that. Clowns have a network. They communicate through the balloons." Molly stared at him for a long moment. "I genuinely don't know how to respond to that, so I'm going to keep going. We go in through the back, near the bouncy castle—" "The bouncy castle's a death trap." "It's a bouncy castle, Bill. For five-year-olds." "It's an inflatable structure with unstable footing. One wrong step and I'm face-down in front of thirty children and a balloon-coded clown." Molly pinched the bridge of her nose. "Can we focus? The presents are stacked on the gift table by the window. We grab everything, load it into the van, and we're gone before anyone cuts the cake." "What kind of cake?" "Why does that matter?" "Because if it's ice cream cake, I want to time it so we leave with a slice." "We are not stopping for cake." "You never let me have anything nice." Molly slid a photograph across the table. "This is the target. Birthday boy. Turning six. His grandparents are loaded, so these presents are premium. We're talking name-brand toys, sealed video games, possibly a tablet." Bill picked up the photo and frowned. "He looks sad." "He does not look sad." "He's got sad little eyes. Like a tiny businessman who's lost everything." "Bill." "I'm just saying, we're stealing presents from a sad tiny businessman. That's dark, Molly. Even for us." "Last week you stole a wheelchair." "That was for the plot! It was part of a larger plan!" "It was outside a hospital." 
Bill threw up his hands. "We agreed never to bring up the wheelchair!" Molly took a deep, steadying breath. "Let's just review the exit. Once we have the presents, we go out the side gate, into the alley, and the van's waiting. Two minutes, in and out." "What's my alias?" "You don't need an alias. We're wearing masks." Bill's face lit up. "What kind of masks? Please say superhero. I want to be the fast one." "They're plain black ski masks." "That's so boring. We're robbing a six-year-old. We should commit to a theme." Molly rolled up the map slowly. "You know what? Forget the theme. Forget the cake. Just be at the van at two o'clock." Bill nodded solemnly, then paused. "One question." "What." "If Mr. Boppo offers me a balloon animal, am I allowed to take it?" Molly walked out of the room without another word. "That's a yes!" Bill called after her. "I'm getting a balloon dog!"
created_at: "2026-05-31T11:40:06.896Z" (5/31/2026, 11:40:06 AM)
model: "anthropic/claude-4.8-opus-20260528"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1092
moderation_latency: (null)
generation_time: 19935
tokens_prompt: 77
tokens_completion: 792
native_tokens_prompt: 93
native_tokens_completion: 1078
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
service_tier: "default"
usage: 0.027415
router: (null)
provider_responses:
  0:
    endpoint_id: "b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
    id: "msg_01PrsiPfZ5vasS85XmHo8B1Q"
    is_byok: false
    latency: 1092
    model_permaslug: "anthropic/claude-4.8-opus-20260528"
    provider_name: "Anthropic"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer: (null)
request_id: "req-1780227606-ijeU2nbXIJyNZR4JhOzj"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1780227606-ogC2ydIuraDowI7kb00a"
upstream_id: "msg_01PrsiPfZ5vasS85XmHo8B1Q"
total_cost: 0.027415
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Anthropic"
response_cache_source_id: (null)
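The generation_time and token counts in the metadata above can be combined into a rough throughput figure. The sketch below is a minimal illustration in Python, assuming generation_time is reported in milliseconds (the page does not state the unit, but 19935 is consistent with the 20.0s duration shown in Run Details). The function name tokens_per_second is hypothetical and not part of NC Bench.

```python
def tokens_per_second(completion_tokens: int, generation_time_ms: int) -> float:
    """Return completion tokens generated per second.

    Assumes generation_time_ms is in milliseconds (an assumption,
    not something the source page states).
    """
    if generation_time_ms <= 0:
        raise ValueError("generation_time_ms must be positive")
    return completion_tokens / (generation_time_ms / 1000.0)


# Values copied from the run metadata.
native_rate = tokens_per_second(1078, 19935)      # native_tokens_completion
normalized_rate = tokens_per_second(792, 19935)   # tokens_completion

print(f"native: {native_rate:.1f} tokens/s")          # native: 54.1 tokens/s
print(f"normalized: {normalized_rate:.1f} tokens/s")  # normalized: 39.7 tokens/s
```

The two rates differ only because the metadata reports two completion token counts (native and normalized) for the same response.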
Evaluation details
Result | Evaluator | Details | Meta Data
99.92% | Matches word count | n/a | neededClean: false, words: 503
0.11% | Dialogue to Total Word Ratio | Ratio: 81.64%, Deviation: 11.64% | neededClean: false, wordsTotal: 512, wordsDialogue: 418
50.0166% | Overall
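The ratio and deviation reported for the dialogue evaluator can be reproduced from the word counts in its metadata. The sketch below is a minimal check in Python, assuming the ratio is wordsDialogue divided by wordsTotal and the deviation is the absolute distance from the 70% target named in the scenario. The function names are illustrative. How NC Bench converts a deviation into the 0.11% score is not shown on this page, so that step is left out. The last lines test one further assumption, that the overall figure is an unweighted mean of the two evaluator scores.

```python
def dialogue_ratio(words_dialogue: int, words_total: int) -> float:
    """Return dialogue words as a percentage of all words."""
    if words_total <= 0:
        raise ValueError("words_total must be positive")
    return 100.0 * words_dialogue / words_total


def deviation_from_target(ratio_pct: float, target_pct: float = 70.0) -> float:
    """Return the absolute distance from the target, in percentage points."""
    return abs(ratio_pct - target_pct)


# Counts copied from the evaluation details.
ratio = dialogue_ratio(418, 512)
deviation = deviation_from_target(ratio)
print(f"Ratio: {ratio:.2f}%, Deviation: {deviation:.2f}%")
# Ratio: 81.64%, Deviation: 11.64%

# Assumption: the overall score is the plain mean of the two evaluator scores.
# The displayed scores are rounded, so only approximate agreement is expected.
overall_estimate = (99.92 + 0.11) / 2
print(f"Overall (estimated): {overall_estimate:.3f}%")
# Overall (estimated): 50.015%
```

The estimate of 50.015% sits within about 0.002 points of the reported 50.0166%, which is consistent with an unweighted mean taken over unrounded scores, though the page does not confirm the formula.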