NC Bench
Run: 2026-06-01-01KT1GV9WR7Q756F2YMBZ5TMBB
Run Details
Model: Claude Opus 4.8 (Reasoning, Low)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 20.8s
Cost: $0.028690
Provider: Anthropic
Evaluation summary
Score      Evaluator
100.00%    Matches word count
0.00%      Dialogue to Total Word Ratio
49.9996%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the table, smoothing out the corners with the reverence of a surgeon. "Okay, Bill. The Henderson party. Saturday, two o'clock. Twenty kids, fifteen presents, one bouncy castle."
"A bouncy castle?" Bill leaned in. "Why do I need to know about the bouncy castle?"
"Because the bouncy castle is our way in."
Bill stared at her. "You want me to bounce my way into a children's party."
"No, you idiot. The bouncy castle creates a blind spot. The parents can't see behind it. That's where the present table is."
"And how do you know all this?"
"I cased the joint. I was there yesterday for the Patterson kid's party. Same backyard, same setup."
"You went to a child's birthday party? Alone? A grown woman with no child?"
Molly waved him off. "I said I was Aunt Linda. Nobody questions Aunt Linda. Everyone has an Aunt Linda."
"That's actually disturbingly clever."
"Now, here's the plan. You distract the parents."
"How?"
"You're the clown."
Bill recoiled like she'd slapped him. "Absolutely not."
"Bill—"
"I have a phobia, Molly. You know this. I told you this in confidence."
"You're afraid of clowns and you want to be a thief who steals from children. Do you hear yourself?"
"It's different when I'm the clown! Then there are *two* of us!"
Molly pinched the bridge of her nose. "Fine. You do the magic show instead. I'll be the clown."
"You can't do magic."
"It's children, Bill. I'll pull a quarter out of a kid's ear and they'll lose their minds."
"And what'll I be doing?"
"Loading the presents into the van while everyone's watching me ruin childhoods one card trick at a time."
Bill scratched his chin. "The van. About the van. It says 'Happy Smiles Party Rentals' on the side."
"Yeah. So we blend in."
"It also says, in smaller letters underneath, 'Bill's Bail Bonds.'"
Molly froze. "What?"
"I bought it used. They only painted over half."
"Bill."
"I thought nobody reads the small print!"
"At a children's party? Where suspicious parents have nothing to do but read small print on suspicious vans? *Bill.*"
"We'll park it backward!"
Molly took a deep, steadying breath. "Okay. New plan. We park backward. I do the clown. You do the magic. We grab the presents during the piñata."
"Why the piñata?"
"Because for ninety glorious seconds, twenty children will be swinging a bat with their eyes closed while sugar-crazed and screaming. Total chaos. Nobody watches the present table during a piñata."
Bill nodded slowly. "That's beautiful, actually."
"I know."
"One question."
"What?"
"What do we do with fifteen children's presents? We can't fence a Barbie Dreamhouse, Molly."
She paused. Looked at the blueprint. Looked at Bill. "...I hadn't thought that far ahead."
"So we're stealing presents we can't sell, from children, while dressed as our two worst fears, in a van that advertises my failed bail bonds business."
Molly rolled up the blueprint. "When you say it out loud, it loses something."
created_at: "2026-06-01T11:59:31.093Z"
model: "anthropic/claude-4.8-opus-20260528"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1009
moderation_latency: (null)
generation_time: 20280
tokens_prompt: 77
tokens_completion: 814
native_tokens_prompt: 93
native_tokens_completion: 1129
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
service_tier: "default"
usage: 0.02869
router: (null)
provider_responses:
  0:
    endpoint_id: "b0d3beaa-5ff7-4666-a749-b1bb2b14c916"
    id: "msg_015PceANyNfQTVcAHYg4MtRF"
    is_byok: false
    latency: 1009
    model_permaslug: "anthropic/claude-4.8-opus-20260528"
    provider_name: "Anthropic"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.16.0; linux; x64))"
http_referer: (null)
request_id: "req-1780315171-xySAO58SFFfNfwEuQVZF"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1780315171-vGTWDpOVcaxQXAIBNuSr"
upstream_id: "msg_015PceANyNfQTVcAHYg4MtRF"
total_cost: 0.02869
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Anthropic"
response_cache_source_id: (null)
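The reported cost can be sanity-checked against the native token counts in the metadata. The sketch below is illustrative only: the per-million-token rates (5.00 USD input, 25.00 USD output) are assumptions that do not appear anywhere on this page. They were chosen because they reproduce the reported total_cost of 0.02869 from native_tokens_prompt (93) and native_tokens_completion (1129). The function name estimate_cost is likewise hypothetical.

```python
def estimate_cost(prompt_tokens: int, completion_tokens: int,
                  usd_per_m_input: float, usd_per_m_output: float) -> float:
    """Linear token pricing: tokens times a per-million-token rate, summed."""
    total_micro_usd = (prompt_tokens * usd_per_m_input
                       + completion_tokens * usd_per_m_output)
    return total_micro_usd / 1_000_000


# Token counts are taken from the run metadata.
# The two rates are ASSUMED for illustration, not stated in the source.
cost = estimate_cost(prompt_tokens=93, completion_tokens=1129,
                     usd_per_m_input=5.0, usd_per_m_output=25.0)
print(f"${cost:.6f}")  # prints $0.028690
```

Under these assumed rates almost all of the cost comes from the completion tokens (1129 × 25 / 1,000,000 = 0.028225 USD), with the prompt contributing 0.000465 USD.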
Evaluation details
Result     Evaluator                      Details                            Meta Data
100.00%    Matches word count             n/a                                neededClean: false; words: 499
0.00%      Dialogue to Total Word Ratio   Ratio: 86.00%, Deviation: 16.00%   neededClean: false; wordsTotal: 500; wordsDialogue: 430
49.9996%   Overall
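The "Dialogue to Total Word Ratio" figures follow directly from the reported counts: 430 dialogue words out of 500 total gives 86%, which is 16 percentage points above the 70% target set in the scenario. The sketch below shows that arithmetic, plus one plausible way to count dialogue words. It is an assumption, not NC Bench's actual evaluator: the helper names, the whitespace-based word split, and the rule that dialogue is whatever sits between double quotes are all hypothetical, so counts on a full response may differ from the benchmark's.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (assumed word definition)."""
    return len(text.split())


def dialogue_word_count(text: str) -> int:
    """Count words inside straight or curly double quotes (assumed rule)."""
    spans = re.findall(r'["\u201c]([^"\u201d]*)["\u201d]', text)
    return sum(count_words(span) for span in spans)


def ratio_and_deviation(words_dialogue: int, words_total: int,
                        target: float = 0.70) -> tuple[float, float]:
    """Return the dialogue ratio and its absolute distance from the target."""
    ratio = words_dialogue / words_total
    return ratio, abs(ratio - target)


# Counts taken from the evaluation meta data on this page.
ratio, deviation = ratio_and_deviation(words_dialogue=430, words_total=500)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints Ratio: 86.00%, Deviation: 16.00%

# Small usage example on a fragment of the response.
sample = 'Molly froze. "What?" "I bought it used."'
print(dialogue_word_count(sample), count_words(sample))  # prints 5 7
```

How the 16-point deviation maps to the 0.00% score is not shown on this page, so the sketch stops at the ratio and deviation.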