NC Bench
Model
Claude Opus 4.6
Test
Dialogue tags
Scenario
Write 500 words with 70% dialogue
Run ID
2026-02-08-01KGYYV5VZRQ987JNQ899EBPXS
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the hand-drawn map across the sticky diner table and tapped her finger on a crude rectangle labeled BOUNCE HOUSE in red crayon. "Alright, Bill, listen up. The party starts at two. We've got a forty-five minute window between the piñata and the cake ceremony to grab the presents and get out clean."
Bill squinted at the map, turning it sideways. "Why is everything drawn in crayon?"
"Because I stole the map from the birthday boy's older sister. She's nine. Remarkably detailed work, actually. She even marked the location of the cooler with the juice boxes."
"We're robbing a six-year-old's birthday party, Molly."
"We're robbing fourteen six-year-olds' parents, technically. Do you know what kind of gifts show up to these things now? We're talking brand-name tablets, designer sneakers, those little electric cars kids drive around in. Last year Tommy Marino got a drone. A drone, Bill. The kid still eats glue."
Bill rubbed his temples. "And the security?"
"One dad who volunteers to man the grill. He'll be on his fourth beer by two-thirty. Oh, and there's a clown."
"A clown."
"Goes by Sprinkles. He does balloon animals and light magic. Honestly, he might be our biggest obstacle. I've seen his work. He commands the attention of every child in a thirty-foot radius. If he spots us near the gift table, he could redirect the whole herd."
"So what's the plan for Sprinkles?"
"You're going to distract him."
"Absolutely not."
"Bill, I need you to walk up to Sprinkles and request an elaborate balloon sculpture. Something complex. A giraffe riding a motorcycle. A full-scale replica of the Titanic. Whatever buys me three minutes at the gift table."
"And you're just going to walk off with an armful of wrapped presents in broad daylight?"
"That's what the uniform is for." Molly reached under the table and pulled out a purple polo shirt with a cartoon kangaroo embroidered on the chest. "Party Palooza Event Staff. I printed the logo myself. I walk in with a clipboard, tell the mom there's been a gift recall, load everything into the van, and we're gone before the candles are lit."
"A gift recall? That's the dumbest thing I've ever heard."
"You haven't met this mom. She's been posting about product safety dangers on Facebook for six straight months. She'll practically help me carry the boxes."
Bill stared at the crayon map for a long moment. A waitress refilled their coffee without making eye contact. "What about the getaway vehicle?"
"Borrowed an ice cream truck. Nothing suspicious about an ice cream truck near a kids' party. Plus it plays music, which covers engine noise."
"Does it have ice cream in it?"
"Twelve flavors. We can sell them at the flea market after we fence the gifts."
Bill picked up the crayon map and examined the nine-year-old's meticulous floor plan one more time. "I want sixty percent."
Molly grinned and slid a purple polo across the table. "You'll get fifty and a juice box. Now let's go rob some children."
created_at
"2026-02-08T15:45:29.249208+00:00"
Sunday, February 8th, 2026 at 3:45:29 PM GMT+00:00
model
"anthropic/claude-4.6-opus-20260205"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1417
moderation_latency
(null)
generation_time
21977
tokens_prompt
61
tokens_completion
663
native_tokens_prompt
75
native_tokens_completion
765
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.0195
router
(null)
provider_responses
0
id
"msg_019YkL7hhpRUMJM5UiJ8KcVY"
status
200
is_byok
false
latency
1417
endpoint_id
"c926c048-563e-44b0-8434-95965bee924c"
provider_name
"Anthropic"
model_permaslug
"anthropic/claude-4.6-opus-20260205"
api_type
"completions"
id
"gen-1770565506-FmUXBzrCwE85Y4qK3e56"
upstream_id
"msg_019YkL7hhpRUMJM5UiJ8KcVY"
total_cost
0.0195
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Anthropic"
Evaluation details
Result | Evaluator | Details | Meta Data
51.9255% | Matches word count | n/a | neededClean: false; words: 516
0.7858% | Dialogue to Total Word Ratio | Ratio: 79.84%, Deviation: 9.84% | neededClean: false; wordsTotal: 516; wordsDialogue: 412
26.3557%
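The dialogue ratio reported above can be reproduced from the two word counts (412 dialogue words out of 516 total, against the 70% target in the scenario). The sketch below is one plausible way to derive such counts from a scene. It is not NC Bench's actual evaluator: the function names, the whitespace-based word splitting, and the rule that dialogue is any text between straight double quotes are all assumptions made for illustration. How the evaluator converts the deviation into the 0.7858% result is not stated in the source and is not modeled here.

```python
import re


def count_words(text: str) -> int:
    """Count whitespace-separated tokens (assumed word definition)."""
    return len(text.split())


def dialogue_stats(text: str, target: float = 0.70) -> dict:
    """Estimate how much of a scene is dialogue.

    Assumption: dialogue is any span enclosed in straight double quotes.
    Curly quotes or single-quoted speech would need extra handling.
    """
    total = count_words(text)
    quoted_spans = re.findall(r'"([^"]*)"', text)
    dialogue = sum(count_words(span) for span in quoted_spans)
    ratio = dialogue / total if total else 0.0
    return {
        "wordsTotal": total,
        "wordsDialogue": dialogue,
        "ratio": ratio,
        "deviation": abs(ratio - target),
    }


# Reproduce the reported figures directly from the counts in the table.
reported_ratio = 412 / 516
reported_deviation = abs(reported_ratio - 0.70)
print(f"Ratio: {reported_ratio:.2%}, Deviation: {reported_deviation:.2%}")
# prints "Ratio: 79.84%, Deviation: 9.84%"
```

The final unlabeled figure, 26.3557%, is consistent with the arithmetic mean of the two evaluator results, (51.9255 + 0.7858) / 2 = 26.35565, although the source does not say how it is computed.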