NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE93RHW59V4ST57B63GSFZ1
Run Details
Model:
Claude Sonnet 5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
24.7s
Cost:
$0.010236
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
2.01%
Matches word count
(details)
0.29%
Dialogue to Total Word Ratio
(details)
1.1485%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly slid into the booth across from Bill, dropping a manila folder on the table like she was starting a poker game with a straight flush. "Tell me you didn't," Bill said, not even looking up from his coffee. "I didn't what?" "You didn't case a six-year-old's birthday party." "Timmy Radcliffe turns seven, actually, and I resent the implication that this is some kind of low-brow operation." Molly flipped open the folder. "Forty-three presents. Forty-three, Bill. His grandmother alone sent six boxes and one of them is the size of a mini-fridge." Bill finally looked up. "We're stealing birthday presents from a seven-year-old." "We're liberating consumer goods from a household that clearly has more money than sense." "That's not liberating, that's just stealing." "Semantics." She tapped the folder. "Security is a father who thinks a Ring doorbell makes him Fort Knox, and a bouncy castle rental company that leaves the generator unattended for twenty-minute stretches." "A bouncy castle." "A big one. Dinosaur themed." Bill rubbed his temples. "How do we even get in? You can't exactly waltz into a kid's party wearing a ski mask." "Obviously not. I already ordered a clown costume." "You're going to be the clown." "I'm going to be the entertainment, Bill. Nobody questions the guy with the balloon animals. I'll do a set, keep the kids distracted in the backyard, and you slip in through the side door disguised as party staff." "Disguised as what staff? There's no staff." "There's a caterer. I saw the van. You're going to be the caterer's assistant who got sent to grab extra folding chairs from the garage, except the garage is actually where they're stashing the gift table until the cake." Bill stared at her. "You've thought about this way too much." "I think about everything too much, that's why we're not in prison." "We were in prison. Twice." "Once was a misunderstanding and once was your fault." Molly slid a photo across the table — a wide shot of a backyard, a table groaning under wrapped boxes, streamers, the whole suburban circus. "Look at that. Look at it, Bill. That's a Nintendo Switch box right there, corner of the table. Somebody's getting a Switch." "He's seven, he doesn't need our professional attention." "He doesn't need forty-three presents either but here we are, righting the balance of the universe." Bill exhaled slowly, already knowing he'd lost. "If I get arrested in a hairnet, I swear to God, Molly." "You won't. In and out in eleven minutes, load it into the van before the pin the tail on the donkey game even wraps up." "It's pin the tail on the *dinosaur*, apparently, given the theme." "See, you're already invested." She snapped the folder shut, grinning. "Wear something with pockets. Big ones. For the Switch." "I hate you." "You love me. Now go find a hairnet."
created_at
"2026-07-01T07:25:50.536Z"
7/1/2026, 7:25:50 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
9263
moderation_latency
(null)
generation_time
24675
tokens_prompt
77
tokens_completion
755
native_tokens_prompt
93
native_tokens_completion
1005
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010236
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01LvP7Q9z6hYovG41SDSyHJE"
is_byok
false
latency
9263
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890750-ZkH47iaDXQrCf9t2jE5N"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890750-rVMooXEDIXg86DjArA3T"
upstream_id
"msg_01LvP7Q9z6hYovG41SDSyHJE"
total_cost
0.010236
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
2.01%
Matches word count
n/a
neededClean
false
words
475
0.29%
Dialogue to Total Word Ratio
Ratio: 80.82%, Deviation: 10.82%
neededClean
false
wordsTotal
485
wordsDialogue
392
1.1485%