NC Bench
Run: 2026-07-01-01KWE9FV8KNX53RJBJSYQMKK7F
Run Details
Model: Claude Sonnet 5 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 70% dialogue
Duration: 16.7s
Cost: $0.010756
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator
0.00% | Matches word count
90.22% | Dialogue to Total Word Ratio
45.1107% | Overall
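The Overall figure looks consistent with an unweighted mean of the two evaluator scores. The sketch below checks that arithmetic. The averaging rule is an assumption inferred from the numbers on this page, not something the page states, and the `scores` dictionary is a hypothetical container for the displayed values.

```python
from statistics import mean

# Evaluator scores as displayed in the summary (already rounded to 2 decimals).
scores = {
    "Matches word count": 0.00,
    "Dialogue to Total Word Ratio": 90.22,
}

# Assumption: Overall is the unweighted mean of the evaluator scores.
overall = mean(scores.values())
print(f"Overall (recomputed): {overall:.2f}%")
```

The recomputed value agrees with the reported 45.1107% to two decimal places. The remaining digits would depend on the unrounded ratio score, which is not shown here.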
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
The back room of Bill's apartment smelled like pizza boxes and desperation, which Molly figured was appropriate given their current career trajectory. "A kid's birthday party," she said, staring at the crayon-drawn map Bill had somehow acquired. "We've hit banks, Bill. Jewelry stores. Remember that yacht?" "The yacht was a disaster." "The yacht was a disaster because YOU forgot boats float on water and move." "That's not the point." Bill jabbed the map with a highlighter, leaving a neon streak across what appeared to be a bounce house. "The point is this kid, Tyler, is turning seven, and his parents apparently invited the entire tri-state area. We're talking forty, fifty presents minimum. iPads. Drones. Probably a hoverboard." "He's seven." "Rich kids get hoverboards at seven, Molly. Welcome to capitalism." Molly pinched the bridge of her nose. "How did you even get this map?" "I told his mom I was the balloon guy." "Are you the balloon guy?" "I could be the balloon guy. I watched a YouTube tutorial." Bill produced a single, sad-looking balloon animal from his jacket pocket. It might have been a dog. It might have been a tumor. "That's horrifying." "It's a dachshund." "Bill." "Look, the plan is simple. I go in as the balloon guy, you go in as—" "As what? I'm not wearing a costume." "Face painter." "I don't know how to paint faces." "You don't know how to make balloon animals either, and yet." Molly grabbed the map, scanning the crayon scrawl. There was a table labeled PRESENTS in bubble letters, right next to something that might have been a piñata or possibly a demonic goat. "This says there's a bouncy castle guarding the loot." "Not guarding. Adjacent to." "Adjacent to is basically guarding when it's full of sugar-crazed seven-year-olds who bite." "They don't bite." "Bill, children absolutely bite. It's like their only natural weapon besides screaming." "Fine. We avoid the bouncy castle. 
We go in, I distract with balloons, you distract with terrible face paint, we grab the presents during cake time when everyone's distracted by the candles." "That's it? That's the whole plan? 'Cake time'?" "Cake time is sacred, Molly. Nobody's watching presents during cake time. It's basic heist physics." "There's no such thing as heist physics." "There's absolutely heist physics. I invented it. Just now." Molly stared at him for a long moment, then at the sad balloon dachshund, then back at the crayon map with its ominous goat-piñata. "We're stealing from seven-year-olds." "We're stealing FOR ourselves. The seven-year-olds are just... logistically involved." "I hate that this is my life." "You said that about the yacht too." "The yacht was a disaster." "We've established the yacht was a disaster." Bill handed her a smaller stack of crayons. "Now. Practice your unicorns. We leave at three."
created_at: "2026-07-01T07:32:26.525Z" (7/1/2026, 7:32:26 AM)
model: "anthropic/claude-sonnet-5-20260630"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1678
moderation_latency: (null)
generation_time: 16650
tokens_prompt: 77
tokens_completion: 770
native_tokens_prompt: 93
native_tokens_completion: 1057
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
service_tier: "default"
usage: 0.010756
router: (null)
provider_responses
0
endpoint_id: "37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id: "msg_01RVbTjynWCwsouDMLCWDzTZ"
is_byok: false
latency: 1678
model_permaslug: "anthropic/claude-sonnet-5-20260630"
provider_name: "Amazon Bedrock"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer: (null)
request_id: "req-1782891146-jvH2Q2KC32OiP0Wg46aH"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1782891146-zxLofQl6tVqG6JDI7fXx"
upstream_id: "msg_01RVbTjynWCwsouDMLCWDzTZ"
total_cost: 0.010756
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
data_region: "global"
Evaluation details
Result | Evaluator | Details | Meta Data
0.00% | Matches word count | n/a | neededClean: false, words: 464
90.22% | Dialogue to Total Word Ratio | Ratio: 68.57%, Deviation: 1.43% | neededClean: false, wordsTotal: 474, wordsDialogue: 325
45.1107% | Overall
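The reported ratio and deviation can be reproduced from the word counts in the Meta Data column. The sketch below does so, and also shows one possible way to count dialogue words in a response. Both helper names are hypothetical. The rule that dialogue is whatever sits between straight double quotes is an assumption, as is the whitespace tokenization; the evaluator's actual counting method is not shown on this page, and how the deviation maps to the 90.22% score is not derived here.

```python
import re

def ratio_from_counts(words_dialogue: int, words_total: int, target: float = 0.70):
    """Return (ratio, absolute deviation from target) for the given word counts."""
    ratio = words_dialogue / words_total
    return ratio, abs(target - ratio)

def dialogue_stats(text: str) -> dict:
    """Count whitespace-separated words inside straight double quotes vs. all words.

    Assumption: every span between a pair of straight double quotes is dialogue.
    """
    quoted_spans = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(span.split()) for span in quoted_spans)
    words_total = len(text.split())
    return {"wordsTotal": words_total, "wordsDialogue": words_dialogue}

# Counts copied from the evaluation details above; target taken from the scenario.
ratio, deviation = ratio_from_counts(325, 474)
print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")

# Small illustrative sample, not taken from the model response.
sample = '"Hello there," she said. "Go now."'
print(dialogue_stats(sample))
```

With the counts from this run, the first print reproduces the "Ratio: 68.57%, Deviation: 1.43%" line. Note that the two evaluators report different totals (464 and 474 words), so their tokenization rules evidently differ, and a simple counter like this one may not match either exactly.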