NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWEA2YMETAHYXF06QJRH5AH7
Run Details
Model:
Claude Sonnet 5 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
16.7s
Cost:
$0.011186
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
93.65%
Matches word count
(details)
11.23%
Dialogue to Total Word Ratio
(details)
52.4401%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
# The Great Birthday Heist Molly slammed the blueprint down on the table, nearly knocking over Bill's coffee. "Twelve presents, Bill. Twelve. All wrapped in that shiny paper that makes noise when you breathe near it." "So we breathe less." "You can't breathe less, Bill. That's not how breathing works." "I held my breath for two minutes once. During the Hendricks job." "You passed out during the Hendricks job." "I was resting my eyes." Molly rubbed her temples. "The party starts at three. Bounce house goes up at two-thirty. That's our window." "What about the clown?" "What clown?" "Every kid's party's got a clown, Molly. It's basic heist reconnaissance. You don't know your clown, you don't know your enemy." "There's no intel suggesting a clown." "That's exactly what a clown would want you to think." Molly stared at him for a long moment. "We're stealing birthday presents from seven-year-olds, and you're worried about a hypothetical clown." "I've got a thing about clowns." "Since when?" "Since Cleveland." "We've never been to Cleveland." "Exactly." Bill leaned back, arms crossed, like this settled everything. Molly decided not to pursue that thread. "Okay. Focus. I go in disguised as the balloon artist. You're on present extraction, disguised as—" "The dad who forgot to bring a gift and is now awkwardly hovering by the snack table." "That's not a disguise, Bill, that's just a personality trait you already have." "It's method acting." "Fine. Method-act your way to the gift table. Grab everything, stuff it in the bag, we're out before the piñata even cracks." Bill picked up a balloon from the prop bag and squeaked it thoughtfully. "What if a kid asks me to make a balloon dog?" "You're not the balloon artist. I'm the balloon artist." "But what if a kid asks *me*?" "Then you say you specialize in balloon *cats* and walk away." "Balloon cats aren't a thing." "They are now." Bill considered this, nodding slowly, as if Molly had just revealed some deep truth of the universe. "What's the getaway?" "Ice cream truck. Parked two blocks down. We load the presents, we drive off, we sell novelty popsicles to confused children for exactly four minutes as cover." "I love this plan." "You say that about every plan." "I loved the plan where we dressed as mascots for the mattress store." "That plan got us chased by actual security for six blocks." "Character building." Bill stood, stretching, cracking his knuckles with unnecessary drama. "One question. The cake. Are we talking chocolate, or—" "We are not stealing the cake, Bill." "I didn't say steal. I said *taste*. For quality control. What if the cake is evidence?" "Evidence of what?" "Deliciousness. Someone should verify it before we vanish into the night." Molly grabbed the blueprints, folding them with a sigh that carried the weight of every heist she'd ever planned with this man. "Fine. One bite." "One bite," Bill agreed, already reaching for his balloon-cat gloves.
created_at
"2026-07-01T07:42:52.569Z"
7/1/2026, 7:42:52 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1418
moderation_latency
(null)
generation_time
16662
tokens_prompt
77
tokens_completion
818
native_tokens_prompt
93
native_tokens_completion
1100
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011186
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_011agFeEAFmqdtUuaFCnFkGs"
is_byok
false
latency
1418
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891772-hNJZ8iOclEyvQWWpxzX6"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891772-hEFo4DXFZD1O56ZAjY5f"
upstream_id
"msg_011agFeEAFmqdtUuaFCnFkGs"
total_cost
0.011186
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
93.65%
Matches word count
n/a
neededClean
false
words
491
11.23%
Dialogue to Total Word Ratio
Ratio: 76.61%, Deviation: 6.61%
neededClean
false
wordsTotal
496
wordsDialogue
380
52.4401%