NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE9STB2B0N684HXZZQD0VWP
Run Details
Model:
Claude Sonnet 5 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
17.2s
Cost:
$0.010976
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
0.00%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
0.0000%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The warehouse smelled like stale popcorn and desperation, which Molly figured was appropriate given their target. She spread the blueprints across the card table, weighted down on one corner by a half-empty can of Red Bull. "So here's the layout," she said, tapping the paper. "Kid's turning seven. Party's in the backyard. Presents are on a table by the bounce house." Bill leaned over, squinting. "This is a drawing of a house made of crayons." "It's to scale." "It says 'Timmy's Awesome Party' in glitter glue." "I had to improvise, okay? I couldn't exactly ask the parents for architectural plans." Bill straightened up, arms crossed. "Molly, we're planning to rob seven-year-olds." "We're not robbing the kids. We're robbing the presents. The kids don't even own them yet. Legally speaking, it's basically shoplifting from Toys R Us with extra steps." "That's not how legality works." "Do you want the vintage comic book collection or not? Timmy's uncle apparently has money and terrible gift-wrapping skills." Bill hesitated, which Molly recognized as the sound of a man doing moral math and losing. "Fine. What's the plan?" "We dress as clowns." "Absolutely not." "Bill." "I did clowns last time. I still have nightmares about that unicycle." "Fine, we dress as the bounce house delivery guys. Nobody questions bounce house guys. They're basically invisible, like mall Santas or DMV employees." Bill considered this. "That could work. Where do we even get uniforms?" "I stole two from an actual bounce house company last week." "You planned ahead? Who are you?" "I contain multitudes, Bill." He picked up the crayon drawing again, turning it sideways as if that might help. "So we walk in, take the presents, walk out. That's it?" "That's it. Sixty seconds, tops. In, out, no eye contact with anyone under four feet tall." "What if a kid asks us something?" "Kids don't ask bounce house guys questions. They ask if they can go in the bounce house. You say 'not yet, buddy,' and they lose interest immediately. It's foolproof." "And the parents?" "Parents at a seven-year-old's birthday party are either drunk on mimosas or negotiating a custody-adjacent argument about whose turn it is to watch the cake. Nobody's watching us." Bill sat down slowly, defeated by logic he didn't like. "Why do I feel like this is somehow worse than the bank job." "Because banks have security cameras and this has a inflatable castle full of unsupervised children. Higher stakes, lower dignity." "Great. Love that for us." Molly folded the crayon map and slipped it into her jacket. "We go in twenty minutes. Wear the uniform. Smile a lot. And Bill—" "What?" "Do not, under any circumstances, agree to do the limbo." "When has that ever come up?" "It always comes up, Bill. It always comes up."
created_at
"2026-07-01T07:37:53.262Z"
7/1/2026, 7:37:53 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2491
moderation_latency
(null)
generation_time
17170
tokens_prompt
77
tokens_completion
792
native_tokens_prompt
93
native_tokens_completion
1079
native_tokens_completion_images
(null)
native_tokens_reasoning
37
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010976
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01RP9VZP2A3gvtB5uSdTFD4N"
is_byok
false
latency
1591
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891473-FnyJ88NnbZInXESGx7aO"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891473-tK6zaYjSC22Ur41fNGGj"
upstream_id
"msg_01RP9VZP2A3gvtB5uSdTFD4N"
total_cost
0.010976
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
0.00%
Matches word count
n/a
neededClean
false
words
464
0.00%
Dialogue to Total Word Ratio
Ratio: 77.49%, Deviation: 27.49%
neededClean
false
wordsTotal
471
wordsDialogue
365
0.0000%