NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE9RP9BM17QPVS1WMNFKK6H
Run Details
Model:
Claude Sonnet 5
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
17.2s
Cost:
$0.010496
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
99.92%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.9608%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the window of the beat-up sedan parked outside 1428 Maple Street, where a banner reading "HAPPY 7TH BIRTHDAY TYLER!" hung soggy and drooping over the garage door. Molly checked her watch, then adjusted the ridiculous clown wig sitting on the dashboard like a dead poodle. "I still don't understand why I have to be the clown," Bill said, staring at the wig with the disgust of a man asked to eat something that had once been alive. "Because I'm doing face painting, obviously," Molly said, popping open a compact mirror to check her makeup. "You think I spent four years in art school to juggle badly? No. You're the clown. I'm the artist." "You went to art school for two semesters." "Details." Bill sighed and picked up the wig like it might bite him. "And the plan again? Just so we're clear. Because last time you said 'the plan' and I ended up hiding in a dumbwaiter for three hours." "The plan is simple," Molly said. "We go in, we do our jobs, kids have the best birthday party of their tiny lives, and while everyone's distracted by cake and a magic show gone slightly, deliberately wrong, we load the presents into the utility closet. Then we load the utility closet into the van." "Slightly wrong how?" "You're going to saw a woman in half. Badly. On purpose. It'll buy us ten minutes minimum while the parents scream." "There's no woman." "There's a mannequin, Bill. Try to keep up." Bill finally jammed the wig onto his head, and it sat there like a furious orange bird had died mid-flight. "This is humiliating." "You look great. Very trustworthy. Very 'I would never steal your children's Nintendo Switches.'" "That's exactly what I look like I'd do." "Perfect, then." Molly snapped the compact shut. "Now, remember — we've got exactly forty minutes between the pizza arriving and the parents getting suspicious about their car being blocked in. Forty minutes to paint faces, saw a lady in half, and clear out every present in that living room." "And if someone's kid recognizes us from the news?" "They're seven, Bill. They think the news is boring people talking. Nobody's kid is watching a segment on 'Suburban Theft Ring Targets Birthday Parties.'" "There's a suburban theft ring targeting birthday parties?" "There is now." She grinned, tucking a fake mustache into her makeup kit for good measure — just in case someone got too close. "Chin up, buttercup. In forty minutes we'll be halfway to the state line with a van full of Legos and a very ruined magic show behind us." Bill stared at his reflection in the rearview mirror, the wig sagging tragically over one eye. "I hate this plan." "You hate every plan." "I hate this one *specifically*." Molly opened her door, letting in a gust of rain and the distant sound of children screaming with joy, completely unaware of what was coming. "Get out of the car, Bonzo. We've got a party to crash."
created_at
"2026-07-01T07:37:16.34Z"
7/1/2026, 7:37:16 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1683
moderation_latency
(null)
generation_time
17192
tokens_prompt
77
tokens_completion
763
native_tokens_prompt
93
native_tokens_completion
1031
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010496
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01AZYmnAMmGiu4yTTp1Pp9A3"
is_byok
false
latency
1683
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891436-rpYWqpxW7B1o9tv1DKd3"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891436-OLn1t75Ng7ys6cYQCChv"
upstream_id
"msg_01AZYmnAMmGiu4yTTp1Pp9A3"
total_cost
0.010496
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
99.92%
Matches word count
n/a
neededClean
false
words
503
0.00%
Dialogue to Total Word Ratio
Ratio: 64.55%, Deviation: 14.55%
neededClean
false
wordsTotal
505
wordsDialogue
326
49.9608%