NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE967FH9RZZ7MVFGRBH8C9W
Run Details
Model:
Claude Sonnet 5 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
16.9s
Cost:
$0.010606
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
95.99%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
47.9934%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the window of the van, which smelled like old fries and Bill's cologne, a combination that had never once improved with age. Molly sat cross-legged on a milk crate, studying a crayon drawing of a birthday cake like it held the secrets of the universe. "Explain to me again," Bill said, "why we're casing a seven-year-old's birthday party." "Presents, Bill. Do you know how many presents a kid gets at a party with forty invitees? We're talking Nintendo Switches. Drones. That one kid probably owns a hoverboard." "He's turning seven, not knighting himself." "Rich parents overcompensate. It's basic psychology." Bill rubbed his temples like he could massage the bad decisions right out of his skull. He'd known Molly for eleven years, ever since she picked his pocket at a state fair and then bought him a corndog with his own money out of guilt. Somehow that had seemed charming at the time. "There will be children there, Molly. Actual small humans. With parents. And probably a magician." "A magician works in our favor. Misdirection is literally his whole job." "That's not—" Bill stopped, because arguing logic with Molly was like arguing with a raccoon about garbage. Technically winnable, but not worth the scratches. "Here's the plan," she said, unfolding a hand-drawn map that looked suspiciously like it had been sketched by someone's actual first grader, complete with a stick figure labeled BILL DUMMY. "We dress as clowns." "Absolutely not." "We dress as clowns, we do a little juggling, some balloon animals, classic misdirection, and while everyone's watching us make a poodle out of latex, we slip the gift table into these." She produced two enormous duffel bags with the enthusiasm of a woman unveiling the Mona Lisa. "Those presents are wrapped, Molly. Wrapped presents make noise. Wrapped presents have name tags. Wrapped presents belong to seven-year-olds who will absolutely notice." "Which is why we go in fast, go out faster, and by the time anyone realizes the gift table's empty, we're three counties over eating pancakes." "You've thought about the pancakes but not the getaway route?" "I think about pancakes first. Priorities, Bill." He stared at the clown wig she'd already produced from somewhere—God knew where, the woman kept props like other people kept spare change—and felt the familiar, sinking sensation of a bad idea gaining momentum. "If I say yes," he said slowly, "we are not, under any circumstances, taking the cake." "The cake wasn't even on the list." "It's on your list. I can see it circled in red." "That's for after. Personal use." "Molly." "Bill, focus. The bounce house alone tells me there are at least four unsupervised gift bags near the exit. We walk in, we juggle, we walk out rich. Clowns aren't suspicious, Bill. Clowns are memorable for the wrong reasons." He picked up the wig anyway, defeated, and somewhere in the back of his mind, he was already tasting pancakes.
created_at
"2026-07-01T07:27:11.355Z"
7/1/2026, 7:27:11 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1571
moderation_latency
(null)
generation_time
16868
tokens_prompt
77
tokens_completion
781
native_tokens_prompt
93
native_tokens_completion
1042
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010606
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01BbCnrsQYD7Sp9ELKQxUR7B"
is_byok
false
latency
1570
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890831-CqZgKWCUMH7dLAIAti5P"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890831-x7MEqUxq7rdSwmihzdZs"
upstream_id
"msg_01BbCnrsQYD7Sp9ELKQxUR7B"
total_cost
0.010606
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
95.99%
Matches word count
n/a
neededClean
false
words
492
0.00%
Dialogue to Total Word Ratio
Ratio: 53.82%, Deviation: 23.82%
neededClean
false
wordsTotal
498
wordsDialogue
268
47.9934%