NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE8QM8KYYZR7H7QPX8PVRN4
Run Details
Model:
Claude Sonnet 5 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
17.7s
Cost:
$0.011886
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
98.71%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.3562%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the window of the abandoned laundromat where Molly and Bill had set up their command center, which was really just a folding table and a stolen corkboard covered in photographs of a suburban backyard. "So let me get this straight," Bill said, squinting at the blueprint Molly had drawn in crayon. "We're robbing a seven-year-old's birthday party." "Not robbing," Molly said, offended. "Liberating. Those presents don't belong to anybody yet. They haven't even been opened." "That's literally the definition of belonging to someone, Molly." "Semantics." She jabbed a finger at the crayon map. "Here's the plan. Bounce house is the primary distraction. Once it deflates—" "Wait, you're planning to deflate the bounce house?" "I brought a knife." "You brought a *knife* to a *children's* party?" "For the bounce house, Bill, not the children. Try to keep up." Molly rolled her eyes and continued tracing the crayon lines with her finger. "While everyone's screaming about the bounce house, we slip in through the side gate dressed as clowns—" "Absolutely not. I refuse to be a clown." "You already refused to be a magician, a piñata, and 'the guy who brings extra napkins.' We're running out of costume options, Bill." "Because none of those are *jobs* people have! Why can't we just be caterers?" "Because caterers don't carry sacks, and we need sacks for the presents." Bill dragged a hand down his face. "Fine. Fine! I'll be a clown. But I'm not doing balloon animals. I don't know how to make balloon animals." "Nobody's asking you to make balloon animals, Bill, we're stealing presents, not running a kiddie side business—although," Molly paused, tapping her chin, "that's actually not a bad cover." "No. Absolutely not. I am not learning balloon animals for this heist." "You could learn one. Just a dog. Everyone loves a balloon dog." "I once tried to make a balloon dog and it exploded and a toddler cried for forty-five minutes. I still have nightmares." Molly sighed dramatically, the sound echoing off the empty washing machines. "Fine, no balloon animals. Focus. Once we're inside, you grab the presents from the gift table, I create a diversion by pretending to be the entertainment—" "Pretending to be *what* entertainment?" "I don't know, I'll figure it out! Maybe I'll juggle." "Can you juggle?" "How hard can it be? It's just throwing things and catching them, occasionally." "That's not juggling, Molly, that's just *panic*." "Same thing, different name." She rolled up the blueprint and tucked it under her arm like a general preparing for war. "The point is, by the time anyone notices the presents are gone, we'll be three blocks away splitting the loot." Bill stared at her for a long moment. "We are going to prison for stealing a seven-year-old's Nintendo Switch." "We are going to prison for stealing a seven-year-old's *joy*, Bill. Get it right." Molly grabbed her coat off the chair. "Now go find us some clown shoes. Preferably ones that don't squeak. I want this to be a *quiet* heist."
created_at
"2026-07-01T07:19:12.926Z"
7/1/2026, 7:19:12 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3145
moderation_latency
(null)
generation_time
17652
tokens_prompt
77
tokens_completion
871
native_tokens_prompt
93
native_tokens_completion
1170
native_tokens_completion_images
(null)
native_tokens_reasoning
62
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011886
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01PyDKkchg41kL8kAcRQpiYK"
is_byok
false
latency
1761
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890352-IfCIxeOp16WZZrh29MB0"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890352-rf3k4aE92MeRPI9QBWtG"
upstream_id
"msg_01PyDKkchg41kL8kAcRQpiYK"
total_cost
0.011886
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
98.71%
Matches word count
n/a
neededClean
false
words
506
0.00%
Dialogue to Total Word Ratio
Ratio: 75.05%, Deviation: 25.05%
neededClean
false
wordsTotal
513
wordsDialogue
385
49.3562%