NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE9FFEBWTZJ1A3906WJ6TZZ
Run Details
Model:
Claude Sonnet 5 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
18.3s
Cost:
$0.011786
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
0.01%
Matches word count
(details)
0.87%
Dialogue to Total Word Ratio
(details)
0.4386%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back room of Bill's van smelled like stale coffee and regret. Molly spread a crumpled birthday invitation across a pizza box, tapping it with a highlighter like it was a blueprint for Fort Knox. "Okay," she said. "Cody's turning seven. Superhero theme. Party's at two." Bill squinted at the invitation. "Cody. Who names their kid Cody." "Focus, Bill." "I'm focused. I'm just saying, if I ever have a kid, I'm naming him something respectable. Like Bartholomew." "You're not having kids, Bill. You once cried because a vending machine ate your dollar." "That machine owed me a Snickers." Molly rubbed her temples. "The presents will be stacked on the table by the cake. We go in disguised, we grab the bag, we're out before anyone notices." "Disguised as what?" "Clowns." Bill stared at her. "Clowns. At a superhero party." "Kids love clowns." "Kids are *terrified* of clowns, Molly, that's, like, a documented fact. There's a whole Stephen King book about it." "Fine. Superheroes, then. We dress as superheroes." "Which ones?" "I don't know, pick one." "I've always wanted to be Batman." "You don't have the jaw for Batman." "Excuse me?" "You have a very *soft* jawline, Bill. It's not an insult, it's bone structure." "I'll have you know women find my jaw very—" "We don't have time for your jaw's dating history. Pick a different hero." Bill crossed his arms. "Fine. I'll be Aquaman." "Nobody's scared of Aquaman." "We're not trying to scare anyone! We're trying to *blend in!*" "Right, but we also need authority. Gravitas. Someone the parents trust enough to let near the presents." "So... what, you're gonna be Wonder Woman?" Molly straightened, suddenly proud. "Obviously." "You don't have the arms for Wonder Woman." "Excuse me?!" "I'm just saying, those bracelets require *definition,* Molly." "Say that again and I will end you before this heist even starts." Bill raised his hands. "Just an observation!" Molly exhaled sharply, forcing herself back to the plan. "Okay. New approach. We don't dress as heroes. We dress as caterers. Nobody questions caterers." "I don't know how to caterer." "You just carry trays and look mildly annoyed. You already do that." "That's rude and accurate." "We walk in, we grab the presents under the guise of 'gift table maintenance,' we walk out. Clean. Simple. No capes." Bill considered this, tapping his chin. "What if one of the presents is, like, a bike. That's not subtle, Molly. That's not a 'slip it in a bag' scenario." "Then you carry the bike." "Like it's a tray?" "Like it's a tray, Bill." He sighed, staring at the ceiling of the van as though searching for strength. "You know, when I got into a life of crime, I pictured jewels. Yachts. Not doilies and juice boxes." Molly patted his shoulder. "Welcome to the big leagues, Aquaman."
created_at
"2026-07-01T07:32:14.424Z"
7/1/2026, 7:32:14 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
2811
moderation_latency
(null)
generation_time
18259
tokens_prompt
77
tokens_completion
819
native_tokens_prompt
93
native_tokens_completion
1160
native_tokens_completion_images
(null)
native_tokens_reasoning
44
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011786
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01AXMqwm3ziw7E72MwGgTXY7"
is_byok
false
latency
1624
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891134-BgfqfNvvdCAY1XwnE2WH"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891134-8PJggWppLU5m9YosYD12"
upstream_id
"msg_01AXMqwm3ziw7E72MwGgTXY7"
total_cost
0.011786
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
0.01%
Matches word count
n/a
neededClean
false
words
469
0.87%
Dialogue to Total Word Ratio
Ratio: 79.74%, Deviation: 9.74%
neededClean
false
wordsTotal
469
wordsDialogue
374
0.4386%