NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWEA3WYKCR8G13NVB49SMJA4
Run Details
Model:
Claude Sonnet 5 (Reasoning, Low)
Test:
Dialogue tags
Scenario:
Write 500 words with 50% dialogue
Duration:
18.9s
Cost:
$0.011416
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
99.74%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.8737%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The van reeked of stale coffee and desperation, which Molly figured was appropriate given their current career trajectory. She spread the blueprints across the dashboard—actual blueprints, because Bill had insisted on stealing them from the party planner's office instead of just asking for the address like a normal person. "Explain to me again," Molly said, "why we're robbing a seven-year-old's birthday party." "Because," Bill said, tapping the paper with unnecessary drama, "Timmy Henderson's parents are loaded. Tech money. You know how much stuff gets crammed into one of these parties?" "I know it's going to be a Bounce house situation, Bill. There will be children." "There will be presents, Molly. Presents unattended by responsible adults who are all distracted by mimosas and passive-aggressive conversations about preschool applications." Molly rubbed her temples. "This is different from banks. Banks don't have clowns." "I've accounted for the clown." "You've accounted for the clown." "Rodrigo. He's a friend of mine. Retired from clowning, actually, moved into balloon animal consulting, but he owes me a favor." "What kind of favor makes a man put on a clown suit again?" "The kind where I didn't tell his wife about Reno." Molly stared at him. "I don't want to know." "Good, because I wasn't going to tell you." She pulled the ski masks from her bag—small, unfortunately, because that's all the store had, and now they both looked like they were about to rob a preschool, which, well. "These masks make us look like we're stealing juice boxes." "We ARE stealing juice boxes. Among other things." "Bill. We are adults. We are stealing birthday presents from children." "We're stealing UNOPENED birthday presents from children who won't remember what they got in six months anyway. It's basically charity. For us." "That's not how charity works." "It's charity-adjacent." Bill unfolded a second sheet, this one covered in his terrible handwriting, arrows pointing every direction like a conspiracy theorist's fever dream. "Okay. Rodrigo distracts the parents with balloon swords. You and I slip into the gift room—there's always a gift room, it's very organized, rich people are weirdly organized about their chaos—and we load the bags." "And the getaway?" "Ice cream truck." Molly blinked. "We're stealing an ice cream truck?" "We're borrowing one. Temporarily. From my cousin Dave." "Does Dave know?" "Dave will know." "That's not the same thing, Bill." "Look, do you want the money or do you want to interrogate my logistics?" Molly considered this. Considered the mortgage payment sitting on her kitchen table with increasingly aggressive red stamps. Considered the fact that Bill's plans, despite everything, somehow always worked, in the same way a shopping cart with a bad wheel somehow always makes it to the checkout. "Fine," she said. "But if we get chased by a bounce house, I'm blaming you specifically." "Deal. Also, wear comfortable shoes. There might be a piñata." "Why would there be running involved with a piñata?" Bill just grinned, and started the van.
created_at
"2026-07-01T07:43:23.615Z"
7/1/2026, 7:43:23 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3819
moderation_latency
(null)
generation_time
18861
tokens_prompt
77
tokens_completion
874
native_tokens_prompt
93
native_tokens_completion
1123
native_tokens_completion_images
(null)
native_tokens_reasoning
39
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011416
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01SVMC5rVQKRVGcFqwdiHxnR"
is_byok
false
latency
2821
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891803-JNhHq5jv249YqSBQCR2y"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891803-Za6sQEVkeimQeM3wh2qF"
upstream_id
"msg_01SVMC5rVQKRVGcFqwdiHxnR"
total_cost
0.011416
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
99.74%
Matches word count
n/a
neededClean
false
words
496
0.00%
Dialogue to Total Word Ratio
Ratio: 64.40%, Deviation: 14.40%
neededClean
false
wordsTotal
500
wordsDialogue
322
49.8737%