NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE8Z0RWTVYTJ409VN9QQW26
Run Details
Model:
Claude Sonnet 5 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
18.4s
Cost:
$0.011086
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
86.38%
Matches word count
(details)
49.74%
Dialogue to Total Word Ratio
(details)
68.0592%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back of the van smelled like old pizza and desperation, which Bill figured was appropriate given their line of work. "So the intel is solid," Molly said, spreading a crumpled piece of paper across her knees. "Timmy Henderson. Turning seven. Superhero theme." "Superhero theme," Bill repeated. "You want us to rob a bunch of six-year-olds dressed as Batman." "I want us to rob the presents, Bill. The presents are the target. The children are incidental." "Nothing about robbing children feels incidental." "We're not robbing children. We're robbing a table. A table with wrapped boxes on it. The children happen to be nearby, dancing to music that will haunt my nightmares." Bill rubbed his temples. "Remind me why we're doing this one." "Because Mrs. Henderson registered for a PlayStation, two hundred dollars in gift cards, and according to my source, an actual gold-plated Nintendo Switch from the wealthy grandmother." "Your source being—" "The clown. Bobo owed me money." "You're blackmailing a clown." "I'm collecting a debt from a clown. There's a difference." Bill stared out the windshield at the suburban house, balloons tied to the mailbox, a bounce castle inflating in the yard like some kind of vinyl monster waking up. "This feels different from the casino job." "Every job feels different from the casino job. That's why we do different jobs." "No, I mean—there's face paint involved, Molly. I don't do face paint." "You're not wearing face paint. You're wearing a costume." "What costume." Molly reached into the back seat and produced, with the flourish of a magician revealing a much worse trick, a spandex superhero suit in a shade of green Bill could only describe as "traffic cone." "Absolutely not." "You'll be Captain Justice. I made it myself." "I refuse." "The suit gets us inside, Bill. Parents love it when the entertainment shows up. We do a little routine, we mingle by the present table, we make our move during cake." "And what are you wearing?" Molly held up a sequined mask and a cape that had definitely once been a tablecloth. "The Sparkle Bandit. I'm improvising." "That's not even a real superhero." "Six-year-olds don't fact-check, Bill." He looked at the suit again, then at her, then back at the suit, the fabric practically glowing with bad decisions. "If anyone films this—" "No one's filming this. Everyone's filming Timmy." "If this ends with us on the news—" "It'll end with us four thousand dollars richer and never speaking of the tights again." Bill sighed, the long, suffering sigh of a man who had once robbed a bank vault with a nail file and somehow felt this was worse. He took the suit. "I want it on record that I hate this." "Noted. Now hurry up, cake's at three, and Bobo said the grandmother's driving separately with the Switch in her purse." "Bobo again." "Bobo's very reliable for a man who lies to children for a living."
created_at
"2026-07-01T07:23:15.114Z"
7/1/2026, 7:23:15 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
3091
moderation_latency
(null)
generation_time
18354
tokens_prompt
77
tokens_completion
833
native_tokens_prompt
93
native_tokens_completion
1090
native_tokens_completion_images
(null)
native_tokens_reasoning
46
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.011086
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01CHEzNWQVUb5bVqjfqKyKhL"
is_byok
false
latency
1898
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890595-HqWxN6wJnKMDcEMvsZ5I"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890595-rbqoTMJ6pXRi6TrgfDpk"
upstream_id
"msg_01CHEzNWQVUb5bVqjfqKyKhL"
total_cost
0.011086
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
86.38%
Matches word count
n/a
neededClean
false
words
489
49.74%
Dialogue to Total Word Ratio
Ratio: 66.26%, Deviation: 3.74%
neededClean
false
wordsTotal
495
wordsDialogue
328
68.0592%