NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWEA3QAX2B1M9DV7GH9SZZ1N
Run Details
Model:
Claude Sonnet 5
Test:
Dialogue tags
Scenario:
Write 200 words with 90% dialogue
Duration:
8.7s
Cost:
$0.004806
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
100.00%
Matches word count
(details)
97.17%
Dialogue to Total Word Ratio
(details)
98.5826%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
"Okay, so the intel says the kid's turning seven," Molly said, spreading a crumpled birthday invitation on the table. "Seven presents guaranteed minimum, easy." "You got the invitation from the trash again, didn't you," Bill said. "Dumpster diving is a skill, Bill." "It's a cry for help is what it is." "Focus. We go in disguised as the entertainment. I'll be the balloon lady, you're the magician." "Why do you get to be the balloon lady? Balloon animals are basically art. I should be the balloon lady." "You can't even tie a knot, Bill. You proposed to your ex-wife with a shoelace." "That was romantic!" "It was pathetic. You're the magician. Wear the cape." "I hate the cape." "The cape has pockets, Bill. Pockets for presents." "...Fine. But I get top billing on the flyer." "There's no flyer, this is a heist." "Every heist needs a flyer, Molly, that's just professionalism." "We are stealing gifts from a seven-year-old." "With style." "With a cape you hate." "I'm growing into the cape." "You're forty-one." "Growth doesn't stop at a number, Molly." "Just grab the presents and don't do actual magic tricks, last time you set the tablecloth on fire." "That was one time."
created_at
"2026-07-01T07:43:17.864Z"
7/1/2026, 7:43:17 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1590
moderation_latency
(null)
generation_time
8629
tokens_prompt
77
tokens_completion
332
native_tokens_prompt
93
native_tokens_completion
462
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.004806
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01ENaMWjPu9dEv5wgVs4oQRa"
is_byok
false
latency
1590
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891797-OaGjxzMSQ2nIambruvz9"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891797-2nHLnTsHJD4pq7GUA45c"
upstream_id
"msg_01ENaMWjPu9dEv5wgVs4oQRa"
total_cost
0.004806
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
100.00%
Matches word count
n/a
neededClean
false
words
201
97.17%
Dialogue to Total Word Ratio
Ratio: 94.12%, Deviation: 4.12%
neededClean
false
wordsTotal
204
wordsDialogue
192
98.5826%