NC Bench
Run: 2026-07-01-01KWE9JQECMQTNNBN1363EW1VC
Run Details
Model: Claude Sonnet 5
Test: Dialogue tags
Scenario: Write 200 words with 90% dialogue
Duration: 9.0s
Cost: $0.004866
Provider: Amazon Bedrock
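The 9.0s duration lines up with the generation_time of 9004 reported in the response metadata further down, which suggests that field is in milliseconds (an assumption, since the page does not state units). Under that assumption, a rough sketch of deriving output throughput from the reported token counts could look like this; the variable names are illustrative and not part of NC Bench:

```python
# Values copied from the run metadata on this page.
generation_time_ms = 9004        # assumed to be milliseconds (matches "Duration: 9.0s")
tokens_completion = 340          # normalized completion token count
native_tokens_completion = 468   # provider-native completion token count

seconds = generation_time_ms / 1000

# Completion tokens produced per second of generation time.
tokens_per_second = tokens_completion / seconds
native_tokens_per_second = native_tokens_completion / seconds

print(f"{tokens_per_second:.1f} tokens/s (normalized)")
print(f"{native_tokens_per_second:.1f} tokens/s (native)")
```

Which of the two token counts is the better basis depends on what is being compared; the native count reflects the provider's own tokenizer.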
Evaluation summary
Matches word count: 99.38%
Dialogue to Total Word Ratio: 51.17%
Overall: 75.2714%
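The overall score appears to be close to the plain average of the two evaluator scores. The sketch below checks that reading; the unweighted mean is an assumption, not something the page states, and the small gap to the reported 75.2714% is presumably due to the evaluator scores being displayed in rounded form.

```python
# Evaluator scores as displayed in the summary (percent, rounded by the page).
scores = {
    "Matches word count": 99.38,
    "Dialogue to Total Word Ratio": 51.17,
}

# Assumption: Overall is the unweighted mean of the evaluator scores.
overall = sum(scores.values()) / len(scores)

reported_overall = 75.2714
print(f"mean of displayed scores: {overall:.4f}%")
print(f"difference from reported: {abs(overall - reported_overall):.4f} points")
```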
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
AI
"Okay, so we go in through the bounce castle," Bill said. "Through the *bounce castle*? Bill, that's a physical barrier made of air. You don't sneak through a bounce castle, you bounce through it, loudly, while eight-year-olds scream." "They'll think I'm entertainment." "You're forty-seven and you smell like a gas station burrito." "Kids love burritos." "Nobody loves burritos that much. Focus. The presents are on the table by the cake. We grab the bag, we're out in ninety seconds." "Ninety seconds! Molly, do you know how many presents an eight-year-old gets these days? It's obscene. There's probably a drone in there." "Which is exactly why we're stealing it." "Right, right. Okay, but what about the clown?" "What clown?" "The invitation said 'Zippy the Clown, 3pm.' It's 2:58." "So we've got two minutes before some man named Zippy shows up and ruins everything." "Or helps us. Clowns carry big bags too. Nobody questions a second bag." "We are not disguising ourselves as clowns, Bill." "I already bought the noses." "You bought—Bill, we don't have time for noses!" "There's always time for noses, Molly. That's rule one." "That is not a rule. That has never been a rule." "It's my rule." "Just get in the car."
created_at: "2026-07-01T07:34:00.918Z"
model: "anthropic/claude-sonnet-5-20260630"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 1763
moderation_latency: (null)
generation_time: 9004
tokens_prompt: 77
tokens_completion: 340
native_tokens_prompt: 93
native_tokens_completion: 468
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
service_tier: "default"
usage: 0.004866
router: (null)
provider_responses[0]:
endpoint_id: "37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id: "msg_018wQcfKGzMNxx2PED823yo8"
is_byok: false
latency: 1762
model_permaslug: "anthropic/claude-sonnet-5-20260630"
provider_name: "Amazon Bedrock"
status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer: (null)
request_id: "req-1782891240-teMDcqMEIGqsSwvE6Nm6"
session_id: (null)
preset_id: (null)
api_type: "completions"
id: "gen-1782891240-MNxMRgzYW19uVIKjkNEW"
upstream_id: "msg_018wQcfKGzMNxx2PED823yo8"
total_cost: 0.004866
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
data_region: "global"
Evaluation details
Result: 99.38%
Evaluator: Matches word count
Details: n/a
Meta Data: neededClean: false; words: 205
Result: 51.17%
Evaluator: Dialogue to Total Word Ratio
Details: Ratio: 99.05%, Deviation: 9.05%
Meta Data: neededClean: false; wordsTotal: 210; wordsDialogue: 208
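The ratio and deviation figures follow directly from the two word counts and the 90% target in the scenario. The sketch below reproduces that arithmetic and adds one hypothetical way such counts could be obtained, by treating text inside double quotes as dialogue. The helper `dialogue_word_counts` is an illustration only; the evaluator's actual counting method is not shown on this page, and how the 51.17% score is derived from the deviation is not stated either.

```python
import re

def dialogue_word_counts(text):
    """Hypothetical counter: words inside straight double quotes vs. all words."""
    quoted = re.findall(r'"([^"]*)"', text)
    words_dialogue = sum(len(q.split()) for q in quoted)
    words_total = len(text.split())
    return words_dialogue, words_total

# Small illustration of the helper on one line of the scene.
sample = '"Just get in the car," Bill said.'
sample_counts = dialogue_word_counts(sample)  # (5, 7)

# Counts reported by the evaluator for this run.
words_total = 210
words_dialogue = 208
target_ratio = 0.90  # from the scenario: "90% dialogue"

ratio = words_dialogue / words_total
# Assumption: deviation is the absolute gap to the target, in percentage points.
deviation = abs(ratio - target_ratio)

print(f"Ratio: {ratio:.2%}, Deviation: {deviation:.2%}")
# prints "Ratio: 99.05%, Deviation: 9.05%"
```

In other words, the response overshot the dialogue target: nearly every word sits inside quotation marks, so the scene is about nine points more dialogue-heavy than requested.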
Overall: 75.2714%