NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-02-08-01KGYZTR4T5R0Z20W7AWQ50DHG
Run Details
Model:
Claude Sonnet 4.5
Test:
Dialogue tags
Scenario:
Write 200 words with 50% dialogue
Duration:
7.2s
Cost:
$0.004650
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
20.19%
Matches word count
(details)
0.12%
Dialogue to Total Word Ratio
(details)
10.1541%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly pressed her face against the community center window. Inside, twenty screaming children attacked a piƱata shaped like a unicorn. "That's a lot of witnesses," Bill whispered. "They're six years old. Who's going to believe them?" Molly pulled out a crumpled invitation from her pocket. "The presents are in the back room. We go in during cake time." "What if someone asks who we are?" "We're the magicians." Bill glanced down at his black turtleneck and cargo pants. "Do we look like magicians?" "Fine. We're the backup magicians. The emergency magicians." Molly checked her watch. "Three minutes until candles. You ready?" "I still think this is our worst idea yet." "Worse than the dog show?" "I had to get rabies shots, Molly." "That was one bite. Stop being dramatic." She straightened her turtleneck. "Besides, we need this. Rent's due and that medieval fair job was a bust." "Maybe we should get real jobs." "And give up on our dreams?" Molly grinned. "Come on. Those presents aren't going to steal themselves." The birthday song erupted inside. Molly yanked open the door.
created_at
"2026-02-08T16:02:29.245406+00:00"
2/8/2026, 4:02:29 PM
model
"anthropic/claude-4.5-sonnet-20250929"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1265
moderation_latency
169
generation_time
7013
tokens_prompt
61
tokens_completion
251
native_tokens_prompt
75
native_tokens_completion
295
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00465
router
(null)
provider_responses
0
id
"b0611e67-292f-454a-a8b7-fa179721193a"
status
200
is_byok
false
latency
1265
endpoint_id
"d8bee848-f020-454a-b2be-4a7ce868f99f"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-sonnet-20250929"
api_type
"completions"
id
"gen-1770566541-mGIHWj66ueXBGbLK1Rv5"
upstream_id
"b0611e67-292f-454a-a8b7-fa179721193a"
total_cost
0.00465
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
20.19%
Matches word count
n/a
neededClean
false
words
180
0.12%
Dialogue to Total Word Ratio
Ratio: 66.11%, Deviation: 16.11%
neededClean
false
wordsTotal
180
wordsDialogue
119
10.1541%