NC Bench
Overview
About
Tests
Models
Model
Claude Sonnet 4.5
Test
Dialogue tags
Scenario
Write 200 words with 10% dialogue
Run ID
2026-02-08-01KGYXVNBTWGS9SQ93R6AR6S8C
Model input
User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly studied the surveillance photos spread across the table, images of a suburban backyard festooned with balloons and streamers. Bill munched potato chips beside her, scattering crumbs across their operational intelligence. "Saturday at three," Molly said, tapping a balloon-shaped invitation. The target was innocent enough: little Timmy Henderson's eighth birthday party. But intelligence suggested the gifts would include the latest gaming console, still unavailable in stores, worth a fortune on the secondary market. Bill wiped grease on his jeans and leaned closer to examine the layout. The present table sat dangerously close to the bouncy castle, their primary infiltration point. Molly had already acquired two clown costumes from a bankrupt circus. "Parents will be drunk on mimosas by four," Bill noted. Their window was narrow. Enter during pin-the-tail-on-the-donkey, exit before cake. Molly had timed it perfectly during reconnaissance at the neighbor's party last week. She rolled up the photos and checked her watch. Twenty-three hours until showtime. Bill finished the chips and crumpled the bag loudly. Molly glared at him. Stealth was clearly not his strong suit, but he could fit through small windows, and that counted for something in this business.
created_at
"2026-02-08T15:28:02.794827+00:00"
Sunday, February 8th, 2026 at 3:28:02 PM GMT+00:00
model
"anthropic/claude-4.5-sonnet-20250929"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1516
moderation_latency
172
generation_time
8002
tokens_prompt
61
tokens_completion
248
native_tokens_prompt
75
native_tokens_completion
293
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.00462
router
(null)
provider_responses
0
id
"732e8a16-d180-4196-a8de-fa1d98df9495"
status
200
is_byok
false
latency
1516
endpoint_id
"d8bee848-f020-454a-b2be-4a7ce868f99f"
provider_name
"Amazon Bedrock"
model_permaslug
"anthropic/claude-4.5-sonnet-20250929"
api_type
"completions"
id
"gen-1770564474-WOh53rM9MG1yW94ZguWd"
upstream_id
"732e8a16-d180-4196-a8de-fa1d98df9495"
total_cost
0.00462
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
99.9990%
Matches word count
n/a
neededClean
false
words
199
96.0782%
Dialogue to Total Word Ratio
Ratio: 5.53%, Deviation: 4.47%
neededClean
false
wordsTotal
199
wordsDialogue
11
98.0386%