NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-07-01-01KWE9HBTXDCJFT2VRBF714Q0S
Run Details
Model:
Claude Sonnet 5
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
16.6s
Cost:
$0.010776
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
99.38%
Matches word count
(details)
66.59%
Dialogue to Total Word Ratio
(details)
82.9850%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The back room of Bill's apartment smelled like stale coffee and desperation, which Molly figured was appropriate given the plan currently spread across the table like a crime scene. "A kid's birthday party," Molly said, not for the first time. "You want us to rob a kid's birthday party." "Not the kid," Bill said, jabbing a finger at his crude drawing of a house that looked more like a lopsided potato. "The presents. There's a difference." "Is there?" "Molly. Molly, listen to me. This isn't some rando's kid. This is Bradley Fenwick's kid. You know who Bradley Fenwick is." "The guy who owns half the marina?" "The guy who owns half the marina. You think a man like that throws a birthday party with a clown and a bouncy castle? No. He throws a birthday party with a Rolex for a seven-year-old. With gift cards that have more zeroes than my last three jobs combined." Molly crossed her arms. "So we're stealing birthday money from a seven-year-old." "We're stealing birthday money from Bradley Fenwick. The kid's just collateral damage." "That's a terrible thing to say." "I know, I heard it as I was saying it, and yet here we are." Bill tapped the drawing again. "Look. Party starts at two. Ends at five. There's a two-hour window where every present that kid gets is sitting on a table in the backyard, completely unguarded, while forty drunk parents pretend to enjoy a magician." "How do we get past security?" "Costumes." "What kind of costumes?" "Clown costumes." Molly stared at him. "You want us to dress as clowns and rob a clown-themed birthday party." "It's not clown-themed, I just think clowns are a good disguise." "For a children's party. Where there is already going to be a real clown." Bill paused. "...Two clowns isn't that weird." "Bill." "Kids love clowns! Nobody questions a clown! We walk in, we juggle a little, we compliment the cake, we walk out with a sack full of stolen gift cards, everybody wins." "Except the kid." "Except the kid, yes, obviously, that's the whole point of the heist, Molly, keep up." Molly rubbed her temples. "And if the real clown notices there's suddenly two extra clowns at this party?" "We tell him we're with the second act." "There's a second act?" "There is now." She sat down slowly, dragging a chair across the linoleum with a screech that made Bill wince. "This is, hands down, the stupidest plan you've ever pitched me." "Worse than the yacht?" "Different kind of stupid than the yacht." "So you're in." "I hate that I'm in." Bill grinned and slid a red rubber nose across the table toward her. "Welcome aboard, Ms. Chuckles." Molly picked it up, turned it over in her fingers, and sighed the sigh of a woman who had made several bad decisions in her life and was about to make one more. "If we get arrested wearing clown costumes, I'm blaming you specifically. By name. To the judge." "Fair," Bill said. "Very fair."
created_at
"2026-07-01T07:33:16.264Z"
7/1/2026, 7:33:16 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1555
moderation_latency
(null)
generation_time
16522
tokens_prompt
77
tokens_completion
780
native_tokens_prompt
93
native_tokens_completion
1059
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010776
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_012psUDtmA975XgHCPzgquQ2"
is_byok
false
latency
1555
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782891196-FRYbxMjkJpLptZAg8qtx"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782891196-dHqFkXJsxiPamsQt2vbq"
upstream_id
"msg_012psUDtmA975XgHCPzgquQ2"
total_cost
0.010776
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result
Evaluator
Details
Meta Data
99.38%
Matches word count
n/a
neededClean
false
words
505
66.59%
Dialogue to Total Word Ratio
Ratio: 72.85%, Deviation: 2.85%
neededClean
false
wordsTotal
512
wordsDialogue
373
82.9850%