NC Bench
Run: 2026-07-01-01KWE96QK13Q0MPA466JV8KBDT
Run Details
Model: Claude Sonnet 5 (Reasoning)
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 16.6s
Cost: $0.010736
Provider: Amazon Bedrock
Evaluation summary
Score      Evaluator                      Details Link
81.27%     Matches word count             (details)
0.00%      Dialogue to Total Word Ratio   (details)
40.6364%   Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
The rain hammered against the window of the Dingy Spoon Café, and Molly Fitzgerald hunched over a crumpled napkin like it held the secrets of the universe. In truth, it just had a badly drawn floor plan of a suburban house, complete with a stick figure labeled "BOUNCY CASTLE" and another labeled "CAKE - DO NOT TOUCH UNTIL AFTER."
Bill Hutchins slurped his coffee with all the grace of a man who had never once considered that other people existed. "So let me get this straight," he said, wiping foam from his mustache. "We're robbing a seven-year-old's birthday party."
"Timmy Baxter's birthday party," Molly corrected, as if the specificity made it more dignified. "And we're not robbing Timmy. We're robbing his parents. Rich parents. Parents who apparently bought their kid a drone collection worth more than my car."
"Your car is a bicycle."
"Exactly my point."
Bill leaned back, the chair creaking under the weight of his skepticism. "There's gonna be, what, twenty kids there? Parents? Maybe a clown?"
"Magician," Molly said. "I checked the invitation. Posted it on the community board like an idiot."
"Even worse. Magicians notice things. It's their whole deal."
"He does card tricks, Bill, not surveillance." She tapped the napkin. "Here's the plan. You dress as a party entertainer—"
"Absolutely not."
"—a very tall, very intimidating balloon artist—"
"Molly."
"—and you distract everyone in the backyard while I slip inside and load the presents into the laundry hamper I'm bringing."
Bill stared at her. "A laundry hamper."
"It has wheels."
"Molly, it's a child's birthday party. There will be, at minimum, four mothers who have never once looked away from their phones except to take pictures of said children. You cannot roll a hamper full of stolen Nintendo Switches past a woman live-streaming for her forty-two followers."
"Then you'll have to be *really* distracting."
"I don't know how to make balloon animals."
"Nobody does, Bill. That's the beauty of it. You just twist things aggressively and call it a giraffe."
He rubbed his temples, already regretting every decision that led him to this booth, this napkin, this woman. "And what if Timmy notices his presents are gone? Kids notice things. Especially about presents."
"We leave one gift. Something boring. Socks, maybe. A book about sharing."
"You want to traumatize a child with disappointment as a cover story?"
"I want to retire, Bill. This is retirement money. Do you know what people spend on children's parties these days? It's obscene. It's basically asking to be robbed."
Bill was quiet for a moment, staring into his coffee like it might offer a better plan. It didn't.
"Fine," he said finally. "But I'm not doing a giraffe. I'll do a sword. Everyone loves a balloon sword."
"That's the spirit," Molly said, already folding the napkin into her pocket like a woman who had just closed a business deal instead of plotting a felony against toddlers.
created_at
"2026-07-01T07:27:27.854Z"
7/1/2026, 7:27:27 AM
model
"anthropic/claude-sonnet-5-20260630"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
1798
moderation_latency
(null)
generation_time
16607
tokens_prompt
77
tokens_completion
781
native_tokens_prompt
93
native_tokens_completion
1055
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
service_tier
"default"
usage
0.010736
router
(null)
provider_responses
0
endpoint_id
"37b16f45-eeae-45f3-9b85-2137f7c6d61b"
id
"msg_01WQTxbPJTrxnSDSR1WXgNkx"
is_byok
false
latency
1795
model_permaslug
"anthropic/claude-sonnet-5-20260630"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.17.0; linux; x64))"
http_referer
(null)
request_id
"req-1782890847-srZPQuXBCuUbv1lhy1Dq"
session_id
(null)
preset_id
(null)
api_type
"completions"
id
"gen-1782890847-dw9ZKruoB6ky68V4Swxz"
upstream_id
"msg_01WQTxbPJTrxnSDSR1WXgNkx"
total_cost
0.010736
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
data_region
"global"
Evaluation details
Result     Evaluator                      Details                            Meta Data
81.27%     Matches word count             n/a                                neededClean: false, words: 488
0.00%      Dialogue to Total Word Ratio   Ratio: 58.54%, Deviation: 28.54%   neededClean: false, wordsTotal: 492, wordsDialogue: 288
40.6364%   Overall
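The figures in the evaluation details above can be cross-checked with a few lines of arithmetic. The sketch below is not NC Bench's evaluator code. It assumes the ratio is dialogue words divided by total words, that the deviation is the absolute gap from the 30% target in the scenario, and that the overall score is the plain mean of the two evaluator scores. The variable and function names are hypothetical. How the evaluators turn a deviation into a score is not shown on the page, so that step is left out.

```python
# Values copied from the evaluation details; names are illustrative only.
TARGET_RATIO_PCT = 30.0                # from the scenario: "30% dialogue"
words_total = 492                      # wordsTotal
words_dialogue = 288                   # wordsDialogue
evaluator_scores_pct = [81.27, 0.00]   # scores as displayed (rounded)


def dialogue_ratio_pct(dialogue: int, total: int) -> float:
    """Share of dialogue words in the response, as a percentage."""
    return 100.0 * dialogue / total


ratio = dialogue_ratio_pct(words_dialogue, words_total)
# Assumption: deviation is the absolute distance from the target ratio.
deviation = abs(ratio - TARGET_RATIO_PCT)
# Assumption: the overall score is the unweighted mean of evaluator scores.
overall = sum(evaluator_scores_pct) / len(evaluator_scores_pct)

print(f"Ratio: {ratio:.2f}%")
print(f"Deviation: {deviation:.2f}%")
print(f"Overall (from rounded scores): {overall:.3f}%")
```

Under these assumptions the ratio and deviation match the reported 58.54% and 28.54%. The mean of the displayed scores comes out slightly below the reported 40.6364%, which suggests the page averages unrounded scores. The two evaluators also report different word totals (488 and 492), so they appear to count words differently.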