NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZTY6ZS9FPBSDS2THKE3M65
Run Details
Model:
Claude Opus 4.7
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
16.3s
Cost:
$0.026990
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
99.74%
Matches word count
(details)
0.00%
Dialogue to Total Word Ratio
(details)
49.8722%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the sticky diner table, elbowing aside a plate of congealing hash browns. The paper was covered in crayon. Red crayon. Some of it was shaped like a unicorn. "This is the best I could do," she said, defensive already. "Tommy's mom wouldn't let me into the actual house, so I had Tommy draw it from memory during recess." Bill stared at the document. There was a smiley sun in the corner. The front door was labeled DOOR. The backyard was labeled GRASS PLACE. "Tommy is six." "Tommy is seven, and he's very spatially gifted for his age." Bill pinched the bridge of his nose the way he did when Molly suggested things like *let's just pretend to be caterers* or *what if we dressed the getaway car as a bush*. He took a long, suffering sip of his coffee. "Walk me through it," he said. Molly brightened and produced a laser pointer from her jacket. Bill did not ask why she owned a laser pointer. He had learned. "Okay. So. The party starts at two. Twenty kids, one bouncy castle, a piñata shaped like Elsa—" "Who's Elsa." "Bill. *Bill.* We've had this conversation." "I don't keep up with the princesses." "She's the ice one. She's been the ice one for a decade." Molly clicked the laser at the crayon floor plan. "Presents go on this table, here, next to the GRASS PLACE. At three-fifteen, Tommy's mom does the cake. That's our window. Everybody sings, nobody's watching the loot, and we—" "We what." "We walk in dressed as clowns." Bill set his coffee down very carefully, the way a man sets down something he would otherwise throw. "No." "Bill." "*No.*" "Hear me out. A clown at a kid's party is invisible. It's camouflage. It's a ghillie suit of whimsy." "I'm not wearing the shoes." "You don't have to wear the shoes, you just have to wear the nose—" "I'm not wearing the nose." "Then you're the balloon guy. Balloon guys wear normal clothes and just, you know—" she made a twisting motion, "—swan, swan, giraffe, swan." "I can't make a balloon animal, Molly. I can barely tie my own shoes since the incident." "We don't talk about the incident." "*You* don't talk about the incident. I talk about it constantly. It comes up in therapy." Molly rolled up the blueprint with the decisiveness of a general. "Fine. New plan. You're the magician. Magicians get a cape, and a cape has *pockets.*" Bill considered this. A cape did have pockets. A cape had a great many pockets, if you were creative about the lining. "How many presents are we talking?" "Party favor bags included? Forty, maybe fifty units. Plus the big one from the grandparents, which Tommy says is, quote, *enormous and shaped like a horse.*" "I hate this job." "You love this job." He didn't answer, because she was right, and because he was already mentally measuring himself for a cape.
created_at
"2026-04-24T13:30:17.476Z"
4/24/2026, 1:30:17 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
816
moderation_latency
(null)
generation_time
16208
tokens_prompt
77
tokens_completion
760
native_tokens_prompt
98
native_tokens_completion
1060
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.02699
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"725e17fd-b8d8-4e65-a7ee-7cf44057920f"
is_byok
false
latency
816
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037417-2r12BR8AS2LzDut8ouAH"
session_id
(null)
api_type
"completions"
id
"gen-1777037417-wKL1TS5f04jvkci9H1z6"
upstream_id
"725e17fd-b8d8-4e65-a7ee-7cf44057920f"
total_cost
0.02699
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
99.74%
Matches word count
n/a
neededClean
false
words
496
0.00%
Dialogue to Total Word Ratio
Ratio: 56.54%, Deviation: 26.54%
neededClean
false
wordsTotal
497
wordsDialogue
281
49.8722%