NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KPZTWS9PXY9K18E6884VTS05
Run Details
Model:
Claude Opus 4.7
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
16.2s
Cost:
$0.028940
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
14.30%
Matches word count
(details)
3.57%
Dialogue to Total Word Ratio
(details)
8.9369%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the sticky diner table, nudging aside a plate of half-eaten pancakes. Bill squinted at the crayon drawings. "These are the schematics?" he asked. "My niece drew them. She's seven. She's been to the house." "Molly, there's a unicorn on the roof." "That's a satellite dish. Focus, Bill." Bill traced a shaky line with his finger. "And this?" "Back entrance. Leads through the kitchen." "It says 'dragon lair.'" "That's the dad's office. He has a beard. Look, the point is, we go in through here, past the bouncy castle, grab the gift table, and we're out in ninety seconds." Bill leaned back, arms crossed. "I'm not robbing a seven-year-old, Molly." "She's turning eight. And we're not robbing her. We're relocating unopened merchandise." "That's the same thing with extra words." "Bill. Do you know what kids get at birthday parties these days? iPads. Drones. Actual gold-plated fidget spinners. Last month, some kid in Scarsdale got a pony." "We're stealing a pony?" "We're not stealing a pony, Bill, we can't fit a pony in the Civic." "Then what's the plan?" Molly pulled out a clown nose and a floppy red wig. "You're Giggles." "Absolutely not." "I'm Sparkles. I do the balloon animals." "You can't do balloon animals." "I can do a snake." "A snake is just a balloon, Molly." "It's a long balloon. That's the whole trick." She tossed him the wig. "You distract the kids with magic while I load the gifts into the van." "I don't know magic." "You know that card trick." "The one where I drop the cards and swear?" "The kids'll love it. Kids love chaos." Bill held the wig between two fingers like it was radioactive. "And the parents?" "Sheet cake. Nobody pays attention to anything when there's sheet cake." "You're basing an entire operation on sheet cake." "I'm basing it on human nature, Bill. Also piñatas. There's a piñata at two-thirty. That's our window." "Why the piñata?" "Because when that thing bursts, it's every man for himself. Kids hit the floor like it's Black Friday. Parents are filming. Grandma's in tears. Nobody's watching the gift table. We grab, we go." Bill chewed his pancake slowly. "And if we get caught?" "By who? A six-year-old? What's she gonna do, sue us?" "The dad, Molly. The dragon-lair dad." "He's a dentist." "Dentists are strong, Molly. They wrestle molars." Molly sighed and grabbed his hand across the table. "Bill. Listen. We do this one job, we're set till Christmas. Then we hit a bar mitzvah." "A bar mitzvah?" "That's where the real money is. Envelopes, Bill. Cash envelopes." Bill looked down at the crayon blueprint, at the unicorn-satellite, at the dragon-lair, at the smiling stick-figure family holding balloons. He put on the wig. "Fine. But I'm not doing the card trick." "Thank you." "And if there's a pony, we're taking the pony." "Deal."
created_at
"2026-04-24T13:29:30.691Z"
4/24/2026, 1:29:30 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
852
moderation_latency
(null)
generation_time
15967
tokens_prompt
77
tokens_completion
788
native_tokens_prompt
98
native_tokens_completion
1138
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.02894
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"c45d1462-52e6-4901-bec9-07324a2e7824"
is_byok
false
latency
852
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777037370-1dyyCJEa0g5xLysggxG4"
session_id
(null)
api_type
"completions"
id
"gen-1777037370-JjLh5aElIiiYeWX3zh0u"
upstream_id
"c45d1462-52e6-4901-bec9-07324a2e7824"
total_cost
0.02894
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
14.30%
Matches word count
n/a
neededClean
false
words
479
3.57%
Dialogue to Total Word Ratio
Ratio: 78.16%, Deviation: 8.16%
neededClean
false
wordsTotal
490
wordsDialogue
383
8.9369%