NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0G3SFR5FC4HD94MTWPBGY1
Run Details
Model:
Claude Opus 4.7 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
16.6s
Cost:
$0.029365
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
43.38%
Matches word count
(details)
1.66%
Dialogue to Total Word Ratio
(details)
22.5178%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the sticky diner table, anchoring the corners with ketchup bottles and a half-eaten plate of hash browns. "Okay, Bill. The target is a single-story colonial in Scarsdale. Birthday girl turns seven on Saturday. Theme is unicorns." Bill squinted at the paper. "That's not a blueprint. That's a Chuck E. Cheese placemat." "It's the spirit of a blueprint." "Molly." "Bill." "We are grown adults," he said, slowly, "planning to rob a child." "We're planning to rob a *party*. There's a distinction." "Is there?" "Legally? Probably not. Morally? Absolutely." She tapped the placemat. "Think about it. These kids get spoiled rotten. A dozen presents each? It builds character to lose a few." "You sound like my father." "Was your father a thief?" "He was an accountant, which is worse." Molly rolled her eyes and produced a spiral notebook labeled OPERATION: PIÑATA. "Here's the plan. You enter through the back gate dressed as the clown." "Absolutely not." "Bill." "I have a documented phobia." "Of clowns?" "Of *being* one. It's a dignity thing." "You stole a kidney from a hospital last April." "Yeah, and I wore scrubs. Scrubs are professional. Clowns are a cry for help." "Fine." She scratched something out. "You're the magician." "Better." "You do the magic show in the living room. Pull some scarves, saw a stuffed animal in half, whatever. Meanwhile, I slip into the dining room where they stack the gifts, and I bag them." "In what?" "A giant sack." "A *sack*, Molly? Like the Grinch?" "The Grinch had a functional model, Bill. You don't mess with a classic." He pinched the bridge of his nose. "How do we get out?" "Minivan. I rented one. It's maroon. Very mom." "And if someone notices the presents are missing?" "That's the beauty part." Molly leaned in. "Kids are stupid. I mean, sweet. But stupid. By the time they realize, we're on the Merritt Parkway eating McNuggets." "What about the parents?" "Day-drunk on chardonnay by two p.m. Trust me, I cased the mom at Pilates. She's a mess." Bill stared at her for a long moment. "You've been reconning a Pilates class." "For three weeks. My core has never looked better." "This is the saddest sentence I've ever heard." "Are you in or are you out?" He sighed the sigh of a man whose life had taken a specific and unfortunate turn. "What's the take?" "Conservative estimate? Twelve LOL Surprise Dolls, four Nintendo Switches, at least one of those singing Elsa dresses, and a check from a grandparent who doesn't understand gift etiquette." "Resale?" "Eight grand. Maybe nine if we hit a good flea market." Bill drummed his fingers on the table. "I want extra for the magician thing." "Twenty percent bump." "Thirty." "Twenty-five, and I'll buy the rabbit." "Deal." He shook her hand across the hash browns. "God help us." "God's not invited," Molly said. "It's a unicorn party."
created_at
"2026-04-24T19:40:20.347Z"
4/24/2026, 7:40:20 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
693
moderation_latency
(null)
generation_time
16590
tokens_prompt
77
tokens_completion
800
native_tokens_prompt
98
native_tokens_completion
1155
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.029365
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"09d8df38-9814-4550-a44f-3217b7b13c67"
is_byok
false
latency
693
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777059620-d9SQYRekQO4Bk4Aypl7X"
session_id
(null)
api_type
"completions"
id
"gen-1777059620-DeelJ5w4S8PBgBsuJDMm"
upstream_id
"09d8df38-9814-4550-a44f-3217b7b13c67"
total_cost
0.029365
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
43.38%
Matches word count
n/a
neededClean
false
words
483
1.66%
Dialogue to Total Word Ratio
Ratio: 79.06%, Deviation: 9.06%
neededClean
false
wordsTotal
487
wordsDialogue
385
22.5178%