NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-04-24-01KQ0GA5JFB9GE1JZRDKBK8XGX
Run Details
Model:
Claude Opus 4.7 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
16.1s
Cost:
$0.029340
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
1.04%
Matches word count
(details)
0.08%
Dialogue to Total Word Ratio
(details)
0.5595%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the kitchen table and weighed down the corners with half-empty coffee mugs. Bill leaned over, squinting, a jelly donut hovering dangerously close to the paper. "Watch the jelly," Molly said. "I'm watching it." "You're not watching it. It's about to drip on the bouncy castle." Bill took a defensive bite. "Okay. Walk me through it." "Right. The party starts at two. Birthday boy is one Tyler Bernstein, turning seven. Theme is dinosaurs." "Dinosaurs are hot right now." "Bill." "I'm just saying. Kid's got taste." Molly tapped the blueprint. "Backyard here. Gift table here, under the gazebo. Our window is when they bring out the cake. All eyes on candles, nobody's watching the loot." "How many gifts we talking?" "RSVP list says twenty-two kids. Average party gift runs forty bucks, but this is the Bernsteins, so figure sixty. That's thirteen hundred in merchandise, retail." Bill whistled. "For a seven-year-old." "For a seven-year-old whose dad is a dentist." "See, that's the thing that gets me. Dentist money. It's like, who's earning that? Kids with cavities. So really, we're stealing from the cavities." "That's beautiful, Bill. Write that on a card." "I'm just rationalizing." "Rationalize faster. We need a distraction." Bill chewed thoughtfully. "What if I dress as a clown?" "No." "Magician?" "No." "Mime?" "Bill, I swear on my mother, if you say mime one more time." "What about the dog?" Molly paused. "What dog?" "The Bernsteins have a golden retriever. Saw it on their Christmas card, the one we stole off their porch last December." "Gus." "Gus. Right. So we slip Gus some peanut butter, Gus goes nuts, Gus runs through the party, kids chase Gus, we grab the gifts and walk out the side gate like two respectable adults." Molly stared at him. "That's... actually not terrible." "I have ideas sometimes." "You have one idea. We've been using it for six years." "And it works." "It worked once. In Cleveland. Barely." Bill shrugged and finished the donut. "What's the getaway?" "Minivan. I rented a Sienna." "A Sienna. You got me riding in a Sienna." "Nobody suspects the Sienna, Bill. Cop sees a Sienna, cop sees a soccer mom. Cop sees your Camaro, cop sees a guy who robs children." "I robbed one child. One time." "You robbed a piñata full of twenties at a bar mitzvah." "That's not a child, that's a papier-mâché donkey." Molly rolled up the blueprint. "Two o'clock. Peanut butter. Sienna. We're in, we're out, we're at the pawn shop by four." "And the gift bags?" "What about them?" Bill looked almost shy. "Can I keep one? I like the little whistles." Molly sighed, the long sigh of a woman who had chosen her partner and would live with that choice. "Yes, Bill. You can keep a gift bag." "Thanks, Mol." "Don't mention it. Ever. To anyone."
created_at
"2026-04-24T19:43:49.331Z"
4/24/2026, 7:43:49 PM
model
"anthropic/claude-4.7-opus-20260416"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
904
moderation_latency
(null)
generation_time
15723
tokens_prompt
77
tokens_completion
784
native_tokens_prompt
98
native_tokens_completion
1154
native_tokens_completion_images
(null)
native_tokens_reasoning
0
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
num_fetches
(null)
web_search_engine
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"end_turn"
usage
0.02934
router
(null)
provider_responses
0
endpoint_id
"0fa65117-00d9-4835-808d-9b68041d9ce4"
id
"2d68b7f1-eb6b-4b49-8265-34e76f427e50"
is_byok
false
latency
904
model_permaslug
"anthropic/claude-4.7-opus-20260416"
provider_name
"Amazon Bedrock"
status
200
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer
(null)
request_id
"req-1777059829-fkXl13tu02yjFI9UjXSb"
session_id
(null)
api_type
"completions"
id
"gen-1777059829-PZ4Dn3kN9SipvLHVuH1h"
upstream_id
"2d68b7f1-eb6b-4b49-8265-34e76f427e50"
total_cost
0.02934
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
response_cache_source_id
(null)
Evaluation details
Result
Evaluator
Details
Meta Data
1.04%
Matches word count
n/a
neededClean
false
words
474
0.08%
Dialogue to Total Word Ratio
Ratio: 81.91%, Deviation: 11.91%
neededClean
false
wordsTotal
481
wordsDialogue
394
0.5595%