NC Bench
Run: 2026-04-24-01KPZTXP92P4EHZA78HNV2GEA4
Run Details
Model: Claude Opus 4.7
Test: Dialogue tags
Scenario: Write 500 words with 30% dialogue
Duration: 15.6s
Cost: $0.027465
Provider: Amazon Bedrock
Evaluation summary
Score | Evaluator
99.38% | Matches word count
0.00% | Dialogue to Total Word Ratio
49.6885% | Overall
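The Overall figure is consistent with an unweighted mean of the two evaluator scores. The page does not state the aggregation rule, so the averaging below is an assumption, and the name `scores` is purely illustrative. A minimal sketch:

```python
# Evaluator scores as displayed in the summary (percent).
# Assumption: Overall is the plain mean of the evaluator scores.
scores = {
    "Matches word count": 99.38,
    "Dialogue to Total Word Ratio": 0.00,
}

overall = sum(scores.values()) / len(scores)
print(f"{overall:.2f}%")  # 49.69%
```

The displayed 49.6885% differs from this result only in the third decimal place, which suggests the word-count score is rounded for display (99.38%) and averaged at higher precision.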
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly unrolled the blueprint across the sticky diner table, pinning the corners down with a ketchup bottle, a sugar dispenser, and two half-eaten pancakes. Bill leaned over, squinting at the crayon-drawn map like it was the Rosetta Stone. "Why is the living room shaped like a kidney bean?" he asked. "Because I drew it from memory, and I had a kidney bean for lunch. Focus." The diner hummed around them. A waitress refilled their coffee without asking, which Molly took as a sign from the universe that their plan was blessed. Bill tapped the map with a greasy finger. "Okay, so. The target is a seven-year-old named Tyler. Theme is dinosaurs. Forty guests, give or take a screaming cousin. Presents get piled in the corner of the den at approximately two-fifteen, right after cake." "How do you know all this?" "I'm on the evite list." Molly looked up slowly. "You're what?" "I may have infiltrated the mom chat." "Bill. Bill, look at me. You joined a mom chat." "Under the name Cheryl. Cheryl brings a surprisingly good seven-layer dip, and nobody asks questions about Cheryl." Molly rubbed her temples, which had started to throb in the specific rhythm that only Bill's decisions could produce. Still, she had to admit, intel was intel. She circled the den on the map with a red pen. "Entry point is the sliding glass door off the patio. The dad always forgets to lock it because he's too busy manning the grill and explaining his smoker to men who don't care." "What about the clown?" "What clown?" "There's a clown. Cheryl sent condolences in the group chat." "Of course there's a clown." Molly exhaled through her teeth. "Fine. The clown is a variable. We neutralize him with a balloon animal request so complicated it takes his whole soul to complete. I'm thinking a giraffe riding a motorcycle." "Diabolical." "I have my moments." Bill pulled a gym bag from under the booth and unzipped it just enough to show a neatly folded pile of T-Rex costumes. Molly stared. 
"Camouflage," he whispered. "Nobody looks twice at a dinosaur at a dinosaur party." "Bill, those costumes have tiny arms. How are we supposed to carry the presents?" There was a long silence. Bill's face did a slow, tragic journey from pride to devastation. "I didn't think about the arms," he said quietly. "You didn't think about the arms." "The arms didn't occur to me, Molly." She took a long, slow sip of coffee, the kind of sip that bought a woman time to reconsider her entire career. Outside, a minivan honked. A child wailed somewhere in the parking lot, possibly in celebration, possibly in protest. Molly set down her mug. "We go as parents," she said finally. "Khakis. Sensible polos. A folding chair under one arm. Nobody questions a tired-looking adult at a kid's party. We'll walk out with the loot and everyone will assume we're Tyler's uncles." Bill grinned. "Cheryl would approve."
created_at: "2026-04-24T13:30:00.366Z" (4/24/2026, 1:30:00 PM)
model: "anthropic/claude-4.7-opus-20260416"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 724
moderation_latency: (null)
generation_time: 15574
tokens_prompt: 77
tokens_completion: 768
native_tokens_prompt: 98
native_tokens_completion: 1079
native_tokens_completion_images: (null)
native_tokens_reasoning: 0
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
num_fetches: (null)
web_search_engine: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "end_turn"
usage: 0.027465
router: (null)
provider_responses:
  0:
    endpoint_id: "0fa65117-00d9-4835-808d-9b68041d9ce4"
    id: "7e60f744-70ca-454c-a3f2-835dbe3499c9"
    is_byok: false
    latency: 724
    model_permaslug: "anthropic/claude-4.7-opus-20260416"
    provider_name: "Amazon Bedrock"
    status: 200
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.1; linux; x64))"
http_referer: (null)
request_id: "req-1777037400-7hZNGqgPmvJ5DbE2xkJz"
session_id: (null)
api_type: "completions"
id: "gen-1777037400-ikmxZxNfDmSDBwylnupf"
upstream_id: "7e60f744-70ca-454c-a3f2-835dbe3499c9"
total_cost: 0.027465
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Amazon Bedrock"
response_cache_source_id: (null)
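The metadata fields can be cross-checked against the run details with a little arithmetic. The sketch below assumes `generation_time` and `latency` are in milliseconds, which the page does not state but which matches the reported 15.6s duration. The per-token rates are hypothetical values chosen only because they reproduce `total_cost`; they do not appear anywhere in the source.

```python
# Values copied from the metadata above.
generation_time_ms = 15574        # assumed unit: milliseconds
native_tokens_prompt = 98
native_tokens_completion = 1079
total_cost_usd = 0.027465

# Duration and output throughput.
duration_s = generation_time_ms / 1000
tokens_per_second = native_tokens_completion / duration_s
print(f"duration: {duration_s:.1f}s, throughput: {tokens_per_second:.1f} tok/s")

# Hypothetical rates in USD per million native tokens (an assumption,
# not taken from the page). Integer math avoids float rounding.
PROMPT_RATE = 5
COMPLETION_RATE = 25
cost_micro_usd = (native_tokens_prompt * PROMPT_RATE
                  + native_tokens_completion * COMPLETION_RATE)
print(f"estimated cost: ${cost_micro_usd / 1_000_000:.6f}")
```

Under those assumed rates the estimate equals the reported `total_cost` of 0.027465, and it is computed from the native token counts (98 and 1079), not from `tokens_prompt` and `tokens_completion` (77 and 768).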
Evaluation details
Result | Evaluator | Details | Meta Data
99.38% | Matches word count | n/a | neededClean: false, words: 495
0.00% | Dialogue to Total Word Ratio | Ratio: 56.37%, Deviation: 26.37% | neededClean: false, wordsTotal: 502, wordsDialogue: 283
49.6885% | Overall