NC Bench
Overview
About
Tests
Models
Vendors
Compare
Run: 2026-02-24-01KJ8TZTH8BSEKVB3PZY6ANWBB
Run Details
Model:
Claude Sonnet 4.6 (Reasoning)
Test:
Dialogue tags
Scenario:
Write 500 words with 30% dialogue
Duration:
2m 20s
Cost:
$0.207441
Provider:
Amazon Bedrock
Evaluation summary
Score
Evaluator
Details Link
35.00%
Matches word count
(details)
48.31%
Dialogue to Total Word Ratio
(details)
41.6570%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 30% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprints across the hood of their battered Ford Fiesta, using a juice box and a half-eaten granola bar to hold down the corners. The blueprints were technically a crayon drawing lifted from a birthday party invitation, but she had augmented it with arrows and the word INGRESS in red marker, which gave the whole operation a professional dignity it sorely needed. Bill stared at the drawing. "That's just a bouncy castle," he said. "That's the target. The gift table sits immediately to the left of the bouncy castle, which puts our extraction point right here." She tapped the paper with two fingers. "Simple." "Nothing about this is simple, Molly." "Small children. Distracted parents. Cake. It is extremely simple." Bill tugged at his black turtleneck, already soaked through because it was eleven in the morning and seventy-eight degrees outside. He had worn the turtleneck because Molly had insisted on full tactical dress, and he had not paused to ask what a children's birthday party required in terms of tactical dress. He was beginning to understand that the answer was not this. The party was underway three houses down. They could hear the aggressive synthetic joy of a pop song built entirely for children who had not yet developed taste, and beneath that, the relentless percussion of small humans running on hardwood floors. A drooping banner between two oak trees read HAPPY 7TH BIRTHDAY COOPER. "We move in during the candles," Molly said. "Every adult in that yard will be looking at Cooper." "Who even is Cooper?" "Irrelevant to the mission." Bill picked up the juice box from the corner of the blueprints. It was grape. He took a long pull from the straw and set it back down with the careful deliberateness of a man who needed something to do with his hands. "What if someone asks what we're doing?" he said. "We say we are the cleanup crew." "People believe that?" "Parents at children's parties will believe anything that implies another adult is handling something. They are running entirely on caffeine and quiet desperation." Bill realized, with some discomfort, that this was completely accurate. "I still feel like this crosses a line," he said. "Every heist crosses a line, Bill. That is what makes it a heist." He gazed down the street at the yard full of balloons and streamers and small people who had done absolutely nothing to deserve what was about to happen to their gift pile. He straightened his turtleneck. He finished the juice box. "Cooper is going to be devastated," he said. "Cooper will recover. Children are resilient. Also, someone in there almost certainly bought a gift card, and gift cards are non-transferable, so Cooper is keeping something." She folded the blueprints and tucked them under her arm. "Ready?" Bill looked at her for a long moment. "Absolutely not," he said. Molly was already walking.
created_at
"2026-02-24T22:05:51.026Z"
2/24/2026, 10:05:51 PM
model
"anthropic/claude-4.6-sonnet-20260217"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
783
moderation_latency
142
generation_time
139633
tokens_prompt
77
tokens_completion
1838
native_tokens_prompt
92
native_tokens_completion
13811
native_tokens_completion_images
(null)
native_tokens_reasoning
1078
native_tokens_cached
0
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0.207441
router
(null)
provider_responses
0
endpoint_id
"83d802cb-44f2-45a0-a9d5-cb9bf79777c8"
id
"b0cfffc3-7d22-4236-b436-8c10097fccdf"
is_byok
false
latency
783
model_permaslug
"anthropic/claude-4.6-sonnet-20260217"
provider_name
"Amazon Bedrock"
status
200
api_type
"completions"
id
"gen-1771970751-UJtNf6AboY8xDGT9gUZv"
upstream_id
"b0cfffc3-7d22-4236-b436-8c10097fccdf"
total_cost
0.207441
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Amazon Bedrock"
Evaluation details
Result
Evaluator
Details
Meta Data
35.00%
Matches word count
n/a
neededClean
false
words
482
48.31%
Dialogue to Total Word Ratio
Ratio: 33.81%, Deviation: 3.81%
neededClean
false
wordsTotal
485
wordsDialogue
164
41.6570%