Run: 2026-02-24-01KJ8WNY07MAH6HZ8WB01MCSAM

Run Details

Duration:
10.6s
Cost:
$0.010010
Provider:
Amazon Bedrock

Evaluation summary

Score Evaluator
Details Link
99.92% Matches word count (details)
0.14% Dialogue to Total Word Ratio (details)
50.0285% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 50% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly spread the hand-drawn map across the dashboard, which appeared to be crayon on construction paper. She had stolen it from the birthday boy's cubby at preschool, and it smelled faintly of juice boxes. "The bounce house is here," she said, tapping a purple blob. "That's our primary obstacle. Kids go feral in those things." Bill squinted at the map and turned it upside down. He was wearing all black, which seemed excessive for a Tuesday afternoon in the suburbs. "What about the clown?" he asked. "The clown is a wildcard. We avoid the clown at all costs." Bill nodded slowly, cracking his knuckles with the gravity of a man about to defuse a bomb rather than rob a six-year-old. "I'll create a distraction near the cake table," he said. "You grab the presents from the folding table by the fence." "And if the mother spots us?" "We tell her we're from the balloon company." Molly pulled two helium tanks from the backseat, grinning with the pride of someone who had thought of everything. The party started in twenty minutes. Somewhere inside that backyard sat a mountain of wrapped gifts just begging to be liberated. "Let's go steal from some children," she whispered.
created_at"2026-02-24T22:35:24.047Z" 2/24/2026, 10:35:24 PM
model"anthropic/claude-4.6-opus-20260205"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1466
moderation_latency171
generation_time10350
tokens_prompt77
tokens_completion382
native_tokens_prompt92
native_tokens_completion382
native_tokens_completion_images(null)
native_tokens_reasoning68
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.01001
router(null)
provider_responses
0
endpoint_id"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
id"905f58dc-56fb-44fa-ab23-703273a8ee3b"
is_byokfalse
latency1425
model_permaslug"anthropic/claude-4.6-opus-20260205"
provider_name"Amazon Bedrock"
status200
api_type"completions"
id"gen-1771972524-UngVoDw6RULPaUyytE5i"
upstream_id"905f58dc-56fb-44fa-ab23-703273a8ee3b"
total_cost0.01001
cache_discount(null)
upstream_inference_cost0
provider_name"Amazon Bedrock"

Evaluation details

Result Evaluator Details Meta Data
99.92% Matches word count n/a
neededCleanfalse
words203
0.14% Dialogue to Total Word Ratio
Ratio: 33.98%, Deviation: 16.02%
neededCleanfalse
wordsTotal206
wordsDialogue70
50.0285%