Run: 2026-02-08-01KGYXGMP5YXFTXM22BD02XP6J

Run Details

Duration:
9.0s
Cost:
$0.008100
Provider:
Amazon Bedrock

Evaluation summary

Score Evaluator
Details Link
60.28% Matches word count (details)
97.87% Dialogue to Total Word Ratio (details)
79.0734% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly spread the crayon-drawn map across the dashboard and tapped the big pink house. "The party starts at two. That gives us a forty-minute window while they're doing the piñata in the backyard." "I don't like piñatas," Bill said. "One hit me in the face once." "Focus, Bill." "I'm just saying. Papier-mâché is deceptively heavy." "The presents will be stacked on a folding table in the living room. We go in through the side door, load the bags, and we're out before they sing Happy Birthday." "What if a kid sees us?" "We're wearing the clown costumes." "I'm not wearing the clown costume, Molly." "You're wearing the clown costume." "Last time I wore that thing, a toddler bit me." "You'll survive. Now, the mother ordered a bouncy castle, which means every adult will be distracted watching kids bounce into each other. It's the perfect cover." "What kind of presents are we talking here?" "LEGOs, Bill. Mountains of LEGOs." "LEGOs hurt when you step on them." "Everything hurts you, Bill. Are you in or not?" Bill sighed and picked up the red foam nose. "I'm in."
created_at"2026-02-08T15:22:02.790625+00:00" 2/8/2026, 3:22:02 PM
model"anthropic/claude-4.6-opus-20260205"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency1261
moderation_latency253
generation_time8670
tokens_prompt61
tokens_completion254
native_tokens_prompt75
native_tokens_completion309
native_tokens_completion_images(null)
native_tokens_reasoning0
native_tokens_cached0
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0.0081
router(null)
provider_responses
0
id"ba275558-d95d-473d-8d1c-cbb4651f137b"
status200
is_byokfalse
latency1261
endpoint_id"a8708fb9-08ea-43bb-a3b1-165c87ef09df"
provider_name"Amazon Bedrock"
model_permaslug"anthropic/claude-4.6-opus-20260205"
api_type"completions"
id"gen-1770564113-tyAGiBlaLafrphsiwDNB"
upstream_id"ba275558-d95d-473d-8d1c-cbb4651f137b"
total_cost0.0081
cache_discount(null)
upstream_inference_cost0
provider_name"Amazon Bedrock"

Evaluation details

Result Evaluator Details Meta Data
60.28% Matches word count n/a
neededCleanfalse
words185
97.87% Dialogue to Total Word Ratio
Ratio: 86.17%, Deviation: 3.83%
neededCleanfalse
wordsTotal188
wordsDialogue162
79.0734%