Run: 2026-02-24-01KJ8WQQBFPB79BAP24KYD0Q8M

Run Details

Duration: 9.7s
Cost: $0.006321
Provider: Anthropic

Evaluation summary

Score     Evaluator
43.38%    Matches word count
100.00%   Dialogue to Total Word Ratio
71.6892%  Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
"The bouncy castle is our biggest problem," said Molly. "Why?" "Because you'll want to go in it." "I won't want to go in it." "Bill, you went in the one at the Henderson job." "That was recon." "You were in there for forty minutes." "It was a very good bouncy castle." Molly unrolled the hand-drawn map across the hood of their van. "Focus. The presents are stacked behind the gift table, here. The clown is stationed near the entrance, here. And the birthday kid patrols the perimeter like a tiny angry general." "How old is this kid?" "Six." "Terrifying." "Exactly. Now, I'll create a distraction by loosening the string on the helium balloons. Every child within twenty feet will lose their mind completely." "Genius." "While that happens, you load the presents into the bags and we're out before anyone sings happy birthday." "What if the clown spots us?" "Nobody looks directly at a clown, Bill. It's a human instinct." "What about the cake?" Molly stared at him. "We are not stealing the cake." "I'm just saying it looked really good in the photos."
created_at: "2026-02-24T22:36:22.776Z" (2/24/2026, 10:36:22 PM)
model: "anthropic/claude-4.6-sonnet-20260217"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 779
moderation_latency: (null)
generation_time: 9688
tokens_prompt: 77
tokens_completion: 387
native_tokens_prompt: 92
native_tokens_completion: 403
native_tokens_completion_images: (null)
native_tokens_reasoning: 90
native_tokens_cached: 0
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0.006321
router: (null)
provider_responses:
  0:
    endpoint_id: "6d408764-7dd7-4626-bb87-a6cc1589bc86"
    id: "msg_01EUwVjhHTR7rosT9nGb1Rdw"
    is_byok: false
    latency: 776
    model_permaslug: "anthropic/claude-4.6-sonnet-20260217"
    provider_name: "Anthropic"
    status: 200
api_type: "completions"
id: "gen-1771972582-jSG8phfwRmWw2316kTXh"
upstream_id: "msg_01EUwVjhHTR7rosT9nGb1Rdw"
total_cost: 0.006321
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Anthropic"

Evaluation details

Result    Evaluator                     Details                          Meta Data
43.38%    Matches word count            n/a                              neededClean: false, words: 183
100.00%   Dialogue to Total Word Ratio  Ratio: 90.22%, Deviation: 0.22%  neededClean: false, wordsTotal: 184, wordsDialogue: 166
71.6892%  Overall
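
The reported ratio and deviation can be reproduced from the word counts in the meta data. The sketch below is illustrative only: `dialogue_ratio` is a hypothetical helper that treats any text inside straight double quotes as dialogue, which may not match how the evaluator actually tokenizes or cleans the response. The last line assumes the overall score is the unweighted mean of the two evaluator scores; that is consistent with the figures shown but is not stated anywhere in the run.

```python
import re


def dialogue_ratio(text: str) -> tuple[int, int, float]:
    """Hypothetical helper: count words inside straight double quotes vs. all words."""
    total = len(text.split())
    # Each match is the text between one pair of straight double quotes.
    quoted = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted)
    ratio = dialogue / total if total else 0.0
    return dialogue, total, ratio


# Counts reported in the evaluation details (wordsDialogue, wordsTotal).
reported_ratio = 166 / 184 * 100        # dialogue share as a percentage
deviation = abs(reported_ratio - 90.0)  # distance from the 90% requested in the prompt

# Assumption: overall score is the plain average of the two evaluator scores.
overall = (43.38 + 100.00) / 2

print(f"ratio={reported_ratio:.2f}% deviation={deviation:.2f}% overall={overall:.2f}%")
```

Running the helper on the model response itself may give slightly different counts than the evaluator's 166 and 184, since the evaluator's word-splitting rules are not documented in this run.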