Run: 2026-03-11-01KKFF797296PP5MHXJDEK1QZ5

Run Details

Duration: 5.0s
Cost: $0.000000
Provider: Stealth

Evaluation summary

Score      Evaluator
98.71%     Matches word count
24.71%     Dialogue to Total Word Ratio
61.7101%   Overall
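
The report does not state how the Overall figure is aggregated. The displayed numbers are consistent with an unweighted mean of the two evaluator scores, so the sketch below checks that reading. The aggregation rule is an assumption, and the names `scores` and `overall` are illustrative only.

```python
# Assumption: Overall is the unweighted mean of the evaluator scores.
# The report itself does not document the aggregation rule.
scores = {
    "Matches word count": 98.71,
    "Dialogue to Total Word Ratio": 24.71,
}

overall = sum(scores.values()) / len(scores)
print(f"{overall:.2f}%")  # prints 61.71%
```

The reported value is 61.7101%, which differs from this result in the fourth decimal place. That would be expected if the two displayed scores are themselves rounded to two decimals.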

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 90% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly spread the blueprint across the table and tapped it with her fingernail. "Security's tight. Balloon animals at every entrance." Bill squinted. "Are they armed?" "They're clowns, Bill. What do you think?" "I think we're underestimating clowns." He pulled out a tiny notebook. "Okay, what's the target?" "Table four. Near the bathroom. It's a mountain of presents, and nobody's watching it because they're all distracted by the magician." "Kid magician or adult magician?" "Makes a difference?" "Enormous difference. Adults know slight of hand. Kids just shove rabbits into hats and hope." Molly pinched the bridge of her nose. "Fine. While the magician's doing his thing, we move in. You distract the clown, I grab the bags." "What's my distraction?" "Sing 'Happy Birthday.' Loudly. Off key." Bill stood up with dramatic confidence. "That I can do. I once brought a karaoke machine to a funeral." "Exactly why you're perfect for this." Molly rolled up the blueprint. "We go at two. After cake, before the piñata. Everyone goes feral during the piñata." "So we just walk out carrying two dozen wrapped boxes past forty screaming children?" "Now you're getting it." Bill cracked his knuckles. "This is the worst plan we've ever had." "And we once stole a canoe."
created_at: "2026-03-11T22:10:35.376Z" (3/11/2026, 10:10:35 PM)
model: "openrouter/healer-alpha"
app_id: 182717
external_user: (null)
streamed: true
cancelled: false
latency: 589
moderation_latency: (null)
generation_time: 4798
tokens_prompt: 77
tokens_completion: 432
native_tokens_prompt: 2147
native_tokens_completion: 377
native_tokens_completion_images: (null)
native_tokens_reasoning: 86
native_tokens_cached: 2145
num_media_prompt: (null)
num_input_audio_prompt: (null)
num_media_completion: 0
num_search_results: (null)
origin: "https://ncbench.com/"
is_byok: false
finish_reason: "stop"
native_finish_reason: "stop"
usage: 0
router: (null)
provider_responses: (null)
user_agent: "langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer: (null)
api_type: "completions"
id: "gen-1773267035-KFNYA6JmCWuUEkfR2ddt"
upstream_id: "gen-1773267035-KFNYA6JmCWuUEkfR2ddt"
total_cost: 0
cache_discount: (null)
upstream_inference_cost: 0
provider_name: "Stealth"

Evaluation details

Result     Evaluator                      Details                             Meta Data
98.71%     Matches word count             n/a                                 neededClean: false, words: 206
24.71%     Dialogue to Total Word Ratio   Ratio: 79.13%, Deviation: 10.87%    neededClean: false, wordsTotal: 206, wordsDialogue: 163
61.7101%   Overall
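
The Ratio and Deviation figures can be reproduced from the reported word counts, taking the 90% target from the prompt. The sketch below does that arithmetic and adds a hypothetical helper, `dialogue_ratio`, showing one plausible way such counts could be obtained: treating text inside straight double quotes as dialogue. The evaluator's actual counting method is not shown in the report, so the helper is an assumption and may not yield 163 of 206 on the response above.

```python
import re

TARGET_DIALOGUE_PCT = 90.0  # from the prompt: "90% words of dialogue"

# Counts as reported in the Meta Data column.
words_total = 206
words_dialogue = 163

ratio_pct = 100.0 * words_dialogue / words_total
deviation_pct = abs(TARGET_DIALOGUE_PCT - ratio_pct)
print(f"Ratio: {ratio_pct:.2f}%, Deviation: {deviation_pct:.2f}%")
# prints: Ratio: 79.13%, Deviation: 10.87%


def dialogue_ratio(text: str) -> tuple[int, int]:
    """Hypothetical counter: return (dialogue_words, total_words).

    Assumes dialogue is whatever sits between straight double quotes
    and that words are whitespace-separated tokens.
    """
    total = len(text.split())
    quoted = re.findall(r'"([^"]*)"', text)
    dialogue = sum(len(span.split()) for span in quoted)
    return dialogue, total


sample = 'Bill squinted. "Are they armed?"'
print(dialogue_ratio(sample))  # prints (3, 5)
```

The step from a 10.87% deviation to the 24.71% score, and from 206 words against a 200-word target to the 98.71% score, is not documented in the report and is not reproduced here.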