Run: 2026-03-11-01KKFBS072DAT6RY77QBY5YMAW

Run Details

Duration:
10.9s
Cost:
$0.000000
Provider:
Stealth

Evaluation summary

Score Evaluator
Details Link
43.38% Matches word count (details)
0.00% Dialogue to Total Word Ratio (details)
21.6892% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
Molly adjusted the rearview mirror, not to check for police but to examine a suspicious smudge on her black turtleneck. The van smelled faintly of powdered sugar and betrayal. Bill, ever the optimist, studied the blueprint of the Sunshine Meadows Community Center with a jeweler's loupe, though the schematics were a crayon drawing he'd convinced a bribed five-year-old to make. "The entry point is the bouncy castle vent," he whispered, tapping a wobbly pink crayon line. "Classic soft target." "And the exit strategy?" Molly asked, not looking up from her smudge. "We blend with the fleeing toddlers at the first sign of cake. Chaos is our cloaking device." She finally met his gaze in the mirror. "Our getaway vehicle is an ice cream truck, Bill. It plays 'Pop Goes the Weasel' at fifteen decibels." "Exactly! Auditory camouflage. They'll never suspect the jingle is carrying six hundred dollars' worth of Lego sets." He folded the blueprint with the reverence of a holy text. "The presents are in the 'Present Cave' behind the magician. We go in during the magic show's finale—when the rabbit comes out of the hat. Maximum distraction." "Maximum insanity," Molly corrected, but she was already pulling on her gloves. The van's engine coughed to life, somehow smelling even more of betrayal and cheap frosting.
created_at"2026-03-11T21:10:21.679Z" 3/11/2026, 9:10:21 PM
model"openrouter/hunter-alpha"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency853
moderation_latency(null)
generation_time10830
tokens_prompt77
tokens_completion629
native_tokens_prompt2111
native_tokens_completion520
native_tokens_completion_images(null)
native_tokens_reasoning289
native_tokens_cached1984
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0
router(null)
provider_responses(null)
user_agent"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer(null)
api_type"completions"
id"gen-1773263421-F3REeXMljNUGZNhwkKD1"
upstream_id"gen-1773263421-F3REeXMljNUGZNhwkKD1"
total_cost0
cache_discount(null)
upstream_inference_cost0
provider_name"Stealth"

Evaluation details

Result Evaluator Details Meta Data
43.38% Matches word count n/a
neededCleanfalse
words217
0.00% Dialogue to Total Word Ratio
Ratio: 44.29%, Deviation: 34.29%
neededCleanfalse
wordsTotal219
wordsDialogue97
21.6892%