Run: 2026-03-11-01KKFGVB8Q6EZ8ZRE2C1E5RGRS

Run Details

Duration:
4.2s
Cost:
$0.000000
Provider:
Stealth

Evaluation summary

Score Evaluator
Details Link
93.65% Matches word count (details)
0.00% Dialogue to Total Word Ratio (details)
46.8248% Overall

Model input

User/Human
You are a author of comedic heist books. Please write a 200 word scene that has 10% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.

Model response

AI
The smell of bubble solution and synthetic strawberry hung heavy in the air as Molly and Bill crouched behind a life-sized cardboard unicorn. Rainbows and teddy bears covered every surface, a pastel fortress of joy. Bill adjusted his balaclava, which had a small hole for a strand of his perpetually messy hair. “It’s worse than we thought,” Molly whispered, gesturing at a fortress of wrapped boxes beneath a shimmering “Happy Birthday!” banner. “They’re all here. The mothers. The tiny, sticky overlords.” “We need the Element of Surprise,” Bill murmured, eyes gleaming. “And the Element of Distraction.” “Right. You cause the distraction. I’ll handle the extraction.” “How?” Molly reached into her tactical pouch and pulled out a small, plastic dinosaur. “I’ll roll this into the middle of the room. They’re genetically programmed to chase it.” Bill grinned, a slow, dangerous curve. “You’re a genius.” “Just focus. Don’t get distracted by the cake.” They moved like shadows through a storm of glitter. Bill sidled toward the stereo system, his fingers dancing over the controls, while Molly slithered toward the presents, a silent panther in tactical yoga pants. Just as Bill’s finger hovered over the playlist, a chorus of high-pitched voices shrieked, “THE DINO IS ESCAPING!” Molly froze. So much for stealth.
created_at"2026-03-11T22:39:01.412Z" 3/11/2026, 10:39:01 PM
model"openrouter/healer-alpha"
app_id182717
external_user(null)
streamedtrue
cancelledfalse
latency567
moderation_latency(null)
generation_time4049
tokens_prompt77
tokens_completion555
native_tokens_prompt2147
native_tokens_completion450
native_tokens_completion_images(null)
native_tokens_reasoning188
native_tokens_cached2145
num_media_prompt(null)
num_input_audio_prompt(null)
num_media_completion0
num_search_results(null)
origin"https://ncbench.com/"
is_byokfalse
finish_reason"stop"
native_finish_reason"stop"
usage0
router(null)
provider_responses(null)
user_agent"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer(null)
api_type"completions"
id"gen-1773268741-0d1icLyHSK7xWxGNAFuD"
upstream_id"gen-1773268741-0d1icLyHSK7xWxGNAFuD"
total_cost0
cache_discount(null)
upstream_inference_cost0
provider_name"Stealth"

Evaluation details

Result Evaluator Details Meta Data
93.65% Matches word count n/a
neededCleanfalse
words209
0.00% Dialogue to Total Word Ratio
Ratio: 31.75%, Deviation: 21.75%
neededCleanfalse
wordsTotal211
wordsDialogue67
46.8248%