NC Bench
Overview
About
Tests
Categories
Models
Vendors
Compare
Run: 2026-03-11-01KKFC8RD4RAFS9VT40P0T1GAF
Run Details
Model:
Stealth: Healer Alpha
Test:
Dialogue tags
Scenario:
Write 500 words with 70% dialogue
Duration:
9.3s
Cost:
$0.000000
Provider:
Stealth
Evaluation summary
Score
Evaluator
Details Link
99.98%
Matches word count
(details)
0.38%
Dialogue to Total Word Ratio
(details)
50.1834%
Overall
Model input
User/Human
You are a author of comedic heist books. Please write a 500 word scene that has 70% words of dialogue between two thieves named Molly and Bill who are planning to break into a children's party to steal all the presents. Only include the prose of the scene without any markdown or commentary.
Model response
Messages
Meta
AI
Molly spread the blueprint across the kitchen table, her finger tracing the perimeter of the Johnson’s backyard. “The drop zone is here, behind the inflatable castle. You’ll scale the fence on the eastern side, near the petunias.” Bill squinted at the drawing. “Petunias? Molly, we’re professionals. We don’t plan heists around horticulture.” “The petunias are crucial, Bill. They’re Mrs. Johnson’s pride and joy. If you trample them, she’ll call the cops before we even touch the loot.” Molly tapped a cluster of circles on the blueprint. “These are the targets. The gifts. I count fifteen, maybe twenty.” Bill leaned back, crossing his arms. “Twenty. For a seven-year-old’s birthday party. That’s… excessive.” “It’s a goldmine. And not just any goldmine. It’s a goldmine of untraceable, in-the-box joy. No serial numbers on a brand-new bicycle, Bill. No fingerprints on a sealed video game console.” “Fine. But the security. What’s the threat assessment?” Bill asked, rubbing his chin. “Two layers. First, the parents. They’ll be in the kitchen, drinking cheap prosecco and complaining about the cost of sheet cake. They are predictable. Low threat. The second layer,” Molly said, lowering her voice, “is the chaos layer. The children.” “Ah. The tiny, sticky agents of disorder.” “Exactly. They’ll be hopped up on sugar, careening between the bouncy castle and the craft table. Our window is during the ceremonial singing of ‘Happy Birthday.’ They’ll be distracted by the sheer, radiant force of Marcus’s candle-smothering breath.” “And our escape?” “The slide. We take the gifts, load them into the laundry baskets we’ve stashed, and exit via the playground slide. It’s a clean descent into the alleyway where the van will be idling.” Bill stood up, pacing. “I have concerns. One: laundry baskets? Really? Two: the slide? It’s designed for a three-foot drop, not a fifty-thousand-dollar haul of plastic.” “The baskets are inconspicuous! The slide is our only elevated exit point! Do you have a better idea, Bill? Shall we just walk out the front door and ask for gift receipts?” “I’m just saying, it feels a bit… domestic.” Bill sighed, running a hand through his hair. “Can’t we steal something with a little more gravitas? Art? A Fabergé egg?” “Those come with alarms, Bill. And guards. This comes with a dog named Biscuit and a perimeter secured by a garden gnome.” Molly softened her tone. “Look, the money’s good. The risk is minimal. And we get to ruin a seven-year-old’s birthday. What’s not to love?” Bill was quiet for a moment, staring at the blueprint. “What if… what if there are no good gifts? What if it’s all, like, educational workbooks and artisanal wooden puzzles?” “Then we take them and we’re grateful,” Molly said firmly. “A heist is a heist.” “Fine.” Bill cracked his knuckles. “I’ll be in the van. You handle the gnome.” Molly grinned, rolling up the blueprint. “That’s the spirit. And Bill? Don’t touch the petunias.” “No promises,” he muttered, already calculating the velocity of a laundry basket full of Legos.
created_at
"2026-03-11T21:18:57.968Z"
3/11/2026, 9:18:57 PM
model
"openrouter/healer-alpha"
app_id
182717
external_user
(null)
streamed
true
cancelled
false
latency
951
moderation_latency
(null)
generation_time
9179
tokens_prompt
77
tokens_completion
1122
native_tokens_prompt
2187
native_tokens_completion
929
native_tokens_completion_images
(null)
native_tokens_reasoning
263
native_tokens_cached
2132
num_media_prompt
(null)
num_input_audio_prompt
(null)
num_media_completion
0
num_search_results
(null)
origin
"https://ncbench.com/"
is_byok
false
finish_reason
"stop"
native_finish_reason
"stop"
usage
0
router
(null)
provider_responses
(null)
user_agent
"langchainjs-openai/1.0.0 ((node/v24.14.0; linux; x64))"
http_referer
(null)
api_type
"completions"
id
"gen-1773263937-ZUvETFItX4GkR2vpDH6y"
upstream_id
"gen-1773263937-ZUvETFItX4GkR2vpDH6y"
total_cost
0
cache_discount
(null)
upstream_inference_cost
0
provider_name
"Stealth"
Evaluation details
Result
Evaluator
Details
Meta Data
99.98%
Matches word count
n/a
neededClean
false
words
498
0.38%
Dialogue to Total Word Ratio
Ratio: 80.55%, Deviation: 10.55%
neededClean
false
wordsTotal
509
wordsDialogue
410
50.1834%